Foreword 



This is a unique book. It covers the entire theoretical and experimental content of 
particle physics. 

Particle physics sits at the forefront of our search for the ultimate structure of 
matter at the smallest scale, but in the process it has also learned to question the 
nature of our space and time in which they exist. Going hand in hand with tech- 
nological advances, particle physics now has extended its reach to studies of the 
structure and the history of the universe. It seems that both the ultimate small and 
the ultimate large are linked together. To explore one you must also explore the 
other. 

Thus particle physics covers a vast area. To master it, you usually have to read 
different books, at increasingly advanced levels, on individual subjects like its his- 
torical background, basic experimental data, quantum field theory, mathematical 
concepts, theoretical models, cosmological concepts, etc. This book covers most of 
those topics in a single volume in an integrated manner. Not only that, it shows 
you how to derive each important mathematical formula in minute detail, then 
asks you to work out many problems, with answers given at the end of the book. 
Some abstract formulas are immediately followed by their intuitive interpretation 
and experimental consequences. The same topics are often repeated at different 
levels of sophistication in different chapters as you read on, which will help deepen 
your understanding. 

All these features are quite unique to this book, and will be most helpful to stu- 
dents as well as laymen or non-experts who want to learn the subject seriously and 
enjoy it. It can serve both as a text book and as a compendium on particle physics. 
Even for practicing particle physicists and professors this will be a valuable refer- 
ence book to keep at hand. Few people like Professor Nagashima, an accomplished 
experimental physicist who is also conversant with sophisticated theoretical sub- 
jects, could have written it. 

Chicago, October 2009 Yoichiro Nambu 
Preface 



The purpose of this book is to present introductory explanations to junior gradu- 
ate students of major advances in particle physics made in the last century. It is 
also aimed at laypeople who have not received a standard education in physics but 
are willing to make an extra effort to follow mathematical logic. Therefore it is or- 
ganized to be suitable for self-study and is as self-contained as possible. This is 
why it starts with an introduction to field theory while the main purpose is to talk 
about particle physics. Quantum field theory is an essential mathematical tool for 
understanding particle physics, which has fused quantum mechanics with special 
relativity. However, phenomena that need the theory of relativity are almost com- 
pletely limited to particle physics, cosmology and parts of astrophysics, which are 
not topics of everyday life. Few students dare to take a course in special or gen- 
eral relativity. This fact, together with the mathematical complexity of field theory, 
makes particle physics hard to approach. If one tries to make a self-contained expla- 
nation of mathematical aspects of the field, hardly any space is left for the details of 
particle-physics phenomena. While good textbooks on field theories are abundant, 
very few talk about the rich phenomena in the world of elementary particle physics. 
One reason may be that the experimental progress is so rapid that any textbook that 
tries to take an in-depth approach to the experimental aspect of the field becomes 
obsolete by the time it is published. In addition, when one makes a great effort to 
find a good book, basic formulae are usually given without explanation. The au- 
thor had long felt the need for a good textbook suitable for a graduate course, well 
balanced between theory and experiment. 

Without doubt many people have been fascinated by the world that quantum 
mechanics opens up. After a student has learned the secrets of its mystery, he 
(or she) must realize that his world view has changed for life. But at what depth 
would it have been if he thought he had grasped the idea without knowledge of 
the Schrödinger equation? This is why the author feels uneasy with presentation of 
the basic formulae as given from heaven, as is often the case in many textbooks on 
experimental particle physics. 

This book is the first of two volumes. Volume I paves the way to the Standard 
Model and Volume II develops applications and discusses recent progress. 

Volume I, i.e. this book, is further divided into two parts, a field theoretical ap- 
proach and the way to the Standard Model. The aim of Part I is to equip students 
with tools so they can calculate basic formulae to analyze the experimental data 
using quantized field theories at least at the tree level. Tailored to the above pur- 
pose, the author's intention is to present a readable introduction and, hopefully, an 
intermediate step to the study of advanced field theories. 

Chapters 1 and 2 are an introduction and outline of what the Standard Model is. 
Chapter 3 picks out essential ingredients of special relativity relevant for particle 
physics and prepares for easy understanding of relativistic formulae that follow. 
Chapters 4–7, starting from the Dirac equation, field quantization, the scattering 
matrix and leading to QED (quantum electrodynamics), should be treated as one, 
closed subject to provide basic knowledge of relativistic field theories and step-by- 
step acquisition of the necessary tools for calculating various dynamic processes. 
For the treatment of higher order radiative corrections, only a brief explanation is 
given in Chapter 8, to discuss how mathematical consistency of the whole theory 
is achieved by renormalization. 

Chapter 9 deals with space-time and internal symmetry, somewhat independent 
topics that play a double role as an introduction to the subject and appendices to 
particle physics, which is discussed in Part II. 

Two chapters, 10 and 11, on the path integral are included as an addition. The 
method is very powerful if one wants to go into the formal aspects of field theory, 
including non-Abelian gauge theories, in any detail. Besides, it is the modern way 
of interpreting quantum mechanics that has many applications in various fields, 
and the author felt acquaintance with the path integral is indispensable. However, 
from a practical point of view of calculating the tree-level formulae, the traditional 
canonical quantization method is all one needs. With all the new concepts and 
mathematical preparation, the chapter stops at a point where it has rederived what 
had already been calculated before. So readers have the choice of skipping these 
chapters and coming back later when their interest has been aroused. 

By jumping to Chapter 18, which describes axioms of the Standard Model, the 
field theoretical approach can be concluded in a self-contained way, at least formal- 
ly. But it was placed at the end of Part II because the phenomenological approach 
also culminates in the formulation of the unified gauge theories. Although two al- 
ternative approaches are prepared to reach the goal, the purpose of the book is to 
present particle physics, and field theory is to be considered as a tool and treated as 
such. 

As part of the minimum necessary knowledge, a chapter was added at the end of 
Part I on basic techniques of experimental measurements, the other necessary tool 
to understand particle physics phenomena. Readers who are perplexed with the 
combination may skip this chapter, but it is the author's wish that even students 
who aim at a theoretical career should understand at least this level of experimental 
techniques. After all, the essence of natural science is to explain facts and theorists 
should understand what the data really mean. In addition, a few representative 
experiments are picked out and scattered throughout the book with appropriate 
descriptions. They were chosen for their importance in physics but also in order to 
demonstrate how modern experiments are carried out. 




In Part II, various particle phenomena are presented, the SU(3) group is intro- 
duced, basic formulae are calculated and explanations are given. The reader should 
be able to understand the quark structure of matter and basic rules on which the 
modern unified theory is constructed. Those who do not care how the formulae 
are derived may skip most of Part I and start from Part II, treating Part I as an 
appendix, although that is not what the author intended. Part II ends when the 
primary equations of the Standard Model of particle physics have been derived. 

With this goal in mind, after presenting an overview of hadron spectroscopy in 
Chapter 13 and the quark model in Chapter 14, we concentrate mainly on the elec- 
tro-weak force phenomena in Chapter 15 to probe their dynamical structure and 
explain processes that lead to a unified view of electromagnetic and weak inter- 
actions. Chapter 17 is on the hadron structure, a dynamical aspect of the quark 
model that has led to QCD. Chapter 18 presents the principles of gauge theory; its 
structure and physical interpretation are given. Understanding the axioms of the 
Standard Model is the goal of the whole book. 

Chapter 16 on CP violation in K mesons is an exception to the outline described 
above. It was included for two reasons. First, because it is an old topic, dating back 
to 1964, and second, because it is also a new topic, which in the author's personal 
perception may play a decisive role in the future course of particle physics. However, 
discussion from the viewpoint of unified theories is deferred until we discuss the 
role of B-physics in Volume II. Important though it is, its study is a side road from 
the main route of reaching the Standard Model, and this chapter should be con- 
sidered as a topic independent of the rest; one can study it when one's interest is 
aroused. 

Although the experimental data are all up-to-date, new phenomena that the Standard 
Model predicted and that were discovered after its establishment in the early 1970s 
are deferred to Volume II. The last chapter concludes what remains to be done on 
the Standard Model and introduces some intriguing ideas about the future of par- 
ticle physics. 

Volume II, which has yet to come, starts with the axioms of gauge field theo- 
ries and presents major experimental facts that the Standard Model predicted and 
whose foundations were subsequently established. These include W, Z, QCD jet 
phenomena and CP violation in B physics in both electron and hadron colliders. 
The second half of Volume II deals with issues beyond the Standard Model and recent 
developments of particle physics, the Higgs, the neutrino, grand unified theories, 
unsolved problems in the Standard Model, such as axions, and the connection with 
cosmology. 

A knowledge of quantum mechanics and the basics of special relativity is re- 
quired to read this book, but that is all that is required. One thing the author aimed 
at in writing this series is to make a reference book so that it can serve as a sort 
of encyclopedia later on. For that reason, each chapter is organized to be as self-contained 
as possible. The list of references is extensive on purpose. Also, as a result, 
some chapters include sections that are at a higher level than the reader has 
reached up to that point. An asterisk * is attached to those sections and problems. 
If the reader finds them hard, there is no point in getting stuck there; skip them and 
come back later. 

This book was based on a series of books originally written in Japanese and pub- 
lished by Asakura Shoten of Tokyo ten years ago. Since then, the author has re- 
organized and reformulated them with appropriate updating so that they can be 
published not as a translation but as new books. 

Osaka, October, 2009 Yorikiyo Nagashima 
Part One A Field Theoretical Approach 



1 

Introduction 



An Overview of the Standard Model 

1.1.1 

What is an Elementary Particle? 

Particle physics, also known as high-energy physics, is the field of natural science 
that pursues the ultimate structure of matter. This is possible in two ways. One is to 
look for elementary particles, the ultimate constituents of matter at their smallest 
scale, and the other is to clarify what interactions are acting among them to construct 
matter as we see it. The exploitable size of microscopic objects becomes 
smaller as technology develops. What was regarded as an elementary particle at 
one time is recognized as a structured object and relinquishes the title of "elemen- 
tary particle" to more fundamental particles in the next era. This process has been 
repeated many times throughout the history of science. 

In the 19th century, when modern atomic theory was established, the exploitable 
size of the microscopic object was $\sim 10^{-10}$ m and the atom was "the elementary 
particle". Then it was recognized as a structured object when J.J. Thomson extracted 
electrons in 1897 from matter in the form of cathode rays. Its real structure (the 
Rutherford model) was clarified by investigating the scattering pattern of α-particles 
striking a gold foil. In 1932, Chadwick discovered that the nucleus, the core 
of the atom, consisted of protons and neutrons. In the same year, Lawrence con- 
structed the first cyclotron. In 1934 Fermi proposed a theory of weak interactions. 
In 1935 Yukawa proposed the meson theory to explain the nuclear force acting 
among them. 

It is probably fair to say that the modern history of elementary particles began 
around this time. The protons and neutrons together with their companion pions, 
which are collectively called hadrons, were considered as elementary particles un- 
til ca. 1960. We now know that they are composed of more fundamental particles, 
the quarks. Electrons remain elementary to this day. Muons and τ-leptons, which 
were found later, are nothing but heavy electrons, as far as the present technolo- 
gy can tell, and they are collectively dubbed leptons. Quarks and leptons are the 
fundamental building blocks of matter. The microscopic size that can be explored 
by modern technology is nearing $10^{-19}$ m. The quarks and leptons are elementary 
at this level. They may or may not be at a smaller level. The popular string theory 
regards the most fundamental matter constituent not as a particle but as a string 
at the Planck scale ($\sim 10^{-35}$ m), but in this book we limit ourselves to treating only 
experimentally established facts and foreseeable extensions of their models. 

1.1.2 

The Four Fundamental Forces and Their Unification 

Long-Range Forces: Gravity and the Electromagnetic Force There are four kinds of 
fundamental forces in nature, the strong, electromagnetic, weak and gravitational 
forces. The strong force, as the name suggests, is the strongest of all; it is a thousand 
times stronger, and the weak force a thousand times weaker, than the electromagnetic 
force. The gravitational force or gravity is negligibly weak (that between 
the proton and electron is $\sim 10^{-40}$ times the electromagnetic force) at elementary 
particle levels. Its strength becomes comparable to the others only on an extremely 
small scale (Planck scale $\sim 10^{-35}$ m), or equivalently at extremely high energy 
(Planck energy $\sim 10^{19}$ GeV), not realizable in the terrestrial world. Despite its 
intrinsic weakness, gravity controls the macroscopic world. Indeed, galaxies hundreds 
of millions of light years apart are known to be mutually attracted by the gravitational 
force. The strength of the force decreases (only) inversely proportional to 
the distance squared, but it also scales as the mass, and the concentration of the 
mass more than compensates for the distance even on the cosmic scale. We refer to it 
as a long-range force. The behavior of the electromagnetic force is similar to and 
far stronger than gravity, but, in general, the positive and negative charges com- 
pensate each other. However, its long-distance effect manifests itself in the form 
of the galactic magnetic field, which spans hundreds or possibly millions of light 
years. It binds nuclei and electrons to form atoms, atoms to form molecules and 
molecules to form matter as we observe it. Namely, it is the decisive force in 
the microworld where the properties of matter are concerned. Atomic, molecu- 
lar and condensed matter physics need only consider the electromagnetic force. A 
mathematical framework, called quantum electrodynamics (QED), is used to de- 
scribe the dynamical behavior of point-like charged particles, notably electrons, the 
electromagnetic field and their interactions. Mathematically, it is a combination of 
quantized Maxwell equations and relativistic quantum mechanics. 
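
As a quick order-of-magnitude check of the Planck-scale figures quoted in this paragraph, the following sketch evaluates the standard definitions of the Planck length, $\sqrt{\hbar G/c^3}$, and the Planck energy, $\sqrt{\hbar c^5/G}$. The constant values and the function names are assumptions introduced here for illustration; they do not appear in the text.

```python
import math

# Rounded SI values of the constants (assumed here; adequate for estimates)
HBAR = 1.054571817e-34           # reduced Planck constant, J s
G = 6.67430e-11                  # Newton's gravitational constant, m^3 kg^-1 s^-2
C = 2.99792458e8                 # speed of light, m/s
JOULE_PER_GEV = 1.602176634e-10  # 1 GeV expressed in joules


def planck_length_m() -> float:
    """Planck length sqrt(hbar * G / c^3), in metres."""
    return math.sqrt(HBAR * G / C**3)


def planck_energy_gev() -> float:
    """Planck energy sqrt(hbar * c^5 / G), converted from joules to GeV."""
    return math.sqrt(HBAR * C**5 / G) / JOULE_PER_GEV


if __name__ == "__main__":
    print(f"Planck length ~ {planck_length_m():.2e} m")
    print(f"Planck energy ~ {planck_energy_gev():.2e} GeV")
```

Both results land at the orders of magnitude given above, roughly $10^{-35}$ m and $10^{19}$ GeV.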

Short-Range Forces: The Strong and Weak Forces In the ultramicroscopic world of 
the nuclei, hadrons and quarks (at scales less than $\sim 10^{-15}$ m), both the strong and 
the weak force come in. The reason why they are important only at such a small 
scale is that they are short ranged, reaching only a few hadron diameters, and accumulation 
of the mass does not help to make them stronger. They are referred to as 
short-range forces. The strong force (referred to as the color force at the most fun- 
damental level) acts between quarks to bind them to form hadrons. Historically, the 
strong force was first discovered as the nuclear force to bind protons and neutrons. 
But as they were found to be composites of the quarks, the nuclear force is recognized 
as a kind of molecular force (van der Waals force) that can be derived from 
the more fundamental color force. In 1935, Yukawa predicted the existence of the 
π meson as the carrier of the nuclear force. The idea that the force is transmitted 
by a "force carrier particle" was revolutionary and laid the foundation for present-day 
gauge theories. Later, it was clarified that the pion was a composite and cannot 
be a fundamental force carrier, but the basic idea remains valid. The weak force is 
known to act in the decay of hadrons, notably in nuclear β decays. It is also known 
to control the burning rate of the sun and to play a decisive role in the explosion 
of type II supernovae. 

The electromagnetic force, though playing an essential role at the microscopic 
level, can also act at macroscopic distances. We observe electromagnetic phenomena 
in daily life, for example electrically driven motors or the reaction of a tiny 
magnet to the earth's magnetic field. The reason is that its strength is proportional 
to the inverse square of the distance (Coulomb's law). In general, a long-range 
force decreases only as an inverse power of the distance ($\sim r^{-n}$) and can act on an 
object no matter how far away it is if there is enough of the force source. A short-range 
force, on the other hand, is a force whose strength diminishes exponentially 
with the distance. The nuclear force, for instance, behaves like $\sim e^{-r/r_0}/r^2$ 
($r_0 \sim 10^{-15}$ m) and reaches only as far as $r \lesssim r_0$. The range of the weak force is even 
shorter, of the order of $\sim 10^{-18}$ m. Because of this, despite their far greater strength 
than gravity, their effects are not observed in the macroscopic world. However, if 
one can confine enough matter within the reach of the strong force and let it re- 
act, it is possible to produce a large amount of energy. Nuclear power and nuclear 
fusion are examples of such applications. 
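
To make the contrast between the two falloff laws concrete, the sketch below compares an inverse-square form with the exponentially damped form quoted for the nuclear force. The overall normalisation is arbitrary, and the function names and the value chosen for the range parameter are assumptions made for illustration only.

```python
import math

R0_M = 1.0e-15  # assumed range parameter r0 of the nuclear force, in metres


def long_range(r: float) -> float:
    """Inverse-square falloff 1/r^2 (arbitrary overall normalisation)."""
    return 1.0 / r**2


def short_range(r: float, r0: float = R0_M) -> float:
    """Exponentially damped falloff exp(-r/r0)/r^2 (same normalisation)."""
    return math.exp(-r / r0) / r**2


def suppression(r: float, r0: float = R0_M) -> float:
    """Ratio short_range/long_range, which reduces to exp(-r/r0)."""
    return short_range(r, r0) / long_range(r)


if __name__ == "__main__":
    for multiple in (1, 5, 10, 50):
        r = multiple * R0_M
        print(f"r = {multiple:>2} r0: suppression = {suppression(r):.3e}")
```

The ratio depends only on $r/r_0$: at a few tens of $r_0$ the damped force is already many orders of magnitude below the inverse-square one, which is why no accumulation of sources makes it felt at macroscopic distances.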

When a Long-Range Force Becomes a Short-Range Force The above description em- 
phasizes the difference between the properties of the forces. Modern unified theo- 
ries tell us, however, that all the fundamental forces, despite their variety of appear- 
ances, are essentially long-range forces and can be treated within the mathematical 
framework of gauge theories. The seemingly different behavior is a result of meta- 
morphosis caused by environmental conditions. We introduce here one example of 
a long-range force becoming short range under certain circumstances. A train with 
a linear induction motor, the next generation supertrain, uses a phenomenon called 
the Meissner effect to magnetically levitate its body in the air. There, the magnet- 
ic field is completely repelled from the superconductor. This happens because, at 
very low temperatures near absolute zero, electron pairs in a special configuration 
called the "Cooper pair" collectively react to the magnetic field to form a strong eddy 
current to cancel the invading magnetic flux. The cancellation is complete except 
for a shallow depth at the surface, into which the flux can invade. In other words, 
the electromagnetic force, which is originally long range, becomes short range in 
the superconductor. This example inspired Nambu [287] to formulate an important 
concept known as "spontaneous symmetry breaking", which played a crucial role 
in forming the Standard Model. 



Higgs Mechanism The essence of modern unified theory is to recognize that "the 
vacuum in which we live is in a sort of superconducting phase". What corresponds 
to the Cooper pair in the superconductor is called the Higgs particle, unseen by 
us but thought to permeate everywhere in the universe. The weak force is long 
range originally, but because of the ultracold vacuum that surrounds us it becomes 
short range, owing to the pseudo-Meissner effect caused by the Higgs particles. 
The short-range force translates to a finite mass of the force carrier particle in 
quantum field theory, as we shall see in the following sections. The Higgs parti- 
cle condenses at the ultracold temperature and causes a phase transition of the 
vacuum, a phenomenon similar to the vapor-to-water phase transition. Elementary 
particles, which have zero mass originally, acquire mass because of the dragging 
effect caused by swimming through the condensed sea of the Higgs particle. This 
mass-generating scheme is called the Higgs mechanism. It is thought that at high 
temperature (such as at the hot Big Bang) and before the Higgs condensation the 
weak force was a long-range force, just like the electromagnetic force, and both 
were just different aspects of the same force. The relation between them is similar 
to that between magnetic and electric forces. They are unified in the sense they 
are described by a single mathematical framework, just like the electric and mag- 
netic forces are described by the Maxwell equations. After Glashow, Weinberg and 
Salam, who constructed the unified theory of the electromagnetic and the weak 
forces, it is often called GWS theory or simply the electroweak theory. 
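
The link between a finite carrier mass and a finite range can be estimated with the standard relation $r_0 \approx \hbar c/(mc^2)$, the reduced Compton wavelength of the carrier. The sketch below is an illustration under that assumption; the mass values (about 139.6 MeV for the charged pion and about 80.4 GeV for the W boson) are approximate figures supplied here, not taken from the text, and the function name is hypothetical.

```python
HBARC_MEV_FM = 197.327  # hbar * c in MeV fm (rounded)
FM_TO_M = 1.0e-15       # one femtometre in metres


def force_range_m(carrier_mass_mev: float) -> float:
    """Estimated range r0 = hbar*c / (m c^2) in metres for a carrier of given mass."""
    if carrier_mass_mev <= 0.0:
        # A massless carrier corresponds to an unlimited range (a long-range force).
        raise ValueError("range is unbounded for a massless carrier")
    return HBARC_MEV_FM / carrier_mass_mev * FM_TO_M


if __name__ == "__main__":
    # Approximate masses in MeV (assumed values, for illustration)
    for name, mass_mev in (("pion", 139.57), ("W boson", 80.4e3)):
        print(f"{name:8s}: r0 ~ {force_range_m(mass_mev):.2e} m")
```

With these inputs the estimate gives a range of order $10^{-15}$ m for pion exchange and of order $10^{-18}$ m for the weak force, in line with the ranges quoted earlier in this section.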

Confinement and Screening of Color Contrary to the weak force, the strong force 
becomes short range for a different reason. A group of quarks bind together by 
the color force to become hadrons. Protons and pions are examples. At short dis- 
tances, the color force between the quarks is inversely proportional to the square 
of the distance, just like any long-range force, but at long distances its behavior 
is different and has a component that does not diminish as the distance increases. 
Therefore, stored potential energy between them increases with distance, hence 
an infinite energy is required to separate them to a macroscopic distance. The 
quarks are, in a sense, connected by a string and, like the north and south poles 
of a magnet, cannot be isolated individually, a phenomenon called "confinement". 
They tend not to separate from each other by more than $\sim 10^{-15}$ m, which is the 
typical size of hadrons. When the string is stretched beyond its limit, it is cut into 
many pieces, producing more hadrons. The color charges of the quarks come in 
three kinds dubbed "R, G, B", for red, green and blue, but only a neutral color combination 
can form hadrons. 1) In the vicinity of the hadron, quarks of different color 
are located at different distances and hence the effect of individual color can be felt. 
But from far away the color looks completely neutralised and the strong force be- 
tween the hadrons vanishes, making it a short-range force. The force between the 
hadrons, including the nuclear force, is an action of partially compensated residual 

1) The name "color" is used as a metaphor for three kinds of force-generating charge. We may 
equally call them by a number or any other index. Only a certain mixture of R, G, B, or color 
with anticolor (complementary color) that makes white can form hadrons. Hence in the color 
terminology we can conveniently say hadrons are color-neutral. 
colors and is therefore secondary in its nature, like the van der Waals force acting 
between molecules. The action of color forces is, mathematically, almost identical 
to that of the electromagnetic force and is also described by a gauge theory, the 
difference being only in the number of charges (three). 2) This is the reason for the 
name quantum chromodynamics (QCD) to describe the mathematical framework 
of the strong force dynamics. 

1.1.3 

The Standard Model 

The weak force can be described by a gauge theory that contains two kinds of 
charge. General relativity, the theory of gravity, was also recognized as a kind of 
gauge theory. Considering that the electromagnetic and weak forces are unified 
and all four fundamental forces work in the same mathematical framework, it is 
natural to consider that all the forces are unified but show different aspects in dif- 
ferent environments. The Grand Unified Theory (GUT), to unify the electroweak 
and strong interactions, and the Super Grand Unified Theory, 3) to combine all the 
forces, including gravity, are currently active research areas. Among them "super- 
gravity" and "superstring theories" are the most popular. But only the electroweak 
theory and QCD are experimentally well established and are called collectively "the 
Standard Model" (SM) of elementary particles. The history and current situation of 
force unification are shown in Fig. 1.1. 
The essence of the Standard Model can be summarized as follows: 

1. The elementary particles of matter are quarks and leptons. 

2. The mathematical framework for the force dynamics is gauge theory. 

3. The vacuum is in a sort of superconducting phase. 

Particle species in the Standard Model are given in Table 2.1 of Sect. 2.1. Matter par- 
ticles, the quarks and leptons, are fermions having spin 1/2, and the fundamental 
force carriers, also called gauge particles, are spin 1 bosons. The Higgs boson is a 
spin 0 particle and is the only undiscovered member. Whether it is elementary or 
a dynamical object remains to be seen. The Standard Model was proposed around 
1970 and was firmly established experimentally by the end of the 1970s. The Stan- 
dard Model was constructed on the torrent of data from the 1960s to 1980s, when 
large accelerators became operational (see Table 1.1). It marked the end of the era 
of the 80-year quest for the fundamental building blocks of matter that started with 
Rutherford at the beginning of the 20th century, and opened a new epoch. The 
effect on our consciousness (material view) was revolutionary and comparable to 
that of relativity and quantum mechanics at the beginning of the 20th century. The 

2) Gluons, the equivalent of photons in electromagnetism, come in eight colors, which are 
combinations of the three basic colors, excluding white. 

3) The prefix "super" is attached for theories that [...] 
fermions and bosons to form one family. They [...] 
Model and are briefly described in Chap. 19. 
Figure 1.1 Unification of the forces. Those enclosed in dashed lines are not yet established. 
Only the strong force changes its strength appreciably within currently available energy ranges 
or, equivalently, distances. This is why the distance of the strong force is specified. 



Standard Model is very powerful and can explain all the phenomena in the microworld 
of particles in a simple and unified way, at least in principle. Forty years 
have passed since the Standard Model was established, yet there has appeared only 
one phenomenon that goes beyond the Standard Model. Neutrino oscillation, in 
which a neutrino of one kind is transformed to another while it propagates, does 
not happen if the mass of the neutrino vanishes, as is assumed in the Standard 
Model. But it is fair to say it only needs a small stretch and is not a contradiction 
to the Standard Model. The model is so powerful that it is not easy 
to think of an experiment with currently available accelerators that could challenge 
the Standard Model in a serious way. A quantum jump in the accelerator energy 
and/or intensity is required to find phenomena that go beyond the Standard Model 
and explore new physics. 
Table 1.1 Chronicle of particle physics. Important discoveries and [...] 
Columns: Year, Names, Discovered or proposed, Accelerator. 
1964, M. Gell-Mann, [...] 
The fact that the kinetic energy of particles in particle physics far exceeds the 
mass energy makes special relativity an indispensable tool to describe their behavior. 
Understanding the mathematical formalism that is built on the combination of 
special relativity and quantum mechanics, also known as quantum field theory, requires 
a fair amount of effort, and the ensuing results very often transcend human 
common sense. The only other fields that need both disciplines are cosmology and 
phenomena in astrophysics such as supernovae and black-hole dynamics. 



The Accelerator as a Microscope 

In order to investigate the microscopic structure of matter, we need a tool, a micro- 
scope, that magnifies the object so that we can see it, or more generally recognize it 
using our five senses. What corresponds to the light-collecting device in the micro- 
scope is a particle accelerator. Shining light on the object corresponds to making 
high-energy particles collide with target particles whose structure we want to inves- 
tigate. The scattered light is collected by objective lenses and focused to an image, 
which we observe through an eyepiece. In the accelerator experiment, we use a 
radioisotope detector to measure patterns of the scattered particles. By computer 
processing they can be transformed to a pattern that we can recognize as a mag- 
nified image. The magnification of a microscope is determined by a combination 
of lenses, but they merely transform the image to help human eyes to recognize 
the scattered pattern of light. The resolution, the ability of the optical microscope 
to discriminate, is largely determined by the wavelength of the light. The more 
powerful electron microscope uses an electron beam and instead of optical lenses, 
quadrupole magnets to focus the beam. The scattered electrons are projected onto 
a fluorescent screen to make an image that human eyes can recognize. In other 
words, the electron microscope is a mini accelerator-detector complex. 

The quantum principle dictates that particles have a dual wave/particle property 
and the wavelength is given by the Planck constant divided by the particle momentum 
(the de Broglie wavelength $\lambda = h/p$). Therefore, the resolving power is directly 
proportional to the momentum or the energy of the incoming particle. This is why 
we need high-energy accelerators to explore the structure of matter. The smaller the 
scale, the higher the energy. Therefore, to explore elementary particles, the highest 
energy technologically realizable in each era is required. This is why particle physics 
is often called high-energy physics. 
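
As a rough numerical illustration of the relation $\lambda = h/p$, the following Python sketch converts a beam momentum into a de Broglie wavelength. The constant name `HC_MEV_FM`, the function name and the sample momenta are choices made for this illustration and do not come from the text; for the highest entry the momentum is simply taken as $pc \approx E$.

```python
# Illustrative sketch: de Broglie wavelength lambda = h / p.
# HC_MEV_FM, the function name and the sample momenta are assumptions
# made for this example only.

HC_MEV_FM = 1239.841984  # h*c in MeV*fm (numerically the same as eV*nm)


def de_broglie_wavelength_m(pc_mev: float) -> float:
    """Return lambda = h/p in metres for a momentum given as p*c in MeV."""
    if pc_mev <= 0:
        raise ValueError("momentum must be positive")
    return HC_MEV_FM / pc_mev * 1e-15  # convert fm to m


if __name__ == "__main__":
    samples = {"1 MeV": 1.0, "1 GeV": 1.0e3, "1 TeV": 1.0e6, "14 TeV": 1.4e7}
    for label, pc in samples.items():
        print(f"{label:>7}: lambda ~ {de_broglie_wavelength_m(pc):.2e} m")
```

For $pc = 14$ TeV the wavelength comes out just below $10^{-19}$ m, and the reduced wavelength $\hbar/p$ is smaller by a factor $2\pi$, that is, of order $10^{-20}$ m.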

Figure 1.2, commonly referred to as the Livingston chart, shows the history of
major accelerators and their energy. On the right are indicated the approximate
ranges of energy that are suitable for the investigation of the particle structure
at the specified level. We summarize important discoveries and the tools used for
them in the history of particle physics in Table 1.1. We can see that the most
effective tools for particle physics are accelerators of the highest energy
available at the time. The one with the highest energy currently is the LHC (Large
Hadron Collider; a proton-proton colliding accelerator of total energy 14 TeV) in
Europe. It started operation in 2009. The LHC can explore a micro-size of
$\sim 10^{-20}$ m and is expected to discover the Higgs particle.

Figure 1.2 The Livingston chart ... constituent particles.



2 

Particles and Fields 



In particle physics, a particle is defined as an energy quantum associated with a 
"field". The field is an entity that is defined over all space and time (collectively 
called space-time) and is able to produce waves, according to classical physics, or
quanta, in present-day terminology, when excited or when some amount of energy
is injected. In the microworld the quanta are observed as particles, but collectively
they behave like waves. A quantum is the name given to an object that possesses 
the dual properties of both a particle and a wave. A typical example of a field is the 
electromagnetic field. It is created by a charge and extends all over space. It is static 
when the charge that creates it does not move, but it can be excited by vibrating 
the source of the field, the electric charge; then the vibrating field propagates. This 
is a charge radiating an electromagnetic wave. It is well known that historically 
quantum mechanics has its origin in Planck's recognition that blackbody radiation 
is a collection of countable quanta, photons. 

In 19th century physics, waves could only be transmitted by some kind of vi- 
brating medium and the electromagnetic waves were considered to propagate in a 
medium called the "ether". The existence of the ether was ruled out with the advent 
of special relativity, and people began to consider that the vacuum, though void of 
anything in the classical sense, has the built-in property of producing all kinds of 
fields, whose excitations are observed as quanta or particles. The Standard Model 
has advanced the idea that the vacuum is not an empty entity but rather filled with 
various kinds of exotic material, including the Higgs particle, and exhibits dynam- 
ical properties like those observed in ordinary matter. The vacuum as we view it 
nowadays is a kind of resurrected ether with strange attributes that nobody had 
thought of. In this chapter, we describe an intuitive picture of both particles and 
fields, define some associated variables and prepare basic tools and terminologies 
necessary for the treatment of quantum field theory. 



What is a Particle? 

There are several properties that characterize a particle. 



(1) A particle is spatially localized at each instant and its number is countable.

In classical mechanics, a particle's mass m and velocity v, or its energy E and
momentum p, can be defined and measured. In the nonrelativistic limit, where the
particle's velocity is far smaller than the velocity of light c, these variables of
a freely moving particle are given by

$$\boldsymbol{p} = m\boldsymbol{v}, \quad E = \frac{1}{2}mv^2 = \frac{p^2}{2m}, \quad v = |\boldsymbol{v}|, \quad p = |\boldsymbol{p}| \qquad (2.1)$$

where bold letters denote three-component vectors. However, when the velocity is
close to c, Einstein's formulae

$$\boldsymbol{p} = \frac{m\boldsymbol{v}}{\sqrt{1 - (v/c)^2}}, \quad E = \frac{mc^2}{\sqrt{1 - (v/c)^2}} \qquad (2.2)$$

have to be used instead. In the microworld, the energy and the momentum are
more important quantities directly related to observables. We can define the mass
and velocity in terms of the energy and the momentum. The relation can be derived
from Eq. (2.2):

$$E^2 = (pc)^2 + (mc^2)^2, \quad \boldsymbol{v} = \frac{\boldsymbol{p}c^2}{E} \qquad (2.3)$$

Note that Eq. (2.3) also holds for particles with zero mass, such as the photon.
When the mass vanishes, $E = pc$ and v is always equal to c.
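
The relations in Eq. (2.3) are easy to check numerically. The Python sketch below works in energy units (pc and mc² in MeV); the function names and the sample values are assumptions introduced for this illustration only.

```python
# Illustrative sketch of E^2 = (pc)^2 + (mc^2)^2 and v/c = pc/E.
# Function names and units (MeV) are assumptions made for this example.
import math


def total_energy(pc: float, mc2: float) -> float:
    """Total energy E from momentum (as pc) and rest energy mc^2."""
    return math.hypot(pc, mc2)


def velocity_over_c(pc: float, mc2: float) -> float:
    """Velocity in units of c, v/c = pc/E; equals 1 for a massless particle."""
    return pc / total_energy(pc, mc2)


def momentum_from_velocity(beta: float, mc2: float) -> float:
    """Einstein's formula for pc given beta = v/c (valid for 0 <= beta < 1)."""
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1 for a massive particle")
    return mc2 * beta / math.sqrt(1.0 - beta * beta)


if __name__ == "__main__":
    proton_mc2 = 938.272  # MeV, assumed value for the example
    pc = momentum_from_velocity(0.6, proton_mc2)
    print(f"pc = {pc:.1f} MeV, E = {total_energy(pc, proton_mc2):.1f} MeV")
    print("massless particle: v/c =", velocity_over_c(10.0, 0.0))
```

In the limit pc much smaller than mc², `total_energy(pc, mc2) - mc2` approaches $p^2/2m$, which recovers the nonrelativistic expression of Eq. (2.1).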
(2) The particle can be created or annihilated. 

This results from Eq. (2.2). The mass of a particle is but one form of energy: a
particle can be created if enough energy is available, and its mass can be converted
back into other forms of energy, subject to selection rules of one kind or another.
Examples of reactions are

$$e^+ + e^- \to \gamma + \gamma \qquad (2.4a)$$

$$e^+ + e^- \to p + \bar{p} \qquad (2.4c)$$

$$p + \bar{p} \to \text{any number of } \pi\text{'s} \qquad (2.4d)$$

$$\pi^+ \to \mu^+ + \nu_\mu \qquad (2.4e)$$

$$\pi^- + p \to K^0 + \Lambda \qquad (2.4f)$$

As a result, the existence probability of a particular kind of particle is not
conserved, and the one-particle treatment of the Schrödinger equation is not
satisfactory for treating relativistic particles. Quantum field theory is needed to
treat the production and annihilation of particles.

(3) A particle is not necessarily stable.

This statement is a corollary to statement (2). Equation (2.4e) says that the
pion is unstable and decays to a muon and a neutrino. Its average lifetime $\tau$ is
measured to be approximately $2.6 \times 10^{-8}$ s. This means there is a time
uncertainty $\Delta t \sim \tau$ for the existence of the pion. Then the Heisenberg
uncertainty principle

$$\Delta E \cdot \Delta t \gtrsim \hbar$$

tells us that there is also uncertainty in the energy or the mass in the rest frame.
Here $\hbar = h/2\pi$ is Planck's constant divided by $2\pi$ and is also called Planck's
constant when there is no confusion. In the case of the pion

$$\Delta m c^2 \sim \frac{\hbar}{\Delta t} \approx \frac{6.58 \times 10^{-22}\ \text{MeV s}}{2.6 \times 10^{-8}\ \text{s}} \approx 2.5 \times 10^{-14}\ \text{MeV}$$

the uncertainty in the mass is, to all practical purposes, negligible. However, in
the case of the $\rho$ meson, the lifetime is $\sim 4.2 \times 10^{-24}$ s 1) and the mass
uncertainty is

$$\Delta m c^2 \sim \frac{6.58 \times 10^{-22}\ \text{MeV s}}{4.2 \times 10^{-24}\ \text{s}} \approx 157\ \text{MeV}$$

which cannot be neglected compared to its mass 780 MeV/c$^2$. The relative
uncertainty is very large:

$$\frac{\Delta m}{m} \approx \frac{157}{780} \approx 0.2$$
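
The two estimates above follow from the single relation $\Delta m c^2 \sim \hbar/\tau$. A minimal Python sketch, using the lifetimes quoted in the text, is given below; the constant and function names are assumptions for this illustration, and the pion rest energy of 139.6 MeV is an assumed value that the text does not quote.

```python
# Illustrative sketch: energy (mass) uncertainty of an unstable particle,
# Delta(mc^2) ~ hbar / tau. Names and the pion mass are assumptions.

HBAR_MEV_S = 6.582e-22  # reduced Planck constant in MeV*s (rounded)


def width_from_lifetime_mev(tau_s: float) -> float:
    """Return hbar / tau in MeV for a lifetime tau given in seconds."""
    if tau_s <= 0:
        raise ValueError("lifetime must be positive")
    return HBAR_MEV_S / tau_s


if __name__ == "__main__":
    # (lifetime in s, rest energy in MeV); lifetimes as quoted in the text
    particles = {"pion": (2.6e-8, 139.6), "rho": (4.2e-24, 780.0)}
    for name, (tau, mc2) in particles.items():
        width = width_from_lifetime_mev(tau)
        print(f"{name:>4}: hbar/tau = {width:.3g} MeV, relative = {width / mc2:.2g}")
```

The pion comes out with a relative mass uncertainty of order $10^{-16}$, while for the $\rho$ meson it is about 0.2.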



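Let us treat the problem more quantitatively. When the average lifetime is given
by $\tau$, the particle decays in a brief time interval $dt$ with a probability
$dt/\tau$. If $dN$ of $N$ particles die in $dt$,

$$dN = -N \frac{dt}{\tau} \quad \therefore \quad N(t) = N_0\, e^{-t/\tau}$$

In quantum mechanics, the existence probability of a particle is proportional to
the square of its wave function $\psi$,

$$|\psi(t)|^2 \propto e^{-t/\tau}$$

and the wave function of the particle in its energy eigenstate has time dependence
$\psi(t) \sim e^{-iEt/\hbar}$. Putting $E = E_0 - i\Gamma/2$, the time dependence of
the probability is given by

$$|\psi(t)|^2 = |\psi(0)|^2 e^{-\Gamma t/\hbar} \quad \therefore \quad \Gamma = \frac{\hbar}{\tau}$$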



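From the equations, we know that being unstable is equivalent to having a complex
energy with negative imaginary part $-\Gamma/2$ ($\Gamma > 0$). The next question is
then what kind of mass distribution does the particle have? The wave function as a
function of the energy can be expressed as a Fourier transform

$$\tilde{\psi}(E) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} dt\, \psi(t)\, e^{iEt/\hbar} \qquad (2.13)$$

Assuming the particle was created at $t = 0$ and $x = 0$, so that $\psi(t) = 0$
before $t = 0$, then

$$\tilde{\psi}(E) = \frac{1}{\sqrt{2\pi}} \int_0^{\infty} dt\, \psi(x = 0, t)\, e^{iEt/\hbar} = \frac{1}{\sqrt{2\pi}} \int_0^{\infty} dt\, \psi(0)\, e^{-i(E_0 - i\Gamma/2)t/\hbar}\, e^{iEt/\hbar} = \frac{i\hbar}{\sqrt{2\pi}}\, \frac{\psi(0)}{E - E_0 + i(\Gamma/2)}$$

Therefore, the probability that the particle takes an energy value E is given by

$$P(E) \propto |\tilde{\psi}(E)|^2 = \frac{|\psi(0)|^2 \hbar^2}{2\pi}\, \frac{1}{(E - E_0)^2 + (\Gamma/2)^2} \qquad (2.15)$$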



The functional shape is shown in Figure 2.1 and is called a Lorentzian, the same
function as the Breit-Wigner resonance formula, which describes unstable levels of
nuclei. From the shape, $\Gamma$ represents the full width at half maximum (FWHM) of
the function. $\Gamma$ can be regarded as the energy uncertainty of the unstable
particle, hence the Heisenberg uncertainty principle

$$\Delta E \cdot \Delta t \sim \Gamma \cdot \tau = \hbar \qquad (2.16)$$

is reproduced. The unstable elementary particles such as $\rho$ and $\Delta(1236)$
have the mass and width

$$m(\rho) = 776\ \text{MeV}/c^2, \quad \Gamma = 158\ \text{MeV} \qquad (2.17)$$

$$m[\Delta(1236)] = 1236\ \text{MeV}/c^2, \quad \Gamma = 115\ \text{MeV} \qquad (2.18)$$

Figure 2.1 Lorentzian shape ... distribution of unstable particles.
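
The statement that $\Gamma$ is the full width at half maximum of the Lorentzian of Eq. (2.15) can be checked numerically. The sketch below drops the constant prefactor, locates the upper half-maximum point by bisection and doubles its distance from the peak; the function names are assumptions for this illustration, and the $\rho$ values of Eq. (2.17) are used as sample input.

```python
# Illustrative sketch: the FWHM of 1 / ((E - E0)^2 + (Gamma/2)^2) equals Gamma.
# Function names are assumptions made for this example.


def lorentzian(energy: float, e0: float, gamma: float) -> float:
    """Unnormalized Lorentzian (Breit-Wigner) shape."""
    return 1.0 / ((energy - e0) ** 2 + (gamma / 2.0) ** 2)


def numerical_fwhm(e0: float, gamma: float, iterations: int = 100) -> float:
    """Find the full width at half maximum by bisection above the peak."""
    peak = lorentzian(e0, e0, gamma)
    low, high = e0, e0 + 10.0 * gamma  # the half-maximum point lies in between
    for _ in range(iterations):
        middle = 0.5 * (low + high)
        if lorentzian(middle, e0, gamma) > 0.5 * peak:
            low = middle
        else:
            high = middle
    return 2.0 * (0.5 * (low + high) - e0)


if __name__ == "__main__":
    # rho meson values from Eq. (2.17), in MeV
    print(f"FWHM = {numerical_fwhm(776.0, 158.0):.3f} MeV")
```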

Historically, they were observed as strong enhancements in the cross sections of
$\pi$-$\pi$ and $\pi$-$p$ scatterings, namely as resonances. The process strongly suggests
them to be of dynamical origin, and in the early days of hadron dynamics they were 
treated as such. Today, all hadrons, including nucleons, pions and resonances, are 
identified as the energy levels of quark bound states and are treated on an equal 
footing. 

(4) A particle has spin and other degrees of freedom.

Energy and momentum are not the only attributes a particle possesses. It acts as
if it were spinning on some axis. Spin means angular momentum and its magnitude
is quantized to integers or half-integers in units of the Planck constant $\hbar$.
A spin component, that is, the projection of the spin onto a certain axis (the
z-axis), is also quantized in units of $\hbar$:

$$S_z = S,\ S-1,\ S-2,\ \cdots,\ -(S-1),\ -S \qquad (2.19)$$

If a charged particle has a finite spin, it has a magnetic moment. 2)

Spin in quantum mechanics has, mathematically, all the properties of angular
momentum, but the picture of a self-rotating object such as the rotating earth does
not apply, because a point particle cannot have angular momentum in the classical
sense ($\boldsymbol{L} = \boldsymbol{r} \times \boldsymbol{p}$). Therefore the spin component should rather
be regarded as just a degree of freedom. For a particle of spin 1/2 like an
electron, only $S_z = 1/2, -1/2$ is allowed. For the electron, energy, momentum,
electric charge and spin orientation exhaust its degrees of freedom. This is evident
as only two electrons, with $S_z = \pm 1/2$, can occupy the same orbit in a hydrogen
atom. The Pauli exclusion principle
forbids other electrons of the same attributes to occupy the same quantum state at 
the same space point. Other particles may have more attributes, such as isospin, 
color, strangeness, etc., which are generically called internal degrees of freedom. 

(5) A particle has a corresponding antiparticle.

The particle and its antiparticle have identical mass, lifetime and spin values,
but the spin components and all the internal quantum numbers are opposite. The
positron is the antiparticle of the electron. The photon is its own antiparticle, a
situation that holds for other particles, such as neutral pions, which have no
discriminating internal quantum numbers. The sum of spin components and internal
quantum numbers of a particle and its antiparticle is zero, namely the same quantum
number as that of the vacuum. When they meet, they annihilate pair-wise and become
pure energy, usually in the guise of photons, gluons and other gauge bosons. If
sufficient energy is given to the pair, the produced energy instantly rematerializes
to any number of particle-antiparticle pairs or bosons of heavier mass within the
allowed energy limit.

2) A composite particle such as the neutron can generally have a magnetic moment
even if it is electrically neutral. Think of two oppositely charged particles
rotating in reverse directions. Even a neutral elementary particle like the neutrino
can have it due to higher order radiative corrections.

Table 2.1 Elementary particles and their characteristics

Matter constituent particles

Electric charge   I_3     Generation
+2/3              +1/2    u (up)      c (charm)     t (top)
-1/3              -1/2    d (down)    s (strange)   b (bottom)

(6) Matter particles and force carriers.

Minimum elements of matter that can be extracted fall into two classes. One
interacts via the strong force and the other does not. The former are called
hadrons, the latter, leptons. Hadrons with half-integer spin are called baryons and
include protons, neutrons and other heavier particles such as
$\Lambda, \Sigma, \Xi, \cdots$. Those with integer spin are called mesons and include
$\pi, K, \rho, \cdots$. All hadrons are composites of quarks that have spin 1/2, but
the quarks cannot be isolated (confinement). One might doubt the real existence of
such objects, but there is ample evidence for their existence inside the hadrons.
Today, six kinds of quarks (u, d, s, c, b, t) are known, listed here in order of
increasing mass, each having three color degrees of freedom and electric charge
2/3 or -1/3 in units of the proton charge e. The leptons have spin 1/2, too. Six
kinds of leptons are known: three kinds of charged leptons $(e, \mu, \tau)$ and their
associated neutral leptons (neutrinos) $(\nu_e, \nu_\mu, \nu_\tau)$. In the Standard
Model, the mass value of the neutrino is assumed zero. 3) To summarize, the
constituent particles of matter are six kinds of quarks and leptons distinguished
by "flavor". They act pairwise in the weak interaction and are further divided into
three generations. The particle species are summarized in Table 2.1.

Properties of the particles in the same row but in different generations are
almost identical apart from their mass, which becomes heavier with the generation
(see Table 2.2). Known matter in the universe is made mostly of the first
generation. 4) Others are generated in high-energy phenomena or supposedly exist in
extreme conditions such as the interior of neutron stars. The existence and role of
the generation is an unsolved problem. 5) There is no clear evidence that higher
generations do not exist, but the weak gauge boson $Z^0$ can decay into pairs of
neutrinos and antineutrinos, producing no others except the known three kinds. If
the fourth generation exists, its neutrino has a mass greater than
$m_Z/2 \approx 45\ \text{GeV}/c^2$. Considering it an unlikely situation, the Standard
Model assumes only three generations.

3) In 1998, a phenomenon called neutrino oscillation, in which $\nu_\mu$ is converted
to another kind of neutrino, was discovered in atmospheric neutrinos. This implies a
finite mass of the neutrino. Within the context of this book, however, the neutrino
mass may safely be assumed zero until later, when neutrino properties are discussed.

Table 2.2 Masses of quarks and leptons [311]

Quarks a)              ...            b: 4.2 GeV        t: 171 GeV
Neutrinos b), c)       ?              ~0.009 eV         ~0.05 eV
Charged leptons        e: 0.511 MeV   mu: 105.7 MeV     tau: 1777 MeV

a Current mass, which is determined by dynamical processes like scattering.
b Observed $\nu_e, \nu_\mu, \nu_\tau$ are mixtures of mass eigenstates $\nu_1, \nu_2, \nu_3$.
c For the neutrinos, only mass differences $\Delta m_{ij}^2 = |m_i^2 - m_j^2|$ are known.
Consequently, $m_3 \gg m_2 \gg m_1$ is assumed. Note: MeV = $10^6$ eV, GeV = $10^9$ eV

Quantum field theory requires the existence of a particle that mediates the
interaction, a force carrier. It is a quantum of the force field. Gauge theory
requires the force field to be a vector field, hence the force carrier must have
spin 1. 6) Corresponding to the four fundamental forces in nature, there are four
kinds of force carrier particles: gluons for the strong force, photons for the
electromagnetic force, weak bosons ($W^\pm$, $Z^0$) for the weak force and gravitons
for gravity. A list of fundamental interactions and their characteristics is given
in Table 2.3.

Vacuum particle? If the Higgs particle is discovered, there could be another
category of spin 0 fundamental particle. It may be called a vacuum particle from the
role of the Higgs. However, whether it is fundamental or of dynamical origin has
yet to be determined. A typical example of the dynamical Higgs is if it is composed
of a $t\bar{t}$ pair [277], just like the Cooper pair in superconductivity is
composed of an electron pair of opposite momentum and spin orientation.

4) There is an unidentified matter form in the universe known as "dark matter",
which makes up ~ 22% of all cosmic energy. The ordinary matter of known type makes
up only ~ 5%. The rest is unidentified vacuum energy called "dark energy".

5) The muon, for instance, could be termed a heavy electron. Except for the fact
that it can decay into an electron and neutrinos, all the reaction formulae for the
muon may be reproduced from those of the electron simply by replacing its mass. The
seemingly mysterious existence of the muon is expressed in a phrase by I. Rabi:
"Who ordered it?". The generation puzzle is the extension of the muon puzzle.

6) This is true for the Standard Model force carriers. The quantization of the
gravitational field produces a spin 2 graviton. This is due to the special situation
that the source of the gravity is mass, or more generally energy-momentum, which
itself is a second rank tensor in space-time.

What is a Field?



2.2.1 
Force Field 



Force at a Distance, Contact Force and the Force Field 

In classical mechanics, the gravitational force is expressed by Newton's formula

$$f = -G_N \frac{mM}{r^2} \qquad (2.20)$$

This means that an object with mass M exerts its force directly on another object
with mass m at a distance r. Einstein considered such a "force at a distance" to be
unacceptable. To understand why, let's carry out a thought experiment. Suppose
the sun suddenly disappears at $t = 0$ and infer what would happen to the earth.
According to Newton's way of thinking, the sun's attraction would disappear
instantly, because the mass M, the source of the force, has disappeared, and the
earth would be thrown off its orbit into cosmic space at the same instant. According
to Einstein's principle of relativity, nothing can travel faster than light, and the
information of the sun's disappearance would take at least 8 minutes to reach the
earth. During the 8-minute interval there is no way for the earth to know that the
sun has vanished, which means that the gravitational force continues to act. It
follows that something other than the sun must be there to exert the force on the
earth. We call this something a "field". The sun (mass) creates a gravitational
field over all space and the force acting on the earth is the action of the local
field at the same space point as where the earth is located. Namely, all forces must
be "contact forces" that can act only upon contact with the object on which they are
exerted. In this way of thinking, wherever a force acts there must be an associated
field, a gravitational field for gravity and an electromagnetic field for the
electromagnetic force.
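
The "8 minutes" in the thought experiment is simply the light travel time over the mean sun-earth distance. A small Python sketch of the arithmetic follows; the constant names are assumptions for this illustration, and the astronomical unit is used as the sun-earth distance.

```python
# Illustrative sketch: light travel time from the sun to the earth.
# Constant names are assumptions made for this example.

SPEED_OF_LIGHT_M_S = 299_792_458.0          # exact by definition of the metre
ASTRONOMICAL_UNIT_M = 149_597_870_700.0     # exact by IAU definition


def light_travel_time_s(distance_m: float) -> float:
    """Time in seconds for light to cover the given distance in metres."""
    return distance_m / SPEED_OF_LIGHT_M_S


if __name__ == "__main__":
    delay = light_travel_time_s(ASTRONOMICAL_UNIT_M)
    print(f"delay = {delay:.1f} s = {delay / 60.0:.2f} min")
```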
Take a look at the Maxwell equations of the electric force, which are written as

$$\nabla \cdot \boldsymbol{E}(t, \boldsymbol{x}) = \frac{\rho}{\epsilon_0}, \quad \boldsymbol{f}(t_y, \boldsymbol{y}) = q_1 \boldsymbol{E}(t_y, \boldsymbol{y}), \quad \nabla \times \boldsymbol{E}(t, \boldsymbol{x}) = 0 \qquad (2.21)$$

This translates to the statement that an electric charge distribution $\rho$ at a
space point $\boldsymbol{x}$ at time $t$ creates an electric field $\boldsymbol{E}(t, \boldsymbol{x})$ over all space.
A point charge $q_1$ at a space-time point $(t_y, \boldsymbol{y})$ receives the force from the
local electric field $\boldsymbol{E}$ at the same space-time point where it is located. The
third equation is an auxiliary condition that the static field must satisfy. If the
static electric field is created by a point charge $q_2$ at point $\boldsymbol{r} = 0$,

$$\rho = q_2\, \delta(\boldsymbol{r}), \quad \delta(\boldsymbol{r}) = \delta(x)\delta(y)\delta(z) \qquad (2.22)$$

Solving Eq. (2.21) with the above input gives Coulomb's law

$$\boldsymbol{f} = \frac{q_1 q_2}{4\pi\epsilon_0}\, \frac{1}{r^2}\, \frac{\boldsymbol{r}}{r}$$

The result has the same form as Newton's law, but in the dynamical (i.e.
time-dependent) solution the influence of a changing force never propagates faster
than light. Namely, the Maxwell equations respect the principle of relativity. This
is guaranteed by the local nature of the differential equations. Integral forms are
useful for an intuitive interpretation, but the equations have to be written in
differential forms. Historically, Maxwell's equations were obtained empirically
(except for the displacement current), then the Lorentz transformation was derived
to make the force action consistent in the rest frame and in a moving frame, which
is a basis of the theory of relativity.

Photon 

An oscillating electromagnetic field propagates through space and becomes an elec- 
tromagnetic wave. It can be created by oscillating the source, but once created, it 
becomes a wave acting independently of the source. Quantum mechanics tells us 
that the wave behaves also as a particle. An entity exhibiting the dual behavior of a 
particle and a wave is called a quantum. The quantum of the electromagnetic field 
is called a photon. The above statement can be summarized as follows 



The principle of relativity requires the force field and the quantum principle 
requires quanta of the field, i.e. force carrier particles. 



The electromagnetic wave can carry energy and when it meets another electrically
charged object it forces it to oscillate, because classically it is an oscillating
force field. But as a particle, when it hits the object, it is either scattered or
absorbed, giving it a kick. The energy and momentum of the quantum are given by the
relations

$$E = h\nu = \hbar\omega, \quad \boldsymbol{p} = \hbar\boldsymbol{k}, \quad |\boldsymbol{p}| = \frac{h}{\lambda}$$

where $\nu$ is the wave's frequency, $\omega = 2\pi\nu$ the angular frequency,
$\lambda$ the wavelength and $\boldsymbol{k}$ the wave vector. As we have seen, the creation of
the electromagnetic (force) field is equivalent to the emission of a photon (force
carrier particle) and the exertion of the force by the electromagnetic (force) field
on the charge (source) is equivalent to absorption of the photon (quantum) by the
charged object (another source). In quantum field theory, action of a force means an
exchange of the force carrier particle. In other words, action/reaction of the force
is synonymous with absorption/emission of the force quantum by the object on which
the force acts. In a mathematical treatment of quantum mechanics, energy is
transmitted often in momentum or energy space without ever referring to the space
variables. The notion of a force is a bit different from that used in classical
mechanics, where it refers to an energy variation associated with space displacement
($f = -dE/dx$). So instead of saying force exertion, the word "interaction" is used
where any exchange of energy occurs.

Strength of the Force

In the following, we talk of (potential) energy instead of force. The electromagnetic
interaction between, say, a proton and an electron in a hydrogen atom is expressed
by the Coulomb potential and the strength of the force by $\alpha$ where

$$V(r) = -\frac{e^2}{4\pi\epsilon_0}\, \frac{1}{r} = -\alpha\, \frac{\hbar c}{r}, \quad \alpha = \frac{e^2}{4\pi\epsilon_0 \hbar c} \simeq \frac{1}{137}$$

Here, to define a dimensionless quantity $\alpha$, we used the fact that $\hbar c$ has
the dimension of [energy x distance]. e is the unit electric charge that both the
electron and the proton have, which has a value of $1.6 \times 10^{-19}$ coulomb in
rationalized MKS units. $\alpha$ is called the fine structure constant and represents
the fundamental strength of the electromagnetic interaction. The dimensionless
strength of the gravitational force can be defined similarly:

$$V(r) = -G_N \frac{m_1 m_2}{r} = -\alpha_G \left(\frac{m_1 m_2}{m_p m_e}\right) \frac{\hbar c}{r}, \quad \alpha_G = \frac{G_N m_p m_e}{\hbar c} \approx 3.2 \times 10^{-42}$$

where $G_N$ is Newton's constant of universal gravitation and $m_p$, $m_e$ are the
proton and electron mass. In particle physics gravity can be neglected because
$(\cdots) \sim O(1)$, hence $\alpha_G$ is by many orders of magnitude smaller than
$\alpha$. The strength of the strong and weak force will be described in the
following section.
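
The two dimensionless strengths can be evaluated directly from SI values of the constants. The Python sketch below does this; the constant names and the rounded CODATA values are assumptions supplied for this illustration rather than taken from the text.

```python
# Illustrative sketch: dimensionless strengths of the electromagnetic and
# gravitational interactions. Constant names and rounded SI values are
# assumptions made for this example.
import math

E_CHARGE = 1.602176634e-19      # C
EPSILON_0 = 8.8541878128e-12    # F/m
HBAR = 1.054571817e-34          # J s
C_LIGHT = 299_792_458.0         # m/s
G_NEWTON = 6.67430e-11          # m^3 kg^-1 s^-2
M_PROTON = 1.67262192369e-27    # kg
M_ELECTRON = 9.1093837015e-31   # kg


def fine_structure_constant() -> float:
    """alpha = e^2 / (4 pi epsilon_0 hbar c)."""
    return E_CHARGE ** 2 / (4.0 * math.pi * EPSILON_0 * HBAR * C_LIGHT)


def gravitational_strength(m1: float = M_PROTON, m2: float = M_ELECTRON) -> float:
    """alpha_G = G_N m1 m2 / (hbar c) for two masses given in kg."""
    return G_NEWTON * m1 * m2 / (HBAR * C_LIGHT)


if __name__ == "__main__":
    alpha = fine_structure_constant()
    alpha_g = gravitational_strength()
    print(f"1/alpha = {1.0 / alpha:.3f}")
    print(f"alpha_G = {alpha_g:.2e}")
    print(f"alpha_G / alpha = {alpha_g / alpha:.1e}")
```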

Problem 2.1 

(1) Calculate and confirm the values of the fine structure constant and the gravita- 
tional strength. 

(2) Calculate the mass value which makes $\alpha_G = 1$. Express it in units of GeV. It is
called the Planck energy.

Force Range 

Generally a particle has finite mass. What will happen if a force carrier particle
has nonzero mass? To exert the force, it has to be emitted and absorbed by
interacting objects. The emission of a force carrier of finite mass m produces an
energy uncertainty of at least $\Delta E \sim mc^2$. The Heisenberg uncertainty
principle dictates that the time interval $\Delta t$ that allows energy violation
should be less than $\sim \hbar/\Delta E$. If the force carrier is not absorbed within
this time interval, the force cannot be transmitted. As the particle velocity is
less than the light velocity c, the force range $r_0$ should be less than

$$r_0 \lesssim c\Delta t \sim c\hbar/mc^2 = \hbar/mc \qquad (2.27)$$

Therefore, the range of a force mediated by a force carrier having a finite mass is
limited to its Compton wavelength.

Let us treat the problem a little more quantitatively. As will be explained soon,
any relativistic particle should be a quantum of a field that satisfies the
Klein-Gordon equation

$$\left[\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \mu^2\right]\varphi = 0, \quad \mu = \frac{mc}{\hbar}$$



In analogy with the electric potential, the force field created by a point source
would satisfy the equation

$$\left[\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \mu^2\right]\phi = g\, \delta^3(\boldsymbol{r}) \qquad (2.29)$$

where g denotes the strength of the force corresponding to the electric charge of
the electric force. For a static potential where $\partial/\partial t = 0$, the equation
is easily solved to give

$$\phi(r) = \frac{g}{4\pi}\, \frac{e^{-r/r_0}}{r}, \quad r_0 = \frac{1}{\mu} = \frac{\hbar}{mc} \qquad (2.30)$$

Problem 2.2

Derive Eq. (2.30) from Eq. (2.29).
Hint: Use the Fourier transform

$$\phi(\boldsymbol{r}) = \int \frac{d^3p}{(2\pi)^3}\, e^{i\boldsymbol{p}\cdot\boldsymbol{r}}\, \tilde{\phi}(\boldsymbol{p}), \quad \delta^3(\boldsymbol{r}) = \int \frac{d^3p}{(2\pi)^3}\, e^{i\boldsymbol{p}\cdot\boldsymbol{r}}$$

The reach of a short-range force is mathematically defined by $r_0$ in Eq. (2.30),
which is called the Yukawa potential. When $r \ll r_0$, the potential is of the
Coulomb type, but for $r \gg r_0$ it decreases very rapidly. For gravitation or the
Coulomb force, $r_0 = \infty$ and they are called long-range forces. Based on the
above formula, Yukawa predicted the mass value of the $\pi$ meson, the nuclear force
carrier, to be roughly 200 times that of the electron from the observation that the
reach of the nuclear force is approximately $10^{-15}$ m. The strength of the nuclear
force is measured to be $g^2/4\pi \sim 10$, which is 1000 times the electromagnetic
force. The source of the nuclear force is a nucleon, but as the nucleon as well as
the pion turned out to be composites of quarks, the real source of the strong force
was later identified as the color charge, which all quarks carry. The true strong
force carrier is called the gluon and its mass is considered to be zero, just like
the photon. The potential between the quarks is of the Coulomb type, but for reasons
to be described later, has an extra term and is approximately (phenomenologically)
given by

$$\phi(r) = k_1 \frac{1}{r} + k_2 r \qquad (2.31)$$

The second term increases linearly with distance and is called the confining
potential. $k_1$ decreases logarithmically at short distance, a phenomenon called
asymptotic freedom, and takes a value $\sim 0.1$ at $r \sim 10^{-18}$ m.

The weak force carriers come in three kinds, charged $W^\pm$ with mass of
80.4 GeV/c$^2$ and a neutral $Z^0$ with mass of 91.2 GeV/c$^2$. Their mass value means
$r_0 \sim 10^{-18}$ m for the weak force. The range of the weak force is so small that
practically it could be replaced with a contact force before the advent of big
accelerators. The coupling strength was expressed in terms of the dimensionful
quantity $G_F \sim 10^{-5}\, m_p^{-2}$ for historical reasons, where $m_p$ is the mass of
the proton.
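
The relation $r_0 = \hbar/mc$ between carrier mass and force range can be put into numbers with a few lines of Python. In the sketch below the function names and the value of $\hbar c$ are assumptions for this illustration; the W and Z masses are those quoted above, while the charged-pion rest energy of 139.57 MeV is an assumed input not given in the text.

```python
# Illustrative sketch: range r0 = hbar / (m c) of a force mediated by a
# massive carrier, and the Yukawa suppression factor exp(-r / r0).
# Names, hbar*c and the pion mass are assumptions made for this example.
import math

HBAR_C_MEV_FM = 197.327  # hbar*c in MeV*fm (rounded)


def force_range_m(mc2_mev: float) -> float:
    """Range r0 = hbar*c / (m c^2) in metres for a rest energy in MeV."""
    if mc2_mev <= 0:
        raise ValueError("a massless carrier has infinite range")
    return HBAR_C_MEV_FM / mc2_mev * 1e-15  # convert fm to m


def yukawa_to_coulomb_ratio(r_m: float, mc2_mev: float) -> float:
    """Factor exp(-r / r0) by which a Yukawa potential falls below 1/r."""
    return math.exp(-r_m / force_range_m(mc2_mev))


if __name__ == "__main__":
    carriers = {"pion": 139.57, "W": 80.4e3, "Z": 91.2e3}  # rest energies in MeV
    for name, mc2 in carriers.items():
        print(f"{name:>4}: r0 ~ {force_range_m(mc2):.2e} m")
    print("suppression at r = 3 r0:", yukawa_to_coulomb_ratio(3 * force_range_m(139.57), 139.57))
```

The pion gives a range of about $1.4 \times 10^{-15}$ m and the weak bosons a few times $10^{-18}$ m, the orders of magnitude quoted in the text.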

Problem 2.3 

From the mass values of $W^\pm$, $Z^0$, calculate the weak force range.



Problem 2.4 

From observation of stars and polarization of star light, it is known that a coherent
magnetic field of $\sim 3\ \mu$ ($\mu = 10^{-6}$) gauss exists in the Crab nebula. This
means that the electromagnetic force range is at least hundreds of light years. If
the intergalactic magnetic field is confirmed, the range is even larger, millions of
light years. Calculate the upper limit of the photon mass.



2.2.2 

Relativistic Wave Equation 

Physical quantities that characterize a particle are energy E and momentum
$\boldsymbol{p}$, but in quantum mechanics the particle is a quantum and shows wave behavior.
The wave picture of a particle can be derived by replacing

$$E \to i\hbar\frac{\partial}{\partial t}, \quad \boldsymbol{p} \to -i\hbar\nabla \qquad (2.32)$$

and regarding them as operators acting on a wave function. Inserting Eq. (2.32) into
the nonrelativistic energy-momentum relation (2.1), we obtain a wave equation for
the de Broglie wave:

$$i\hbar\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\nabla^2\psi \qquad (2.33)$$

If the particle is moving in a potential, one adds a potential $V(\boldsymbol{x})$ and the
result is the well-known Schrödinger equation. To see if we can obtain a
relativistic Schrödinger equation, we apply (2.32) to Eq. (2.3) and obtain

$$\left[\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \left(\frac{mc}{\hbar}\right)^2\right]\varphi = 0 \qquad (2.34)$$

which is called the Klein-Gordon equation. Equation (2.34) has a plane wave solution

$$\varphi(r) \equiv \varphi(\boldsymbol{x}, t) = N e^{i(\boldsymbol{p}\cdot\boldsymbol{x} - Et)/\hbar} \qquad (2.35)$$

where N is a normalization constant. A general solution can be expressed as a
superposition of plane waves. Unfortunately, unlike the nonrelativistic
equation (2.33), we cannot regard $\varphi$ as a probability amplitude for a particle.
If $\varphi$ is to be regarded as a probability amplitude just like $\psi$ in the
Schrödinger equation, the probability density $\rho$ and its flow (current) $\boldsymbol{j}$
have to be defined to satisfy the continuity equation (i.e. conservation of
probability)

$$\frac{\partial\rho}{\partial t} + \nabla \cdot \boldsymbol{j} = 0 \qquad (2.36)$$

The current $\boldsymbol{j}$ may assume the same form as in the Schrödinger equation:

$$\boldsymbol{j} = -\frac{i\hbar}{2m}\left(\varphi^*\nabla\varphi - \varphi\nabla\varphi^*\right) \qquad (2.37)$$

Then, in a relativistic treatment (to be discussed in Chap. 3), $\rho c$ has to be the
fourth component of the four-dimensional current vector. 7)

$$\rho = \frac{i\hbar}{2mc^2}\left(\varphi^*\frac{\partial\varphi}{\partial t} - \varphi\frac{\partial\varphi^*}{\partial t}\right) \qquad (2.38)$$

It is easily verified that $\rho$ and $\boldsymbol{j}$ defined above satisfy the continuity
equation Eq. (2.36).

One is tempted to interpret $\rho$ as the probability density, but this interpretation
encounters difficulties.

Negative Energy The first difficulty stems from the fact that the energy-momentum
equation $E^2 = (pc)^2 + (mc^2)^2$ has two solutions

$$E = \pm\sqrt{(pc)^2 + (mc^2)^2} \qquad (2.39)$$

Namely, E can be negative. A particle with negative energy cannot occupy a stable
state, because it can be transferred to an indefinitely lower energy state by
emitting a photon. Historically, people tried to construct a theory without the
negative energy solutions. Mathematically, however, these solutions are required to
satisfy the completeness condition as well as causality, and trying to construct a
closed system without them has produced various paradoxes.

Negative probability The second difficulty stems from the fact that the
Klein-Gordon equation is a second-order differential equation in time and requires
two initial conditions $\varphi(t = 0)$ and $\partial\varphi/\partial t\,(t = 0)$ to define
its solution uniquely. In other words, $\varphi$ and $\partial\varphi/\partial t$ can be
chosen arbitrarily and do not always guarantee a positive value of $\rho$. In fact,
if we calculate $j^\mu$ using the plane wave solution (2.35) we have

$$j^\mu = |N|^2 \frac{p^\mu}{m} = |N|^2 \left(\frac{E}{mc}, \frac{\boldsymbol{p}}{m}\right) \qquad (2.40)$$

from which we see that a negative energy solution also gives negative probability.
For these reasons, the Klein-Gordon equation was once abandoned historically.
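
The sign problem can be seen numerically by inserting a plane wave into the density of Eq. (2.38). The Python sketch below uses natural units ($\hbar = c = 1$), one space dimension and a central finite difference for the time derivative; the function names, the units and the sample point are assumptions made for this illustration.

```python
# Illustrative sketch: Klein-Gordon density
#   rho = (i / 2m) (phi* dphi/dt - phi dphi*/dt)   (hbar = c = 1)
# evaluated on a plane wave with E = +sqrt(p^2 + m^2) or E = -sqrt(p^2 + m^2).
# Names, units and the finite-difference step are assumptions for this example.
import cmath
import math


def plane_wave(x: float, t: float, p: float, energy: float) -> complex:
    """Plane wave exp(i (p x - E t)) with unit normalization."""
    return cmath.exp(1j * (p * x - energy * t))


def kg_density(p: float, m: float, sign: int,
               x: float = 0.3, t: float = 0.7, dt: float = 1e-6) -> float:
    """Klein-Gordon density of a plane wave; sign = +1 or -1 selects the root."""
    energy = sign * math.sqrt(p * p + m * m)
    phi = plane_wave(x, t, p, energy)
    dphi_dt = (plane_wave(x, t + dt, p, energy)
               - plane_wave(x, t - dt, p, energy)) / (2.0 * dt)
    rho = (1j / (2.0 * m)) * (phi.conjugate() * dphi_dt - phi * dphi_dt.conjugate())
    return rho.real


if __name__ == "__main__":
    print("positive-energy density:", kg_density(3.0, 4.0, +1))
    print("negative-energy density:", kg_density(3.0, 4.0, -1))
```

For the plane wave the density is E/m in these units, so the negative-energy root gives a negative value, as stated in Eq. (2.40).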

7) c is multiplied to make the physical dimension the same as the space components.

2.2.3
Matter Field

However, we can still produce wave behavior by interpreting $\varphi$ as a classical
field. The four-dimensional current

$$q j^\mu = q(\rho c, \boldsymbol{j}) = \frac{i\hbar q}{2m}\left(\varphi^*\partial^\mu\varphi - \varphi\,\partial^\mu\varphi^*\right), \quad \partial^\mu = \left(\frac{1}{c}\frac{\partial}{\partial t}, -\nabla\right) \qquad (2.41)$$

can now be interpreted as a current (charge flow) having positive as well as negative
charges. From the classical field point of view, negative energy means negative
frequency and simply means a wave propagating in the opposite direction. We shall
see later that particles, or quanta to be exact, appear by quantizing the amplitude
of the waves. From the wave amplitude of negative frequency appear particles with
quantum number opposite to that of positive frequency, as suggested by the current
flow. Both quanta have positive energy and particles with negative energy never
appear. As it happened, history took a detour in coping with the notion of negative
energy.

The above statement is equivalent to considering matter particles on an equal
footing with the photon. Just as the photon is the quantum of the classical
electromagnetic field, we regard a matter particle as a quantum of some kind of
matter field. To see the parallelism between the matter field and the
electromagnetic field, observe that the electric field $\boldsymbol{E}$ and the magnetic field
$\boldsymbol{B}$ are expressed in terms of a scalar and a vector potential

$$\boldsymbol{E} = -\nabla\phi - \frac{\partial\boldsymbol{A}}{\partial t}, \quad \boldsymbol{B} = \nabla \times \boldsymbol{A} \qquad (2.42a)$$

$$A^\mu = \left(\frac{\phi}{c}, \boldsymbol{A}\right) \qquad (2.42b)$$

In the Lorentz gauge, the equations that the potentials satisfy are given as

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2\right)\phi = \frac{q\rho}{\epsilon_0} \qquad (2.43a)$$

$$\left(\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2\right)\boldsymbol{A} = \mu_0 q \boldsymbol{j} \qquad (2.43b)$$

where q denotes electric charge. In vacuum, where $\rho = \boldsymbol{j} = 0$, Eqs. (2.43) are
equations for a freely propagating electromagnetic field, namely, the photon.
Equations (2.43) are nothing but the Klein-Gordon equation with m = 0. Note that the
field $\varphi$ which we have introduced is a scalar field while the photon is a
vector field. The difference results in the spin of the field quantum, but does not
affect the discussion here.

The particle behavior lost by treating φ as a classical field can be recovered by
quantizing the field (a process sometimes called second quantization), just
as the photon is the quantized electromagnetic field. Quantization of the field is an
inevitable consequence of fusing quantum mechanics with the theory of relativity.8)
Since the field is a collection of an infinite number of harmonic oscillators (see the
following arguments), the quantization of the field can be achieved by quantizing
its harmonic components.

2.2.4 

Intuitive Picture of a Field and Its Quantum 

Before we jump into the mathematics of field quantization, it may help to have a 
physical image of what the field is made of. We use an analogy to visualize the field 
to help the reader to understand the otherwise formal and mathematical procedure 
of quantization. Of course, there is always a danger of stretching the analogy too 
far and applying it where it should not be. True understanding is obtained only by 
interpreting the mathematics correctly. With those caveats, I proceed and pick up 
an example from a familiar phenomenon in condensed matter physics. 

The example we describe is a metal, a form of crystal. In the metal, positively ionized
atoms are arranged on a regular lattice. A mechanical model of point masses
connected by springs is a good approximation (Fig. 2.2). Atoms in the lattice can
vibrate and transmit their vibration to neighboring atoms. Namely, the lattice of
positively ionized atoms has the property of the field we want to talk about. The
quantum of the lattice vibration is called a phonon. In the hypothetical conditions 
where the temperature is absolute zero and all the lattice vibrations come to rest, 
the electric field made by the lattice is uniform, at least at a scale larger than the 
lattice spacing, and therefore no net forces exist to act on the electrons. This is be- 
cause the direct Coulomb forces between the electrons are cancelled by negative 
Coulomb forces acting between the electron and the lattice. 

Put an electron at some space point in the metal, and its electric field would
deform the regular lattice configuration (or field) in its neighborhood. Another
electron placed at some distance would feel the field distortion. Namely, the two
electrons can exert forces on each other through coupling with the field. If the electron
is in motion, the changing field it produces can excite the lattice vibration or,
in quantum mechanics, create a phonon. The vibration can propagate through the
field and disturb the other electron, or equivalently the electron is scattered by
absorbing the phonon.

8) The field could be relativistic or nonrelativistic. In this book, we talk about only relativistic fields.

[Figure 2.2: oscillator model of a field.]

This classical treatment of the electron coupling with the lattice vibration can be 
rephrased as an interaction of the electron and the phonon. The quantum treat- 
ment of the phonon-electron interaction and that of the photon-electron interac- 
tion in particle physics are very much alike, except that a relativistic treatment is 
hardly necessary in condensed matter physics. Historically, the latter framework 
(quantum electrodynamics, QED) was established first, with subsequent applica- 
tion to condensed matter. The phonon is the quantum of the field or a wave excited 
in a real medium made of atoms, and so it cannot be identified as a real particle that 
travels in the vacuum. Suppose the medium is infinitely large, the lattice spacing 
so small that we cannot realize its discreteness by any technology we can use, and 
the lattice is invisible. Then the phonon has every reason to qualify as a real particle
flying in the vacuum. Whatever the properties of the vacuum that surrounds us
may be, we may consider it to be made of such a superlattice,9) one that has the ability
to produce waves and transmit them.

The caveats I warned about are as follows. The mechanical field model can create 
a longitudinal wave whereas the electromagnetic wave in vacuum is always trans- 
verse. The fields we deal with in particle physics have infinite degrees of freedom 
while the lattice degrees of freedom are many but finite because of the finite size 
and spacing, though for all practical purposes it can be treated as infinite. Absolute 
zero temperature cannot be realized, and the lattice is perpetually in thermal excita- 
tion. The metal environment provides electrons with a hot bath of phonons. When 
an electron travels, it is constantly scattered by the vibrating lattice or phonons. The 
net effect is a velocity-dependent resistive force, and the velocity saturates when the
resistive force cancels the local electric force.



2.2.5 

Mechanical Model of a Classical Field 

Here, we show that when the field is a collection of an infinite number of connect- 
ed harmonic oscillators distributed continuously in space, it satisfies the Klein- 
Gordon equation, which is the fundamental equation of motion for relativistic 
fields: 






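$$\left[\frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 + \mu^2\right]\varphi(t,\mathbf{x}) \equiv \left[\partial_\mu\partial^\mu + \mu^2\right]\varphi(x) = 0$$ (2.44)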



In order to simplify the mathematics, we consider a one-dimensional oscillator
(Figure 2.3) consisting of N + 1 point masses m with interval a spanning total
length D = (N + 1)a. Let $u_{n-1}$, $u_n$, $u_{n+1}$ be the displacements of the (n - 1)th,
nth and (n + 1)th mass points, respectively. Then their equation of motion is

$$m\frac{d^2 u_n}{dt^2} = T\left(\frac{u_{n+1}-u_n}{a}\right) - T\left(\frac{u_n-u_{n-1}}{a}\right)$$ (2.45)

9) We use this as a metaphor. In condensed matter physics superlattice has its own meaning.

Figure 2.3 N + 1 mass points are connected by N springs of length a. $u_{n-1}$, $u_n$, $u_{n+1}$ are
displacements of the (n - 1)th, nth and (n + 1)th points, respectively.

Here, we have assumed the springs had been prestretched to a length a to give tension
T and that the additional increase of the tension is proportional to its relative
stretch. Equation (2.45) can be derived from the Lagrangian

$$L = K - V = \sum_n\left[\frac{m}{2}\left(\frac{du_n}{dt}\right)^2 - \frac{T}{2a}\,(u_{n+1}-u_n)^2\right]$$ (2.46)

where K is the kinetic energy and V is the potential energy, by using Euler's equation

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot u_n}\right) - \frac{\partial L}{\partial u_n} = 0$$ (2.47)

where $\dot u = du/dt$.

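Making N large and keeping the tension constant, we put D = (N + 1)a, x = na,
Δx = a, and rename $u_{n-1}$, $u_n$, $u_{n+1}$ as u(x - Δx), u(x), u(x + Δx). Then the
equation becomes

$$m\frac{\partial^2 u}{\partial t^2}(x,t) = T\left[\frac{u(x+\Delta x,t)-u(x,t)}{\Delta x} - \frac{u(x,t)-u(x-\Delta x,t)}{\Delta x}\right]$$ (2.48)

Divide both sides by a = Δx, define the linear mass density σ = m/a and use
{u(x + Δx) - u(x)}/Δx ≈ ∂u(x)/∂x. Then we obtain a wave equation for a wave
propagating with velocity v:

$$\frac{1}{v^2}\frac{\partial^2 u}{\partial t^2}(x,t) = \frac{1}{\Delta x}\left[\frac{\partial u}{\partial x}(x,t) - \frac{\partial u}{\partial x}(x-\Delta x,t)\right]$$

$$\therefore\ \frac{1}{v^2}\frac{\partial^2 u}{\partial t^2}(x,t) - \frac{\partial^2 u}{\partial x^2}(x,t) = 0$$ (2.49)

$$v = \sqrt{T/\sigma}$$ (2.50)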
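The continuum limit can be checked numerically. The sketch below is not from the text: it assumes a plane-wave trial solution $u_n \propto e^{i(kna-\omega t)}$ in the discrete equation of motion (2.45), which gives the standard chain dispersion relation $\omega = 2\sqrt{T/(ma)}\,|\sin(ka/2)|$, and compares it with $\omega = vk$ for $v = \sqrt{T/\sigma}$. The function names and the numerical values are illustrative assumptions.

```python
import math

def chain_frequency(k, T, m, a):
    """Angular frequency of the plane wave u_n ~ exp(i(k*n*a - w*t)) on the discrete chain."""
    return 2.0 * math.sqrt(T / (m * a)) * abs(math.sin(0.5 * k * a))

def continuum_velocity(T, m, a):
    """Wave velocity v = sqrt(T / sigma) with linear mass density sigma = m / a."""
    sigma = m / a
    return math.sqrt(T / sigma)

# Illustrative values in arbitrary units (assumptions, not taken from the text).
T, m, a = 2.0, 0.5, 0.01
v = continuum_velocity(T, m, a)

k_long = 1.0            # k*a = 0.01: wavelength much longer than the spacing
ratio_long = chain_frequency(k_long, T, m, a) / (v * k_long)

k_edge = math.pi / a    # shortest wavelength the chain can carry
ratio_edge = chain_frequency(k_edge, T, m, a) / (v * k_edge)

print(f"v = {v:.3f}")
print(f"omega/(v k) at long wavelength: {ratio_long:.6f}")
print(f"omega/(v k) at the zone edge:   {ratio_edge:.4f}")
```

For wavelengths much longer than the spacing the ratio approaches 1, which is the regime in which the continuum wave equation holds; near the shortest wavelength the discreteness of the chain shows up as a deviation from ω = vk.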



If we consider N oscillators under the influence of gravity as in Figure 2.4, the
Lagrangian (2.46) becomes [Problem 2.5]

$$L = \sum_n\left[\frac{m}{2}\left(\frac{du_n}{dt}\right)^2 - \frac{T}{2a}\,(u_{n+1}-u_n)^2 - \frac{mg}{2\ell}\,u_n^2\right]$$ (2.51)

where g = gravitational acceleration, and in the continuum limit the equation of motion becomes

$$\frac{1}{v^2}\frac{\partial^2 u}{\partial t^2} - \frac{\partial^2 u}{\partial x^2} + \frac{\omega_0^2}{v^2}\,u = 0, \qquad \omega_0 = \sqrt{g/\ell}$$ (2.52)

The equation has exactly the same form as the Klein-Gordon equation, as one can
see by the substitution v → c, ω₀/v → μ.

Figure 2.4 Connected pendulums under the influence of gravity (ℓ = length of the pendulums).

Problem 2.5

Prove that the Lagrangian (2.46) is modified to Eq. (2.51) when the oscillators are
connected pendulums of length ℓ (Fig. 2.4). Prove, in the continuum limit, that the
equation of motion becomes Eq. (2.52).



Problem 2.6 

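Show that the total energy of N + 1 pendulums is given by

$$E = \sum_n\left[\frac{m}{2}\left(\frac{du_n}{dt}\right)^2 + \frac{T}{2a}\,(u_{n+1}-u_n)^2 + \frac{mg}{2\ell}\,u_n^2\right]$$ (2.53)

In the continuum limit, the Lagrangian is given by

$$L = \frac{T}{2}\int dx\left[\frac{1}{v^2}\left(\frac{\partial u}{\partial t}\right)^2 - \left(\frac{\partial u}{\partial x}\right)^2 - \frac{\omega_0^2}{v^2}\,u^2\right]$$ (2.54)

Redefining φ = √T u, μ = ω₀/v and recovering original 3-dimensionality, the
Lagrangian and the energy become

$$L \equiv \int d^3x\,\mathcal{L} = \iiint dx\,dy\,dz\ \frac{1}{2}\left[\frac{1}{v^2}\left(\frac{\partial\varphi}{\partial t}\right)^2 - (\nabla\varphi\cdot\nabla\varphi) - \mu^2\varphi^2\right]$$ (2.55)

$$E = \int d^3x\ \frac{1}{2}\left[\frac{1}{v^2}\left(\frac{\partial\varphi}{\partial t}\right)^2 + (\nabla\varphi\cdot\nabla\varphi) + \mu^2\varphi^2\right]$$ (2.56)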
The corresponding Hamiltonian can be obtained by

$$H = \int d^3x\,\mathcal{H} = \int d^3x\,[\,p\,\dot\varphi - \mathcal{L}\,], \qquad p = \frac{\partial\mathcal{L}}{\partial\dot\varphi}$$ (2.57)

which of course reproduces Eq. (2.56). 

Taking the limit D — >• oo and putting v — >• c reproduces the Lagrangian for the 
Klein-Gordon equation: 

$$\frac{1}{c^2}\frac{\partial^2\varphi}{\partial t^2} - \nabla^2\varphi + \mu^2\varphi = 0$$ (2.58)

The solution to the Klein-Gordon equation may be expressed as a superposition of 
plane waves (mathematically as the Fourier transform) 



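$$\varphi(t,\mathbf{x}) = \int\frac{d^3k}{(2\pi)^3}\,\tilde\varphi(\mathbf{k})\,e^{i(\mathbf{k}\cdot\mathbf{x}-\omega t)}$$ (2.59)

When the Klein-Gordon equation is quantized as described in Sect. 5.3, μ² becomes,
up to factors of ħ and c, the squared mass of the quantum.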



We have visualized the field that obeys the Klein-Gordon equation as three- 
dimensional connected springs distributed over all space. They have the following 
characteristics. 

1. A classical field is a collection of an infinite number of harmonic oscillators 
numbered by space coordinate x = (x, y, z) interacting with neighboring os- 
cillators. 

2. The amplitude difference in the discrete oscillators becomes differential in the 
continuum limit. This means an oscillator at point x interacts with neighbor- 
ing oscillators through spatial derivatives. However, as is well known in clas- 
sical mechanics, an infinite number of connected (i.e. interacting) harmonic 
oscillators is equivalent to the same number of "normal modes", which oscil- 
late independently of each other. The normal modes are plane waves, given 
in (2.59), and the interaction with the neighboring oscillators in the original 
mode is converted simply to propagation of the wave in the normal mode. An 
infinite number of interacting harmonic oscillators is equivalent to a collection 
of freely propagating waves numbered by the wave number k. In short, an alternative
way of viewing the field is to regard it as a collection of an infinite
number of harmonic oscillators numbered by the wave number k.

3. Plane waves show particle behavior when they are quantized. As their energy-momentum
is given by E = ħω, p = ħk, the mass m of the quantum is given
by

$$m = \frac{\hbar\mu}{c}$$ (2.60)

It is interesting to note the origin of the mass term μ or ω₀ in the mechanical
oscillator model. It did not exist in the medium that excites the wave. It was
given as an external force effect acting on all the oscillators uniformly. This has 
a far-reaching implication. In the Standard Model, we shall learn that all the 
fundamental particles in nature have vanishing mass originally and acquire 
finite mass through interactions with the Higgs field, which pervades the sur- 
rounding space. Namely, the mass of an elementary particle is not an intrinsic 
property of its own but is an acquired attribute as a result of environmental 
changes. This concept served as a major breakthrough in elucidating the deep 
roots of gauge theory and so reaching unified theories. 
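The equivalence between coupled oscillators and independent normal modes can be illustrated numerically. The sketch below is an assumption-laden toy, not the author's calculation: it closes the chain of pendulums into a ring (periodic boundary conditions, chosen only so that plane waves are exact eigenmodes), diagonalizes the coupling matrix, and compares the squared eigenfrequencies with the standard lattice result $\omega^2 = \omega_0^2 + (4T/ma)\sin^2(ka/2)$, $\omega_0^2 = g/\ell$. For small ka this reduces to $\omega^2 = \omega_0^2 + v^2k^2$, the Klein-Gordon dispersion relation after v → c, ω₀/v → μ. All names and numerical values are illustrative.

```python
import numpy as np

def pendulum_chain_matrix(N, T, m, a, g, ell):
    """Matrix K with d^2u/dt^2 = -K u for N coupled pendulums closed into a ring."""
    coupling = T / (m * a)
    K = np.zeros((N, N))
    for n in range(N):
        K[n, n] += 2.0 * coupling + g / ell   # two springs plus the gravity term
        K[n, (n + 1) % N] -= coupling         # right neighbor
        K[n, (n - 1) % N] -= coupling         # left neighbor
    return K

# Illustrative parameters in arbitrary units (assumptions).
N, T, m, a, g, ell = 16, 2.0, 0.5, 0.1, 9.8, 1.0

K = pendulum_chain_matrix(N, T, m, a, g, ell)
omega2_numeric = np.sort(np.linalg.eigvalsh(K))

# Allowed wave numbers on a ring of circumference N*a.
k = 2.0 * np.pi * np.arange(N) / (N * a)
omega2_formula = np.sort(g / ell + 4.0 * T / (m * a) * np.sin(0.5 * k * a) ** 2)

print("largest difference:", np.max(np.abs(omega2_numeric - omega2_formula)))
print("lowest omega^2:", omega2_numeric[0], "compared with g/l =", g / ell)
```

The lowest mode sits at ω² = g/ℓ rather than at zero: the uniform external force opens a gap in the spectrum, which is the mechanical counterpart of the mass term discussed above.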



Natural Units 

Universal constants such as ħ and c appear frequently in equations. In order to
avoid writing these constants repeatedly and risking mistakes in the expression,
we introduce "natural units". All physical quantities have dimensions. They are
expressed in terms of length, mass and time. The familiar MKS units we use are
meter (m) for the length L, kilogram (kg) for the mass M and second (s or sec) for
the time T. The dimension of any physical quantity Q is expressed as

$$[Q] = [L^a M^b T^c]$$ (2.61)

Velocity, for instance, has the dimension

$$[v] = [LT^{-1}] = \mathrm{m/s}$$ (2.62)

Similarly, the dimensions of other mechanical quantities, e.g. acceleration a, force f
and energy E, are expressed as

$$[a] = [LT^{-2}] = \mathrm{m/s^2}$$ (2.63a)

$$[f] = [MLT^{-2}] = \mathrm{kg\cdot m/s^2} = \mathrm{Newton}$$ (2.63b)

$$[E] = [ML^2T^{-2}] = \mathrm{kg\cdot m^2/s^2} = \mathrm{Joule}$$ (2.63c)

One need not, however, necessarily adopt MLT as the basic units to construct 
the dimensions of physical quantities. Any three independent physical quantities 
are equally eligible. Besides, MKS units are too large for applications in parti- 
cle physics. We may choose energy as one of the basic units and express it in 
electron volts (eV), the amount of energy gained when an electron or proton of
unit charge is accelerated by 1 V of electric potential. As the unit electric charge is
1.6022 × 10⁻¹⁹ Coulomb,

$$1\ \mathrm{eV} = 1.6022\times10^{-19}\ \mathrm{Coulomb\cdot Volt} = 1.6022\times10^{-19}\ \mathrm{Joules}$$ (2.64)
Moreover, as extensions of the electron volt,

$$1\ \mathrm{keV} = 10^{3}\ \mathrm{eV}, \quad 1\ \mathrm{MeV} = 10^{6}\ \mathrm{eV}, \quad 1\ \mathrm{GeV} = 10^{9}\ \mathrm{eV}, \quad 1\ \mathrm{TeV} = 10^{12}\ \mathrm{eV}$$ (2.65)

are also used, depending on the kind of phenomena concerned. For the time being,
we will use MeV as the standard energy unit. In quantum mechanics and relativity
theory, ħ = h/2π, where h is Planck's constant, and c, the velocity of light, frequently
appear. Adopting c, ħ and energy as the three basic units, the description of
physical quantities is simplified, as we describe now. Since



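$$\hbar = 1.055\times10^{-34}\ \mathrm{J\,s}\ (\mathrm{Joules\cdot s}) = 6.582\times10^{-22}\ \mathrm{MeV\,s}$$ (2.66a)

$$\hbar c = 1.9733\times10^{-13}\ \mathrm{MeV\,m}$$ (2.66b)

Length and time can be expressed as

$$[L] = [\hbar c]\times[E^{-1}], \qquad [T] = [\hbar]\times[E^{-1}]$$ (2.67)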

Namely, length is measured by inverse energy E⁻¹, with 1 MeV⁻¹ corresponding to 1.9733 × 10⁻¹³ m.
Time is also measured by inverse energy E⁻¹, with 1 MeV⁻¹ corresponding to 6.582 × 10⁻²² s. Now we
define natural units by putting ħ = c = 1. Then the constants ħ and c disappear
from all the equations. The velocity v becomes a dimensionless quantity measured
in units of the velocity of light, which we had to write as β = v/c previously.
Every dimensionful quantity is expressed in terms of energy or inverse energy.
The mass multiplied by c² and the momentum multiplied by c become energy, so
they are all expressed in MeV. Sometimes the mass is expressed in MeV/c², and
the momentum in MeV/c, to differentiate them from real energy. To recover the
original physical dimension, we only need to reconstruct the right dimension by
multiplying or dividing by ħ and c. For instance, the pion mass $m_\pi$ is 140 MeV, and
we talk about distance or time as $1/m_\pi$. Then



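$$\frac{1}{m_\pi} = \frac{\hbar c\,[\mathrm{MeV\cdot m}]}{140\ \mathrm{MeV}} = \frac{1.97\times10^{-13}\ \mathrm{m}}{140} = 1.4\times10^{-15}\ \mathrm{m}$$

$$\frac{1}{m_\pi} = \frac{\hbar\,[\mathrm{MeV\cdot s}]}{140\ \mathrm{MeV}} = \frac{6.58\times10^{-22}\ \mathrm{s}}{140} = 4.7\times10^{-24}\ \mathrm{s}$$

When we talk of a cross section σ being $1/m_\pi^2$, its value in MKS units is given by

$$\sigma = \frac{1}{m_\pi^2} = \left(\frac{1.97\times10^{-13}\ [\mathrm{MeV\cdot m}]}{140\ [\mathrm{MeV}]}\right)^2 = 2.0\times10^{-30}\ \mathrm{m^2}$$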
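The bookkeeping of restoring ħ and c is mechanical enough to script. The following is a minimal sketch, assuming the values of ħ and ħc quoted in Eq. (2.66) and the rounded pion mass of 140 MeV used above; the function names are hypothetical helpers, not part of any library.

```python
# Conversion constants as quoted in Eq. (2.66).
HBAR_C_MEV_M = 1.9733e-13   # hbar * c in MeV * m
HBAR_MEV_S = 6.582e-22      # hbar in MeV * s

def inverse_energy_to_length_m(energy_mev):
    """Length in meters corresponding to 1/E in natural units (multiply by hbar*c)."""
    return HBAR_C_MEV_M / energy_mev

def inverse_energy_to_time_s(energy_mev):
    """Time in seconds corresponding to 1/E in natural units (multiply by hbar)."""
    return HBAR_MEV_S / energy_mev

def inverse_energy_squared_to_area_m2(energy_mev):
    """Area in m^2 corresponding to 1/E^2 in natural units (multiply by (hbar*c)^2)."""
    return (HBAR_C_MEV_M / energy_mev) ** 2

M_PION_MEV = 140.0  # rounded pion mass used in the text

length_m = inverse_energy_to_length_m(M_PION_MEV)
time_s = inverse_energy_to_time_s(M_PION_MEV)
area_m2 = inverse_energy_squared_to_area_m2(M_PION_MEV)

print(f"1/m_pi   = {length_m:.2e} m")
print(f"1/m_pi   = {time_s:.2e} s")
print(f"1/m_pi^2 = {area_m2:.2e} m^2")
```

The same helpers apply to any quantity quoted as an inverse power of an energy: one factor of ħc per inverse energy for lengths, one factor of ħ for times.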
Problem 2.7

Show the value of the classical electron radius $r_e$ is 2.82 × 10⁻¹⁵ m, where $r_e$ is given
by $\alpha/m_e$. α = 1/137 is the fine structure constant and $m_e$, the electron mass, is
0.511 MeV.

Problem 2.8 

The neutrino changes its flavor periodically while it propagates, a phenomenon
called neutrino oscillation. The probability of a $\nu_e$ changing into $\nu_\mu$ at time t is
given by

$$P(\nu_e \to \nu_\mu : t) \propto \sin^2\frac{\Delta m^2}{4E}\,t$$ (2.70a)

$$\Delta m^2 = |m_1^2 - m_2^2|$$ (2.70b)

If Δm² is given in units of (eV)², E in MeV, the length in meters, show that the
oscillation wavelength L is given by



In electromagnetism, we have used, so far, rationalized MKS units. From now
on we will use Heaviside-Lorentz units, which put ε₀ = μ₀ = 1, and natural units
instead of MKS units.



3 

Lorentz Invariance 



In particle physics, most particles we treat are relativistic, i.e. E ≈ pc ≫ mc²,
and the theory of special relativity is an indispensable mathematical tool. A notable 
feature is that space and time mix and cannot be treated independently as in New- 
tonian mechanics. Physical objects that were considered as an independent three- 
component vector and a scalar in nonrelativistic physics mix in high-energy phe- 
nomena. They have to be combined to make a four-component Lorentz vector that 
transforms like a time and space coordinate (its difference to be exact). For their 
consistent and unified treatment, we have to rely on Einstein's theory of relativity, 
which is based on two principles. 

1. Invariance of the velocity of light, which postulates that the velocity of light 
always remains as the constant c in any inertial frame. 

2. Relativity principle, which requires covariance of the equations, namely the 
physical law should keep its form invariant in any inertial frame. In mathemat- 
ical language this amounts to the fact that physical laws have to be expressed in 
Lorentz tensors. 

The relativity principle applies to Galilei transformation and is valid in Newtoni- 
an mechanics also, but the invariance of the velocity of light necessitates Lorentz 
transformation in changing from one inertial system to another that are moving 
relative to each other with constant speed. 



Rotation Group 

Before introducing Lorentz transformation operators, it is useful first to review the
familiar rotation operator in three dimensions, which makes a group O(3). For the
definition of a group, see Appendix A. The basic operations are very similar, and
in fact the Lorentz group contains the rotation group as its subgroup. Let
$r_B = (x_B, y_B, z_B)$ be a point in a coordinate frame L that can be obtained from point $r_A$
by rotation through angle θ. Then in matrix form

$$r_B = R\,r_A$$ (3.1)

As the rotation does not change the length $r = \sqrt{x^2+y^2+z^2}$,

$$r_B^2 = r_B^T r_B = r_A^T R^T R\,r_A = r_A^2 = r_A^T r_A$$ (3.2a)

$$\therefore\ R^T R = 1$$ (3.2b)

Namely, R is a 3 × 3 orthogonal matrix. A set of three variables that transforms
similarly by rotation is called a vector and is expressed as $V = (V_x, V_y, V_z)$. As an
example, let us write down a rotation around the z-axis by angle θ (Figure 3.1a).

$$\begin{bmatrix} V'_x \\ V'_y \\ V'_z \end{bmatrix} = \begin{bmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} V_x \\ V_y \\ V_z \end{bmatrix}$$ (3.3)
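The orthogonality condition and the active-rotation convention are easy to verify numerically. The following is a small sketch using NumPy; the helper name `rotation_z` and the test vector are illustrative assumptions.

```python
import numpy as np

def rotation_z(theta):
    """Active rotation about the z-axis by angle theta (counterclockwise seen from +z)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([
        [c, -s, 0.0],
        [s, c, 0.0],
        [0.0, 0.0, 1.0],
    ])

theta = 0.3
R = rotation_z(theta)

v = np.array([1.0, 2.0, 3.0])
v_rotated = R @ v                      # active transformation: the vector itself moves

# Orthogonality R^T R = 1 is what keeps the length unchanged.
orthogonality_error = np.max(np.abs(R.T @ R - np.eye(3)))
length_change = np.linalg.norm(v_rotated) - np.linalg.norm(v)

# A quarter turn carries the x unit vector onto the y unit vector.
quarter_turn = rotation_z(np.pi / 2) @ np.array([1.0, 0.0, 0.0])

print(orthogonality_error, length_change, quarter_turn)
```

Rotating by −θ gives the transpose of R, which is the numerical counterpart of the statement that a passive rotation is the inverse of the active one.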

Here, we have defined the rotation as moving the vector from point A to point B,
referred to as active transformation. Some authors use the word rotation to mean
transformation of the coordinate frame without actually moving the vector, which
we refer to as passive transformation. Mathematically, the active rotation is equivalent
to passive rotation by angle −θ (Figure 3.1b), i.e. the passive transformation
is an inverse active transformation. We use active transformation throughout this
book. What we are concerned with is the effect of the operation on a "vector field
V(x)", a vector defined at every space coordinate x. The effect can be expressed as

$$V'(x) = D(R)\,V(R^{-1}x)$$ (3.4a)

where D(R) acts on the components of the vector. Expressed in terms of components

$$V'_i(x_k) = \sum_j R_{ij}\,V_j\Big(\sum_l [R^{-1}]_{kl}\,x_l\Big)$$ (3.4b)



To prove the above equation, let V'(x) be a new vector field obtained from the old
vector field V(x) by rotation. Both fields are defined at all space points x. Let
$V_B = V(x_B)$, $V_A = V(x_A)$ and $V'_A = V'(x_B)$. The vector $V'_A$ is obtained by rotating
$V_A$ from point $x_A$ to point $x_B$ (Figure 3.1a). Note that the vector $V_B$ is the old vector
field evaluated at $x_B$ and generally differs from $V_A$ or $V'_A$ in both magnitude and
direction. We can write down three-component vectors in factorized form, namely
as a product of their magnitude and a direction vector

$$V_A = |V_A|\,e_A, \qquad V'_A = |V'_A|\,e_B$$ (3.5)

where $e_A = V_A/|V_A|$ and $e_B = V'_A/|V'_A|$ are two base vectors of unit length
connected by the rotation. |V| is the magnitude or length of the vector. The base
vectors are specified by their direction only. Since the magnitude of the vector does
not change by rotation, but the direction of the base vector changes, we have

$$|V'_A| = |V_A| = |V(x_A)| = |V(R^{-1}x_B)|$$ (3.6a)

$$e_B = R\,e_A \quad \text{i.e.} \quad (e_B)_k = \sum_l R_{kl}\,(e_A)_l = D(R)\,e_A$$ (3.6b)

$$\therefore\ V'(x_B) = |V'_A|\,e_B = |V_A|\,R\,e_A = R\{|V(x_A)|\,e_A\} = R\,V(R^{-1}x_B) = D(R)\,V(R^{-1}x_B)$$

Rewriting $x_B$ as x, we obtain Eqs. (3.4). When the rotation angle around the z-axis
is infinitesimal, we can obtain $D(R_z)(d\theta)$ from Eq. (3.3).
$$D(R_z)(d\theta) = 1 - i\,d\theta\,S_z$$ (3.7a)

$$S_z = \begin{bmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}$$ (3.7b)

We can also extract an operator that carries out the corresponding operation on the
arguments as follows. For any function of space point x

$$f(R^{-1}x) = f(x + d\theta\,y,\ y - d\theta\,x,\ z) = f(x,y,z) + d\theta\,(y\partial_x - x\partial_y)\,f(x,y,z) = (1 - i\,d\theta\,L_z)\,f(x,y,z)$$

$$L_z = -i\,(x\partial_y - y\partial_x)$$

$L_z$ is the familiar orbital angular momentum operator and $S_z$ is the spin angular
momentum operator. Combining the operations on the coordinates and components

$$R_z(d\theta)\,V(R^{-1}x) = (1 - i\,d\theta\,J_z)\,V(x), \qquad J_z = L_z + S_z$$

Rotation through a finite angle can be obtained by repeating the infinitesimal
rotation δθ = θ/n n times and making n → ∞:

$$R_z(\theta) = \lim_{n\to\infty}\left(1 - iJ_z\frac{\theta}{n}\right)^{n} = e^{-i\theta J_z}$$

$J_z$ is called a "generator" of rotation on the z-axis. The generators for the other axes are defined
similarly.

(3.11)


They satisfy the following commutation relations:

$$[J_i, J_j] = i\,\varepsilon_{ijk}\,J_k$$ (3.12a)

$$[L_i, L_j] = i\,\varepsilon_{ijk}\,L_k$$ (3.12b)

$$\varepsilon_{ijk} = \begin{cases} +1 & ijk = \text{even permutation of 123} \\ -1 & ijk = \text{odd permutation of 123} \\ 0 & \text{otherwise} \end{cases}$$

The relation Eq. (3.12) is called a Lie algebra and $\varepsilon_{ijk}$ is called the structure constant
of the rotation group O(3). The rotation can be specified by a set of three
parameters, a unit vector n to define the rotation axis and the rotation angle θ, and is
expressed as a rotation vector θ = θn. Generally, the 3 × 3 rotation matrix has nine variables,
but they are reduced to three because of the orthogonality relation (3.2b). It
is known that the essential properties1) of the rotation group O(3) (or more generally
those of a Lie group) are completely determined by the three generators and their
commutation relations (i.e. structure constants). The rotation operator is expressed as

$$R(\mathbf{n},\theta) = e^{-i\boldsymbol{\theta}\cdot\mathbf{J}} = e^{-i\theta\,\mathbf{n}\cdot\mathbf{J}}$$
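Both the commutation relations and the "finite rotation as a limit of infinitesimal ones" statement can be checked with 3 × 3 matrices. The sketch below is an illustration under stated assumptions: it takes the spin-1 generators acting on vector components to be $(S_k)_{ij} = -i\,\varepsilon_{kij}$ (so that $1 - i\,d\theta\,S_z$ reproduces the infinitesimal form of the z-rotation matrix), and it approximates the limit n → ∞ by a large but finite n.

```python
import numpy as np

def levi_civita3():
    """Totally antisymmetric symbol eps_{ijk} with eps_{123} = +1 (0-based indices here)."""
    eps = np.zeros((3, 3, 3))
    eps[0, 1, 2] = eps[1, 2, 0] = eps[2, 0, 1] = 1.0
    eps[0, 2, 1] = eps[2, 1, 0] = eps[1, 0, 2] = -1.0
    return eps

EPS = levi_civita3()

# Assumed matrix form of the generators acting on vector components: (S_k)_{ij} = -i eps_{kij}.
S = [-1j * EPS[k] for k in range(3)]

def commutator(A, B):
    return A @ B - B @ A

def finite_rotation_z(theta, n=1_000_000):
    """Approximate exp(-i theta S_z) as (1 - i (theta/n) S_z)^n for large n."""
    step = np.eye(3) - 1j * (theta / n) * S[2]
    return np.linalg.matrix_power(step, n)

theta = 0.3
R_limit = finite_rotation_z(theta)
R_exact = np.array([
    [np.cos(theta), -np.sin(theta), 0.0],
    [np.sin(theta), np.cos(theta), 0.0],
    [0.0, 0.0, 1.0],
])

print("max deviation from the rotation matrix:", np.max(np.abs(R_limit - R_exact)))
print("[S_x, S_y] = i S_z:", np.allclose(commutator(S[0], S[1]), 1j * S[2]))
```

The deviation shrinks roughly like θ²/(2n), which is why a finite product of infinitesimal steps is only an approximation to the exponential.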



Lorentz Transformation 



3.2.1 

General Formalism 



Introducing a coordinate variable x⁰ = ct, we define two kinds of Lorentz vector

$$dx^\mu = (dx^0, dx^1, dx^2, dx^3) = (c\,dt, dx, dy, dz)$$ (3.15a)

$$dx_\mu = (dx_0, dx_1, dx_2, dx_3) = (c\,dt, -dx, -dy, -dz)$$ (3.15b)

Lorentz transformation is defined as one which keeps the Minkowski metric, the
scalar product of the two vectors defined by

$$ds^2 = (c\,dt)^2 - dx^2 - dy^2 - dz^2 = dx^\mu dx_\mu = g_{\mu\nu}\,dx^\mu dx^\nu$$ (3.16)

invariant. Here we adopted the convention (Einstein's contraction) that when the
same Greek index appears simultaneously as a superscript and a subscript, summation
from 0 to 3 is assumed. $g_{\mu\nu}$ defined by Eq. (3.16) is called the metric
tensor and is expressed in matrix form as

$$G \equiv [g_{\mu\nu}] = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{bmatrix}$$ (3.17)















Minkowski space is defined as a space having the metric (length) defined by
Eq. (3.16). In Minkowski space the inverse of the metric tensor is the same as
the metric itself, a feature which does not hold in general.2) In general relativity,
the metric tensor is a function of space-time coordinates and the Einstein field
equation is a differential equation for $g_{\mu\nu}$. Introducing a column vector x for the
vector with upper index and row vector $x^T$ for the vector with lower index, the
(homogeneous) Lorentz transformation can be expressed in the form

$$x' = Lx, \qquad x'^T = x^T M$$ (3.18)



2) The distinction of upper and lower index plays a minor role in Minkowski space. In fact, in the
[...] was used to keep the metric in the same form as the Euclidean definition
of length without distinction of upper and lower indexes. But it is essential in general relativity.
The general homogeneous Lorentz transformation includes space rotation and
constant velocity parallel transformation, called (Lorentz) boost, which are specified
by two vectors, ω and v. The magnitude of the rotation vector ω represents a rotation
angle, and its direction the rotation axis. Consider the special case of Lorentz
boost in the x direction. Here, a particle at (t, x, y, z) in a coordinate frame L is
boosted to (t', x', y', z') with velocity v. This statement is equivalent to changing to
another coordinate frame L' which is moving in the x direction at velocity −v. L' is
assumed to coincide with L at t = t' = 0. Then the two coordinates are related by

$$t \to t' = \frac{t + (v/c^2)\,x}{\sqrt{1-(v/c)^2}} \quad\Rightarrow\quad x'^0 = \gamma\,(x^0 + \beta x^1)$$ (3.19a)

$$x \to x' = \frac{x + vt}{\sqrt{1-(v/c)^2}} \quad\Rightarrow\quad x'^1 = \gamma\,(\beta x^0 + x^1)$$ (3.19b)

$$y \to y' = y \quad\Rightarrow\quad x'^2 = x^2$$ (3.19c)

$$z \to z' = z \quad\Rightarrow\quad x'^3 = x^3$$ (3.19d)

where β = v/c and γ = 1/√(1 − β²).
A combination of four variables that transforms like the coordinate $x^\mu$
(rigorously the difference of the coordinate $dx^\mu$) is called a contravariant vector and
is denoted as $A^\mu$. A covariant vector is defined as

$$A_\mu = g_{\mu\nu}A^\nu = (A_0, A_1, A_2, A_3) = (A^0, -A^1, -A^2, -A^3) \equiv (A^0, -\mathbf{A})$$ (3.21)

Conversely, the contravariant vector can be defined using the inverse of the metric
tensor

$$A^\mu = g^{\mu\nu}A_\nu = (A_0, -A_1, -A_2, -A_3)$$ (3.22)

In the nonrelativistic limit, the 0th (time) component and the space components
are separated, the time component behaving as a scalar under space rotation, hence
there is no distinction between the two types of vectors. However, under the Lorentz
transformation, the distinction has to be made. The contra- and covariant vectors in
Minkowski space can be exchanged simply by changing the sign of the space components,
an operation called parity transformation. The Lorentz transformation of
the covariant vector is expressed as

$$A'_\mu = A_\nu\,M^\nu{}_\mu$$ (3.23)

Putting $A_\mu = dx_\mu$,

$$dx'_\mu dx'^\mu = (dx^T M)(L\,dx) = dx_\mu dx^\mu = dx^T dx$$ (3.24a)

$$\therefore\ M = L^{-1}$$ (3.24b)

which means the Lorentz transformation of the covariant vector is given by the
inverse transformation. In a boost like Eq. (3.19), it is obtained simply by the substitution
v → −v.
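The statements above can be checked with explicit matrices. The following is a minimal sketch in units with c = 1, assuming the boost along x of Eq. (3.19) and the metric G = diag(1, −1, −1, −1); the helper names and the sample four-vector are illustrative.

```python
import numpy as np

G = np.diag([1.0, -1.0, -1.0, -1.0])   # metric tensor, signature (+, -, -, -)

def boost_x(beta):
    """Active boost along x with velocity beta (units with c = 1), as in Eq. (3.19)."""
    gamma = 1.0 / np.sqrt(1.0 - beta ** 2)
    L = np.eye(4)
    L[0, 0] = L[1, 1] = gamma
    L[0, 1] = L[1, 0] = gamma * beta
    return L

def interval(x):
    """Invariant x^mu x_mu for a contravariant four-vector x."""
    return float(x @ G @ x)

beta = 0.6
L = boost_x(beta)
M = np.linalg.inv(L)                    # transformation matrix for covariant vectors

x = np.array([2.0, 1.0, -0.5, 3.0])     # contravariant components (t, x, y, z)
x_prime = L @ x

x_cov = G @ x                           # covariant components (t, -x, -y, -z)
x_cov_prime = x_cov @ M                 # row vector times M, as in A'_mu = A_nu M^nu_mu

print("interval before and after:", interval(x), interval(x_prime))
print("M equals boost with v -> -v:", np.allclose(M, boost_x(-beta)))
```

Lowering the index of the boosted vector with G gives the same numbers as transforming the covariant components with M, which is the content of M = L⁻¹.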



Problem 3.1 

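Show that a boost $L_v$ in general direction (along v) is given by

$$L_v = \begin{bmatrix} \gamma & \gamma\boldsymbol\beta^T \\ \gamma\boldsymbol\beta & 1 + \hat\beta\otimes\hat\beta\,(\gamma-1) \end{bmatrix} = \begin{bmatrix} \gamma & \beta_x\gamma & \beta_y\gamma & \beta_z\gamma \\ \beta_x\gamma & \hat\beta_x^2\gamma + \hat\beta_y^2 + \hat\beta_z^2 & \hat\beta_x\hat\beta_y(\gamma-1) & \hat\beta_x\hat\beta_z(\gamma-1) \\ \beta_y\gamma & \hat\beta_y\hat\beta_x(\gamma-1) & \hat\beta_x^2 + \hat\beta_y^2\gamma + \hat\beta_z^2 & \hat\beta_y\hat\beta_z(\gamma-1) \\ \beta_z\gamma & \hat\beta_z\hat\beta_x(\gamma-1) & \hat\beta_z\hat\beta_y(\gamma-1) & \hat\beta_x^2 + \hat\beta_y^2 + \hat\beta_z^2\gamma \end{bmatrix}$$

where $\hat\beta = \boldsymbol\beta/|\boldsymbol\beta|$.

Hint: Separate the three-dimensional vector in the longitudinal (∥β) and the transverse
directions (⊥β)

$$\mathbf{x} = [\mathbf{x} - \hat\beta(\hat\beta\cdot\mathbf{x})] + \hat\beta(\hat\beta\cdot\mathbf{x})$$

and apply the boost to the longitudinal part.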
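A numerical cross-check of the general boost can be set up as follows. This is a sketch, not a solution of the problem: it assumes the block form with the unit vector β̂ = β/|β| in the space-space part, uses units with c = 1, and only verifies that the resulting matrix preserves the metric and reduces to the boost along x. The function name `boost` is a hypothetical helper.

```python
import numpy as np

G = np.diag([1.0, -1.0, -1.0, -1.0])

def boost(beta_vec):
    """Boost along an arbitrary direction; beta_vec is v/c as a 3-vector with |beta| < 1."""
    beta_vec = np.asarray(beta_vec, dtype=float)
    beta2 = float(beta_vec @ beta_vec)
    gamma = 1.0 / np.sqrt(1.0 - beta2)
    L = np.eye(4)
    L[0, 0] = gamma
    L[0, 1:] = gamma * beta_vec
    L[1:, 0] = gamma * beta_vec
    if beta2 > 0.0:
        n = beta_vec / np.sqrt(beta2)                 # unit vector along the boost
        L[1:, 1:] += (gamma - 1.0) * np.outer(n, n)   # only the longitudinal part is stretched
    return L

L_general = boost([0.3, -0.4, 0.5])
L_along_x = boost([0.6, 0.0, 0.0])

print("metric preserved:", np.allclose(L_general.T @ G @ L_general, G))
print("determinant:", np.linalg.det(L_general))
print(L_along_x)
```

Leaving the transverse block equal to the identity is the matrix version of the hint: only the component of x along β̂ is affected by the boost.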



3.2.2 

Lorentz Vectors and Scalars 

We define Lorentz vectors and Lorentz scalars, which will appear frequently in
quantum field theory. An important physical variable is the proper time τ. It is the
time measured by an observer in the observer's rest frame.

Proper Time  The proper time $d\tau \equiv dt\sqrt{1-\beta^2}$ is a Lorentz-invariant scalar.

Proof: $ds^2 = (c\,dt)^2 - dx^2 - dy^2 - dz^2 = c^2dt^2\{1 - [(dx/dt)^2 + (dy/dt)^2 + (dz/dt)^2]/c^2\} = (c\,dt)^2(1-\beta^2) = (c\,d\tau)^2$
is Lorentz invariant by definition. □

Since a vector multiplied or divided by a scalar is also a vector, it immediately
follows that the variables in Table 3.1 are Lorentz vectors.



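Table 3.1

Name                        Contravariant vector                                              Covariant vector
Coordinate difference       $dx^\mu$                                                          $dx_\mu$                                                         (3.27a)
Velocity                    $u^\mu = dx^\mu/d\tau = (\gamma c, \gamma\mathbf{v})$             $u_\mu = dx_\mu/d\tau$                                           (3.27b)
Acceleration                $a^\mu = du^\mu/d\tau$                                            $a_\mu = du_\mu/d\tau$                                           (3.27c)
Energy-momentum             $p^\mu = m u^\mu = (E/c, \mathbf{p})$                             $p_\mu = (E/c, -\mathbf{p})$                                     (3.27d)
Wave vector                 $k^\mu = (\omega/c, \mathbf{k})$                                  $k_\mu = (\omega/c, -\mathbf{k})$                                (3.27e)
Current                     $j^\mu = \rho_0 u^\mu = (\rho c, \mathbf{j})$ a)                  $j_\mu = (\rho c, -\mathbf{j})$                                  (3.27f)
Electromagnetic potential   $A^\mu = (\phi/c, \mathbf{A})$                                    $A_\mu = (\phi/c, -\mathbf{A})$                                  (3.27g)
Differential operator       $\partial^\mu = \partial/\partial x_\mu = (\frac{1}{c}\frac{\partial}{\partial t}, -\nabla)$ b)   $\partial_\mu = \partial/\partial x^\mu = (\frac{1}{c}\frac{\partial}{\partial t}, \nabla)$   (3.27h)

a) $\rho_0$ is the (charge) distribution in its rest frame.
b) Notice the minus sign of the space components.

Notice, space components of the four-velocity $u^\mu$ differ from conventional ones
by the factor γ.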
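As a numerical illustration of the entries for velocity and energy-momentum, the sketch below builds the four-velocity $u^\mu = (\gamma c, \gamma\mathbf{v})$ and checks the invariants $u\cdot u = c^2$ and $p\cdot p = (mc)^2$. The helper names and the sample mass and velocity are assumptions chosen for illustration.

```python
import math

C = 299_792_458.0  # speed of light in m/s

def four_velocity(v):
    """Four-velocity (gamma*c, gamma*v) for an ordinary 3-velocity v given in m/s."""
    vx, vy, vz = v
    beta2 = (vx * vx + vy * vy + vz * vz) / C ** 2
    gamma = 1.0 / math.sqrt(1.0 - beta2)
    return (gamma * C, gamma * vx, gamma * vy, gamma * vz)

def minkowski_dot(a, b):
    """Scalar product a^mu b_mu with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

u = four_velocity((0.6 * C, 0.0, 0.0))      # gamma = 1.25 for beta = 0.6

mass = 2.0                                   # arbitrary mass in kg (illustrative)
p = tuple(mass * component for component in u)

print("u.u / c^2      =", minkowski_dot(u, u) / C ** 2)
print("p.p / (m c)^2  =", minkowski_dot(p, p) / (mass * C) ** 2)
print("u^1 / v        =", u[1] / (0.6 * C))
```

The last line shows the factor γ by which the space components of the four-velocity exceed the conventional velocity.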

Problem 3.2 

Prove that (3.27e), (3.27g), (3.27h) are Lorentz vectors.

Scalar Product  If the scalar product of a given Lorentz vector $A^\mu$ and a four-component
variable $B_\mu$ satisfies the equation

$$A'^\mu B'_\mu = A^\mu B_\mu = A_\mu B^\mu = A'_\mu B'^\mu = \text{constant}$$ (3.28)

then $B_\mu$ is a Lorentz vector. We shall denote the scalar product as $A\cdot B \equiv A_\mu B^\mu$.3)
Proof:

$$A'^\mu B'_\mu = L^\mu{}_\rho A^\rho B'_\mu = A^\rho B_\rho \qquad \therefore\ B'_\mu L^\mu{}_\rho = B_\rho$$

Multiplying by $M^\rho{}_\sigma = [L^{-1}]^\rho{}_\sigma$ from the right

$$B'_\mu L^\mu{}_\rho M^\rho{}_\sigma = B'_\mu\,\delta^\mu_\sigma = B'_\sigma = B_\rho M^\rho{}_\sigma$$

which shows $B_\sigma$ is a covariant vector. □

By making scalar products with two vectors, it is shown that variables in Table 3.2
are Lorentz-invariant quantities.
Table 3.2

Name                                  Scalar
Einstein's relation                   $p^2 = p^\mu p_\mu = (E/c)^2 - \mathbf{p}^2 = (mc)^2$                                      (3.29a)
d'Alembertian                         $\partial_\mu\partial^\mu = \frac{1}{c^2}\frac{\partial^2}{\partial t^2} - \nabla^2 \equiv \Box$   (3.29b)
Phase of a wave                       $k\cdot x = k_\mu x^\mu = \omega t - \mathbf{k}\cdot\mathbf{x}$                             (3.29c)
Current conservation law              $\partial_\mu j^\mu = \frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{j} = 0$          (3.29d)
Phase space volume                    $\theta(p^0)\,\delta(p^2 - (mc)^2)\,d^4p = \frac{c\,d^3p}{2E}$                              (3.29e)
Volume element in Minkowski space     $d^4x = dx^0dx^1dx^2dx^3$                                                                  (3.29f)

Equation (3.29f) can be proved by calculating the Jacobian of the transformation:

$$d^4x' = \left|\frac{\partial x'}{\partial x}\right| d^4x = |\det L|\,d^4x = d^4x \quad [\text{See Eq. (3.36)}]$$ (3.30)



Comment 3.1 In high-energy experiments, where E ≈ pc, the production cross section
is often expressed in approximately Lorentz-invariant form

$$\frac{d^3\sigma}{d^3p/E} \simeq \frac{1}{E}\frac{d^2\sigma}{dE\,d\Omega}$$



Space Inversion and Time Reversal 

Since the Lorentz transformation keeps the metric ds² invariant in any inertial
coordinate frame,

$$ds^2 = g_{\mu\nu}\,dx^\mu dx^\nu = g_{\mu\nu}\,dx'^\mu dx'^\nu = g_{\mu\nu}\,L^\mu{}_\rho L^\nu{}_\sigma\,dx^\rho dx^\sigma$$

$$= g_{\rho\sigma}\,L^\rho{}_\mu L^\sigma{}_\nu\,dx^\mu dx^\nu$$ (3.32)

$$\therefore\ g_{\mu\nu} = g_{\rho\sigma}\,L^\rho{}_\mu L^\sigma{}_\nu$$

where in the second line, indices were interchanged since it does not make any
difference when summation is carried out. It can be rewritten in matrix representation
as

$$G = L^T G L$$ (3.33)

Multiplying by $G^{-1}$ from the left

$$1 = G^{-1}L^T G L = [G L G^{-1}]^T L \quad\Rightarrow\quad L^{-1} = [G L G^{-1}]^T$$ (3.34)

where we have used the relations $G^T = G$, $[G^{-1}]^T = G^{-1}$. From Eq. (3.34)

$$\det L^{-1} = \det G\,\det L\,\det G^{-1}$$ (3.35)

Using $\det L^{-1} = 1/\det L$, we obtain

$$\det L = \pm 1$$ (3.36)

We see there are two kinds of Lorentz transformation, depending on the sign of
det L. Taking the (0-0) component of Eq. (3.32),

$$1 = (L^0{}_0)^2 - \sum_{i=1}^{3}(L^i{}_0)^2 \quad\Rightarrow\quad |L^0{}_0| \ge 1$$

The transformation with $L^0{}_0 \ge 1$ is called orthochronous and the one with $L^0{}_0 \le -1$
is called non-orthochronous, meaning "does not change" or "changes" the time direction,
respectively. The Lorentz transformation can be classified into four classes depending on
the sign of det L and $L^0{}_0$, and that with det L > 0 and $L^0{}_0$ > 0 is called the proper
Lorentz transformation $L_p$.

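The proper Lorentz transformation becomes a non-transformation or identity
transformation in the limit when the transformation parameter becomes 0, namely
it is continuously connected with L = 1. It includes boost as well as rotation R in
three-dimensional space. The rotation is expressed as

$$L = \begin{bmatrix} 1 & 0 \\ 0 & R \end{bmatrix}$$

Defining P, T and PT by

$$P = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{bmatrix}, \quad T = \begin{bmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad PT = \begin{bmatrix} -1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{bmatrix}$$

then $\det L_p = \det(PT) = 1$, det P = det T = −1. P is called space inversion or parity
transformation, T time reversal and PT total reflection. These transformations are discrete
and are not connected smoothly with the identity transformation. Any Lorentz
transformation can be expressed as a combination of the proper transformation
with the three discrete transformations and needs six independent parameters (velocity
and rotation vector) to specify it.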
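The four classes can be told apart mechanically from det L and the sign of $L^0{}_0$. The following is a small sketch under the conventions used here (metric diag(1, −1, −1, −1), P flipping the space components, T flipping the time component); the function names are hypothetical.

```python
import numpy as np

G = np.diag([1.0, -1.0, -1.0, -1.0])
P = np.diag([1.0, -1.0, -1.0, -1.0])    # space inversion
T = np.diag([-1.0, 1.0, 1.0, 1.0])      # time reversal

def is_lorentz(L, tol=1e-10):
    """True if L preserves the metric, i.e. L^T G L = G."""
    return np.allclose(L.T @ G @ L, G, atol=tol)

def classify(L):
    """Return (sign of det L, orthochronous flag) for a Lorentz matrix L."""
    if not is_lorentz(L):
        raise ValueError("matrix does not preserve the Minkowski metric")
    det_sign = 1 if np.linalg.det(L) > 0 else -1
    orthochronous = bool(L[0, 0] > 0)    # |L^0_0| >= 1, so the sign decides
    return det_sign, orthochronous

classes = {
    "identity": classify(np.eye(4)),
    "P": classify(P),
    "T": classify(T),
    "PT": classify(P @ T),
}
for name, result in classes.items():
    print(name, result)
```

Only the class with positive determinant and positive $L^0{}_0$ contains the identity, which is why it alone can be reached continuously from L = 1.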
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Covariant Formalism 



3.4.1 
Tensors 



Products of two vectors transform under Lorentz transformation as 

A f "B"' = L f ' p L" a A p B a (3.40a) 

A/B v ' = M p u M" v A p B a (3.40b) 

A f "B v ' = L f ' p M" v A p B a (3.40c) 

and make contravariant, covariant and mixed tensors of rank 2, which are called the 
direct product of two vectors. More generally, any set of 16 variables that behaves 
in the same way as the product of two vectors is called a rank 2 Lorentz tensor. The 
concept can be extended to arbitrary rank r 

T f,v '" po ... ~ A fl B v ---P p Q a --- product of r vectors (3.41) 

Here, ~ means "transforms like". By multiplying a tensor by <5£, g,,„ or g p " and 
making a contraction (summation of the same upper and lower indices), the rank 
r tensor can be converted to a new tensor of rank r — 2: 

d p T^"- pa ... = T"'~ 

T""" pff ... (rankr)^ g^T f """ pa - = T"" p 

g p "T p "- pa ... = T'f>" 
= rank r — 2 tensor 

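We introduce another useful tensor, called the Levi-Civita tensor, defined as

$$\epsilon^{\mu\nu\rho\sigma} =
\begin{cases}
\pm 1, & \mu\nu\rho\sigma = \text{even (odd) permutation of } 0123 \\
0, & \text{any pair of } \mu\nu\rho\sigma \text{ are equal}
\end{cases}
\tag{3.43}$$

$$\epsilon^{0123} = -\epsilon_{0123} = 1$$

The Levi-Civita tensor is a Lorentz tensor that transforms to itself because

$$\epsilon'^{\mu\nu\rho\sigma} = L^{\mu}{}_{a} L^{\nu}{}_{b} L^{\rho}{}_{c} L^{\sigma}{}_{d}\, \epsilon^{abcd} = \epsilon^{\mu\nu\rho\sigma} \det L = \epsilon^{\mu\nu\rho\sigma} \tag{3.44}$$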
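The invariance of the Levi-Civita tensor rests on the identity that contracting four copies of L with it yields det L times the same tensor. A small numerical sketch of this, with hypothetical helpers `levi_civita4`, `boost_x` and `rot_z` that are not part of the text:

```python
import itertools
import numpy as np


def levi_civita4():
    """Rank-4 array with eps[0, 1, 2, 3] = +1, antisymmetric in all indices."""
    eps = np.zeros((4, 4, 4, 4))
    for perm in itertools.permutations(range(4)):
        # The parity of a permutation equals the parity of its inversion count.
        inversions = sum(
            perm[i] > perm[j] for i in range(4) for j in range(i + 1, 4)
        )
        eps[perm] = -1.0 if inversions % 2 else 1.0
    return eps


def boost_x(eta):
    """Boost along x with rapidity eta."""
    L = np.eye(4)
    L[0, 0] = L[1, 1] = np.cosh(eta)
    L[0, 1] = L[1, 0] = np.sinh(eta)
    return L


def rot_z(theta):
    """Rotation about the z axis by angle theta."""
    L = np.eye(4)
    L[1, 1] = L[2, 2] = np.cos(theta)
    L[1, 2] = -np.sin(theta)
    L[2, 1] = np.sin(theta)
    return L


def transform(eps, L):
    """eps'^{mnrs} = L^m_a L^n_b L^r_c L^s_d eps^{abcd}."""
    return np.einsum("ma,nb,rc,sd,abcd->mnrs", L, L, L, L, eps)


eps = levi_civita4()
proper = boost_x(0.4) @ rot_z(0.9)          # det = +1
parity = np.diag([1.0, -1.0, -1.0, -1.0])   # det = -1
```

For the proper transformation `transform(eps, proper)` reproduces `eps`, while for parity it gives `-eps`, which is why the invariance statement holds for proper transformations with det L = 1.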
3.4.2
Covariance 



When equations are made of tensors of the same rank, they keep the same form in
any inertial frame. For example,

$$A_{\mu} B^{\mu\nu} = C^{\nu} \;\Rightarrow\; A'_{\mu} B'^{\mu\nu} = C'^{\nu} \tag{3.45b}$$

They are said to be (Lorentz) covariant. Under the transformation the values of both
sides of an equation may change, but its functional form is kept invariant. Note that
Lorentz covariance is not to be confused with Lorentz invariance in the strict sense,
which means that the value itself does not change under Lorentz transformation and
which applies only to Lorentz scalars. Many authors, however, including myself, often
use Lorentz invariance to mean Lorentz covariance. The principle of relativity
requires covariance under transformations. In special relativity, the transformation
is the Lorentz transformation; in general relativity it is the general coordinate
transformation.

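To get used to the covariant formalism, let us take a look at the familiar
Maxwell equations:

$$\nabla \cdot E = \frac{q\rho}{\epsilon_0}, \qquad \nabla \times B - \frac{1}{c^2}\frac{\partial E}{\partial t} = \mu_0\, q\, j \tag{3.46a}$$

$$\nabla \cdot B = 0, \qquad \nabla \times E + \frac{\partial B}{\partial t} = 0 \tag{3.46b}$$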
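The electric field E and the magnetic field B can be expressed in terms of
potentials:

$$E = -\nabla\phi - \frac{\partial A}{\partial t}, \qquad B = \nabla \times A \tag{3.47}$$

Since $\partial^{\mu}$ and $A^{\mu} = (\phi/c, A)$ are both Lorentz vectors, Equation (3.47) may be
rewritten as an antisymmetric tensor of rank 2

$$F^{\mu\nu} = \partial^{\mu} A^{\nu} - \partial^{\nu} A^{\mu} =
\begin{bmatrix}
0 & -E_1/c & -E_2/c & -E_3/c \\
E_1/c & 0 & -B_3 & B_2 \\
E_2/c & B_3 & 0 & -B_1 \\
E_3/c & -B_2 & B_1 & 0
\end{bmatrix}
\tag{3.48}$$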
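This means that the electromagnetic fields look like two sets of vectors but actually
are different components of a single second-rank tensor. Using $F^{\mu\nu}$, the Maxwell
equations (3.46) can be expressed in two sets of tensor equations

$$\partial_{\mu} F^{\mu\nu} = \mu_0\, q\, j^{\nu}, \qquad j^{\nu} = (c\rho, j) \tag{3.49a}$$

$$\partial_{\mu} F_{\nu\lambda} + \partial_{\nu} F_{\lambda\mu} + \partial_{\lambda} F_{\mu\nu} = 0, \qquad (F_{\mu\nu} = g_{\mu\rho}\, g_{\nu\sigma} F^{\rho\sigma}) \tag{3.49b}$$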




Problem 3.3 

Confirm Eq. (3.48) and Eq. (3.49). 

Problem 3.4 

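Prove that a dual tensor defined by

$$\tilde{F}^{\mu\nu} = \frac{1}{2}\epsilon^{\mu\nu\rho\sigma} F_{\rho\sigma}$$

exchanges $E \to B$, $B \to -E$, and Eq. (3.49b) can be rewritten as

$$\partial_{\mu} \tilde{F}^{\mu\nu} = 0$$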



From the definition of the electromagnetic tensor $F^{\mu\nu}$, the Maxwell equation
without sources (3.49b) is actually an identity. Namely, Eq. (3.46b) is
equivalent to saying that the electromagnetic potential $A^{\mu}$ is a Lorentz vector. In
Eq. (3.49), the two Maxwell equations are written as rank 1 and rank 3 tensor
equations. In this form, the Lorentz covariance of the Maxwell equations is quite
evident. Historically the converse held: the Maxwell equations were derived first,
then the Lorentz transformation was devised to make the equations covariant, which
led Einstein to special relativity.

Problem 3.5 

By its construction, $f^{\mu} = q F^{\mu\nu} u_{\nu}$, where $u^{\mu}$ is defined in Eq. (3.28b), is a
contravariant vector. Show that in the nonrelativistic limit its space component reduces
to the Lorentz force $F = q(E + v \times B)$ and hence $f^{\mu}$ is its relativistic extension.
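As a numerical illustration of the four-force, the sketch below builds the field tensor from E and B and contracts it with the covariant four-velocity. It assumes units with c = 1, the metric diag(1, -1, -1, -1) and u = γ(1, v); the function names and the sample field values are hypothetical.

```python
import numpy as np

g = np.diag([1.0, -1.0, -1.0, -1.0])  # metric, assumed signature (+, -, -, -)


def field_tensor(E, B):
    """Contravariant F^{mu nu} for fields E, B in units with c = 1."""
    E1, E2, E3 = E
    B1, B2, B3 = B
    return np.array(
        [
            [0.0, -E1, -E2, -E3],
            [E1, 0.0, -B3, B2],
            [E2, B3, 0.0, -B1],
            [E3, -B2, B1, 0.0],
        ]
    )


def four_force(q, E, B, v):
    """f^mu = q F^{mu nu} u_nu with u^mu = gamma (1, v)."""
    v = np.asarray(v, dtype=float)
    gamma = 1.0 / np.sqrt(1.0 - v @ v)
    u_up = gamma * np.concatenate(([1.0], v))
    u_down = g @ u_up  # lower the index
    return q * field_tensor(E, B) @ u_down


# Sample values chosen only for illustration.
q = 1.0
E = np.array([0.2, -0.1, 0.4])
B = np.array([0.0, 0.3, -0.5])
v = np.array([1e-3, 2e-3, -1e-3])
gamma = 1.0 / np.sqrt(1.0 - v @ v)
f = four_force(q, E, B, v)
```

In this convention the space part of `f` equals qγ(E + v × B) and the time part equals qγ v·E, so for small velocities the space part approaches the Lorentz force.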



3.4.3 

Supplementing the Time Component 

In constructing a covariant equation, sometimes it happens that we know only the 
space component of a four-vector; the time (zeroth) component is unknown. We did 
not know the time component of the Lorentz force, but the four-vector defined in 
Problem 3.5 naturally supplements it. In the following we show two more examples 
to explicitly construct the yet unknown component of the four-vectors. 

Example 3.1: Newton's Second Law 

Here, we construct a four-vector $f^{\mu}$ which represents a force whose space
component, in the nonrelativistic limit, reduces to F that appears in Newton's second law.
Referring to the list of Lorentz vectors in Table 3.1, a Lorentz-covariant extension of
the second law may be expressed as

$$m\frac{du^{\mu}}{d\tau} = f^{\mu}, \qquad f^{\mu} = (f^{0}, f) = (f^{0}, \gamma F) \tag{3.52}$$

It is straightforward to show that the space component of Eq. (3.52) reproduces
Newton's second law.

To derive the time component of the four-vector $f^{\mu}$, we differentiate the Lorentz
scalar $u_{\mu} u^{\mu}$, which is a constant, and use Eq. (3.52) to obtain

$$m\, u_{\mu}\frac{du^{\mu}}{d\tau} = m\left(u^{0}\frac{du^{0}}{d\tau} - u \cdot \frac{du}{d\tau}\right) = u^{0} f^{0} - u \cdot f = 0,
\qquad \therefore\; f^{0} = \frac{f \cdot u}{u^{0}} = \gamma\,\frac{F \cdot v}{c}$$

Then the time component of Eq. (3.52) becomes

$$\frac{d}{dt}\left(m\gamma c^{2}\right) = F \cdot v$$

The right hand side of the equation is the rate of energy increase, and the above
relation led Einstein to derive the famous relation $E = mc^{2}$.
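The orthogonality of four-velocity and four-force that fixes the time component can be checked directly. A minimal sketch, assuming units with c = 1 so that the four-velocity is γ(1, v) and the four-force is (γ F·v, γ F); the helper names and sample numbers are illustrative.

```python
import numpy as np


def gamma_of(v):
    """Lorentz factor for a velocity vector v in units with c = 1."""
    return 1.0 / np.sqrt(1.0 - np.dot(v, v))


def four_velocity(v):
    """u^mu = gamma (1, v)."""
    return gamma_of(v) * np.concatenate(([1.0], v))


def four_force(F, v):
    """f^mu = (gamma F.v, gamma F): the time component is fixed by u.f = 0."""
    gam = gamma_of(v)
    return np.concatenate(([gam * np.dot(F, v)], gam * np.asarray(F)))


def minkowski_dot(a, b):
    """a.b with metric diag(1, -1, -1, -1)."""
    return a[0] * b[0] - np.dot(a[1:], b[1:])


v = np.array([0.3, -0.4, 0.5])   # sample velocity, |v| < 1
F = np.array([1.0, 2.0, -0.5])   # sample three-force
u = four_velocity(v)
f = four_force(F, v)
```

Here `minkowski_dot(u, u)` evaluates to 1 and `minkowski_dot(u, f)` to 0 for any choice of F and any subluminal v.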



Example 3.2: Equation of Spin Precession* 

We derive here the spin precession frequency of a particle moving in a
homogeneous electromagnetic field. Although the quantum mechanical motion should be
derived using the Dirac equation (see Chap. 4) with inclusion of a Pauli term, the
expectation value of the spin operator is known to follow the same time dependence
as that obtained from a classical equation of motion [47, 216]. The spin
operator is an antisymmetric tensor $S^{\mu\nu}$ (see Sect. 3.5), but a formulation in terms
of a covariant (axial) four-vector is much more convenient and we will try to
construct it. We define the spin four-vector $S^{\mu}$, which reduces to the conventional
spin $S^{\mu}_{R} = (0, s)$ in the particle's rest frame (R). 4) It obeys the equation of motion

$$\frac{ds}{d\tau} = \mu\, s \times B, \qquad q = \pm e \tag{3.56}$$

where $\mu = gq\hbar/2m$ ($\hbar$ and c omitted in natural units) is the magnetic moment
of the particle and also defines the g-factor. Denote the particle's four-velocity by
$u^{\mu} = (u^{0}, u) = \gamma(1, v)$, which obeys the equation of motion

$$m\frac{du^{\mu}}{d\tau} = qF^{\mu\nu}u_{\nu} = q\,(u \cdot E,\; u^{0}E + u \times B) \tag{3.57}$$

where we used Eq. (3.52) and the Lorentz force $qF^{\mu\nu}u_{\nu}$ supplied by the
electromagnetic field. To obtain the zeroth component $S^{0}$ of the spin vector, we consider
the Lorentz scalar $u \cdot S$, which vanishes in the rest frame and hence in any frame. Then

$$S \cdot u = S^{0}u^{0} - \mathbf{S} \cdot \mathbf{u} = 0 \;\rightarrow\; S^{0} = v \cdot \mathbf{S} \tag{3.58}$$

The equation of motion for $S^{0}$ in the rest frame is readily obtained:

$$\frac{dS^{0}}{d\tau} = \frac{dv}{d\tau} \cdot \mathbf{S} + v \cdot \frac{d\mathbf{S}}{d\tau} = \frac{q}{m}\, s \cdot E \tag{3.59}$$

Eq. (3.56) and Eq. (3.59) may be combined and generalized to a Lorentz covariant
form: 5)

$$\frac{dS^{\mu}}{d\tau} = \mu\left[F^{\mu\nu}S_{\nu} + \left(S_{\rho}F^{\rho\sigma}u_{\sigma}\right)u^{\mu}\right] - \left(S_{\nu}\frac{du^{\nu}}{d\tau}\right)u^{\mu}
= g\frac{q}{2m}F^{\mu\nu}S_{\nu} + \left(\frac{g-2}{2}\right)\frac{q}{m}\left(S_{\rho}F^{\rho\sigma}u_{\sigma}\right)u^{\mu} \tag{3.60a}$$

$$F^{\mu\nu}S_{\nu} = (\mathbf{S} \cdot E,\; S^{0}E + \mathbf{S} \times B) = (s \cdot E,\; s \times B) \ \text{in the rest frame} \tag{3.60b}$$

$$S_{\mu}F^{\mu\nu}u_{\nu} = S^{0}(u \cdot E) - u^{0}(\mathbf{S} \cdot E) - \mathbf{S} \cdot (u \times B) = -s \cdot E \ \text{in the rest frame} \tag{3.60c}$$

where (3.57) is used in obtaining the second expression in Eq. (3.60a). Equation (3.60a)
is the relativistic equation of motion of the spin vector. It can easily be shown that
Eq. (3.60a) reduces to Eq. (3.56) and Eq. (3.59) in the rest frame if one makes use of
Eq. (3.60b) and (3.60c). For experimental use, we are interested in the spin precession
frequency. The formula Eq. (3.60a) is used to determine a precise value of the g − 2 factor
of the muon, which will be treated in more detail in Sect. 8.2.2.

4) Actually, $S^{\mu}$ is the dual of the tensor $S^{\mu\nu}$ in the sense that $S^{\mu} = (1/2c)\,\epsilon^{\mu\nu\rho\sigma}u_{\nu}S_{\rho\sigma}$, where $u^{\mu}$ is the particle's four-velocity.

5) We assume the equation is linear in the spin $S^{\mu}$ and the external field $F^{\mu\nu}$.

3.4.4 
Rapidity 

Successive Lorentz boosts in the same direction are represented by a single boost
where the transformation velocity is given by

$$\beta'' = \frac{\beta + \beta'}{1 + \beta\beta'} \tag{3.61}$$

Proof: Assume velocity $v'$ in frame L is observed as $v''$ in frame L', where the
frame L' is traveling in the x direction with $-v$ in frame L. Space-time coordinates
$(t', x'^{1})$ are expressed in terms of $(t, x^{1})$ using the transformation of Eq. (3.19). We
omit other coordinates for simplicity:

$$x^{0\prime} = \gamma\,(x^{0} + \beta x^{1}) \tag{3.62a}$$

$$x^{1\prime} = \gamma\,(\beta x^{0} + x^{1}) \tag{3.62b}$$

$$\beta' = \frac{v'}{c} = \frac{dx^{1}}{dx^{0}} \tag{3.62c}$$

Then

$$\beta'' = \frac{dx^{1\prime}}{dx^{0\prime}} = \frac{\gamma\,(\beta\, dx^{0} + dx^{1})}{\gamma\,(dx^{0} + \beta\, dx^{1})} = \frac{\beta + \beta'}{1 + \beta\beta'} \tag{3.63}$$

Equation (3.63) shows that the velocity is not an additive parameter, i.e. it is
nonlinear, in successive transformations. A more convenient additive parameter, the
"rapidity", can be defined by

$$\beta = \tanh\eta, \quad \text{or} \quad \eta = \frac{1}{2}\ln\frac{1+\beta}{1-\beta} \tag{3.64}$$

Problem 3.6

Show that the rapidity is additive, i.e. $\eta'' = \eta + \eta'$.

Using the rapidity, a Lorentz transformation with finite $\eta$ can be decomposed
into N successive transformations with rapidity $\Delta\eta = \eta/N$, which we discuss
more in the following section. Solving for $\beta$, $\gamma$ in terms of $\eta$ we have

$$\beta = \tanh\eta, \quad \gamma = \cosh\eta, \quad \beta\gamma = \sinh\eta \tag{3.65}$$

The Lorentz boost (3.19) can be rewritten as

$$x^{0\prime} = \cosh\eta\; x^{0} + \sinh\eta\; x^{1} \tag{3.66a}$$

$$x^{1\prime} = \sinh\eta\; x^{0} + \cosh\eta\; x^{1} \tag{3.66b}$$

Compare this with rotation in the x-y plane:

$$x' = x\cos\theta - y\sin\theta \tag{3.67a}$$

$$y' = x\sin\theta + y\cos\theta \tag{3.67b}$$

As Eq. (3.66) can be obtained from Eq. (3.67) by the substitution $\theta \to -i\eta$, $x \to ix^{0}$,
$y \to x^{1}$, the Lorentz boost (in the x direction) is formally a rotation by angle $-i\eta$
in the x and imaginary time ($ix^{0}$) plane.
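The additivity of rapidity can be verified with a few lines of arithmetic. This sketch uses only the velocity addition rule of Eq. (3.63) and the definition of Eq. (3.64); the function names are illustrative.

```python
import math


def add_velocities(beta1, beta2):
    """Relativistic composition of collinear velocities, Eq. (3.63)."""
    return (beta1 + beta2) / (1.0 + beta1 * beta2)


def rapidity(beta):
    """eta = (1/2) ln((1 + beta) / (1 - beta)), Eq. (3.64)."""
    return 0.5 * math.log((1.0 + beta) / (1.0 - beta))


beta1, beta2 = 0.6, 0.7
beta_total = add_velocities(beta1, beta2)   # stays below 1
eta_total = rapidity(beta_total)
eta_sum = rapidity(beta1) + rapidity(beta2)
```

The two rapidities `eta_total` and `eta_sum` agree to floating-point precision, and `math.cosh(rapidity(beta))` reproduces the Lorentz factor 1/sqrt(1 - beta**2), consistent with Eq. (3.65).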
Comment 3.2 In high-energy collider experiments, produced secondary particles are
boosted in the z direction (along the incoming particle axis). The boosted angular
distribution is better expressed as a rapidity distribution; $dN/d\eta \approx \text{const.}$ is observed
experimentally. At high energies, each particle has $E \approx pc$, $p_{\parallel} = p\cos\theta$, and its rapidity is
approximated by the so-called pseudo-rapidity

$$\eta' = \frac{1}{2}\ln\frac{1+\beta_{\parallel}}{1-\beta_{\parallel}} = \frac{1}{2}\ln\frac{E + p_{\parallel}c}{E - p_{\parallel}c} \approx -\ln\tan\frac{\theta}{2} \tag{3.68}$$

This fact is taken into account in designing detectors, which are divided into modules
that span the same solid angle in the $\eta$-$\phi$ (azimuthal angle) plane.
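To see how good the pseudo-rapidity approximation is, one can compare it with the exact rapidity for a sample particle. The sketch below assumes units with c = 1; the choice of a charged pion with mass 0.1396 GeV, momentum 10 GeV and polar angle 0.5 rad is purely illustrative.

```python
import math


def rapidity_along_z(energy, p_parallel):
    """Exact rapidity (1/2) ln((E + p_par) / (E - p_par)), units with c = 1."""
    return 0.5 * math.log((energy + p_parallel) / (energy - p_parallel))


def pseudorapidity(theta):
    """High-energy limit of the rapidity: -ln tan(theta / 2)."""
    return -math.log(math.tan(theta / 2.0))


mass = 0.1396      # GeV, illustrative value (charged pion)
momentum = 10.0    # GeV
theta = 0.5        # polar angle in radians

energy = math.sqrt(momentum**2 + mass**2)
y = rapidity_along_z(energy, momentum * math.cos(theta))
eta = pseudorapidity(theta)
```

For a strictly massless particle the two expressions coincide; for the sample pion they differ by less than one part in a thousand of a unit of rapidity.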



In order to obtain a Lorentz transformation operator that will act on a field $\psi$,
where $\psi$ can have multiple components, the Lorentz operation on the field can be
defined similarly to rotation [see Eq. (3.4)]

$$\psi(x) \;\xrightarrow{L}\; \psi'(x) = D(L)\,\psi(L^{-1}x) \tag{3.69}$$

where Lx means, for a small boost,

$$x^{0\prime} = x^{0} + \Delta\eta\, x^{1}, \qquad x^{1\prime} = x^{1} + \Delta\eta\, x^{0} \tag{3.70}$$

where we have considered a boost only in the x direction for simplicity.

First we consider Lorentz transformation operations on the scalar field. Here only
the coordinate variables are affected:

$$\psi'(x) = \psi(L^{-1}x) = \psi(x^{0} - \Delta\eta\, x^{1},\; x^{1} - \Delta\eta\, x^{0})
= \psi(x^{0}, x^{1}) - \Delta\eta\,(x^{1}\partial_{0}\psi + x^{0}\partial_{1}\psi) = (1 - i\Delta\eta\, N_{1})\,\psi(x) \tag{3.71}$$

As a transformation with finite $\eta$ can be achieved by N successive infinitesimal
transformations with $\Delta\eta = \eta/N$,

$$\psi'(x) = \lim_{N\to\infty}\left\{1 - i(\eta/N)N_{1}\right\}^{N}\psi = \exp(-i\eta N_{1})\,\psi \tag{3.72}$$

$N_{1}$ is called a generator of the Lorentz boost and is defined by the formula

$$N_{1} = -i\,(x^{0}\partial_{1} + x^{1}\partial_{0}) = i\,(x^{0}\partial^{1} - x^{1}\partial^{0}) \tag{3.73a}$$

Similarly, considering boosts in the $x^{2}$ and $x^{3}$ directions

$$N_{2} = i\,(x^{0}\partial^{2} - x^{2}\partial^{0}) \tag{3.73b}$$

$$N_{3} = i\,(x^{0}\partial^{3} - x^{3}\partial^{0}) \tag{3.73c}$$
As is well known, orbital angular momentum operators, which are generators of
space rotation, are given as

$$L_{i} = -i\,(x^{j}\partial_{k} - x^{k}\partial_{j}) = i\,(x^{j}\partial^{k} - x^{k}\partial^{j}), \qquad (i, j, k) = (1, 2, 3) \text{ and cyclic} \tag{3.73d}$$

It is easily verified that $N_{i}$ and $L_{i}$ satisfy the following commutation relations:

$$[L_{i}, L_{j}] = i\,\epsilon_{ijk} L_{k} \tag{3.74a}$$

$$[N_{i}, N_{j}] = -i\,\epsilon_{ijk} L_{k} \tag{3.74b}$$

$$[N_{i}, L_{j}] = i\,\epsilon_{ijk} N_{k} \tag{3.74c}$$

$$\epsilon_{ijk} =
\begin{cases}
+1, & ijk = \text{even permutation of } 123 \\
-1, & ijk = \text{odd permutation of } 123 \\
0, & \text{otherwise}
\end{cases}
\tag{3.74d}$$


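Now define Lorentz generators $M^{\mu\nu}$ and Lorentz variables $\omega_{\mu\nu}$ by

$$M^{\mu\nu} = i\,(x^{\mu}\partial^{\nu} - x^{\nu}\partial^{\mu}) \tag{3.75a}$$

$$M^{\mu\nu} = -M^{\nu\mu}, \qquad \omega_{\mu\nu} = -\omega_{\nu\mu} \tag{3.75b}$$

$$(M^{23}, M^{31}, M^{12}) = (L_{1}, L_{2}, L_{3}) \tag{3.75c}$$

$$(M^{01}, M^{02}, M^{03}) = (N_{1}, N_{2}, N_{3}) \tag{3.75d}$$

$$(\omega_{23}, \omega_{31}, \omega_{12}) = (\theta_{1}, \theta_{2}, \theta_{3}) \tag{3.75e}$$

$$(\omega_{01}, \omega_{02}, \omega_{03}) = (\eta_{1}, \eta_{2}, \eta_{3}) \tag{3.75f}$$

Then the general Lorentz transformation operators that act on coordinate variables
can be written as

$$L \equiv \exp\left[-i(N \cdot \eta) - i(L \cdot \theta)\right] = \exp\left(-\frac{i}{2}\,\omega_{\mu\nu} M^{\mu\nu}\right) \tag{3.76}$$

Next we consider Lorentz transformations on multi-component fields and include
spin operators. Then a general Lorentz generator is expressed as

$$M^{\mu\nu} = i\,(x^{\mu}\partial^{\nu} - x^{\nu}\partial^{\mu}) + S^{\mu\nu} \tag{3.77a}$$

$$(S^{23}, S^{31}, S^{12}) = (S_{1}, S_{2}, S_{3}) \tag{3.77b}$$

$$(S^{01}, S^{02}, S^{03}) = (K_{1}, K_{2}, K_{3}) \tag{3.77c}$$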
where S and K satisfy the same commutation relations as L and N in Eq. (3.74).
In the following we consider a Lorentz transformation of the vector fields. 6) It can
be expressed as

$$V^{\mu}(x) \;\rightarrow\; V'^{\mu}(x) = L^{\mu}{}_{\nu}\, V^{\nu}(L^{-1}x) \tag{3.78}$$

(a) By inspecting Eq. (3.66) and Eq. (3.67) and generalizing to other directions, show
that the infinitesimal proper Lorentz transformation with $\theta = \delta\theta$, $\eta = \delta\eta$ acting on
vector components is given by



(b) Show also that



(c) Show that Eq. (3.32) alone constrains $\epsilon_{\mu\nu}$ to be antisymmetric.

As the Lorentz vector transforms in the same way as the space-time coordinates,
we can derive the spin part of the Lorentz generator $S^{\mu\nu}$ by using Eq. (3.66) or
(3.79). Let us derive $K_{1}$, the Lorentz boost generator in the x direction. Writing

$$L^{\mu}{}_{\nu} = \left[1 - i\,\delta\eta \cdot K - i\,\delta\theta \cdot S\right]^{\mu}{}_{\nu} \tag{3.80}$$

and comparing with the infinitesimal form of Eq. (3.66), we can derive $K_{1}$ as

$$K_{1} = \begin{bmatrix} 0 & i & 0 & 0 \\ i & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{bmatrix}$$

Other generators are obtained similarly.

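Problem 3.8

Show

$$K_{1} = \begin{bmatrix} 0 & i & 0 & 0 \\ i & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{bmatrix}, \quad
K_{2} = \begin{bmatrix} 0 & 0 & i & 0 \\ 0 & 0 & 0 & 0 \\ i & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{bmatrix}, \quad
K_{3} = \begin{bmatrix} 0 & 0 & 0 & i \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ i & 0 & 0 & 0 \end{bmatrix}
\tag{3.82a}$$

of particles with half-integer spin, see Appendix A.

$$S_{1} = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & -i \\ 0 & 0 & i & 0 \end{bmatrix}, \quad
S_{2} = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & i \\ 0 & 0 & 0 & 0 \\ 0 & -i & 0 & 0 \end{bmatrix}, \quad
S_{3} = \begin{bmatrix} 0 & 0 & 0 & 0 \\ 0 & 0 & -i & 0 \\ 0 & i & 0 & 0 \\ 0 & 0 & 0 & 0 \end{bmatrix}
\tag{3.82b}$$

Show also that $K_{i}$ and $S_{i}$ satisfy the same commutation relations, Eq. (3.74), as $N_{i}$
and $L_{i}$.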
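The commutation relations of the 4 × 4 boost and rotation generators can be confirmed numerically. This sketch builds the matrices from the index rule that K_i has entries i at positions (0, i) and (i, 0), and S_k has entries -i ε_{kij} at position (i, j); this rule is an assumption chosen to match the vector representation described in the text, and the helper names are illustrative.

```python
import numpy as np


def unit(i, j):
    """4x4 matrix with a single 1 at row i, column j."""
    m = np.zeros((4, 4), dtype=complex)
    m[i, j] = 1.0
    return m


def eps3(i, j, k):
    """Three-index Levi-Civita symbol for indices in {1, 2, 3}."""
    return (i - j) * (j - k) * (k - i) / 2


idx = (1, 2, 3)
# Boost generators: K_i couples the time component with the i-th space component.
K = {i: 1j * (unit(0, i) + unit(i, 0)) for i in idx}
# Rotation generators: (S_k)_{ij} = -i eps_{kij} on the space-space block.
S = {k: sum(-1j * eps3(k, i, j) * unit(i, j) for i in idx for j in idx) for k in idx}


def comm(a, b):
    return a @ b - b @ a


def check_algebra():
    """Check [S,S] = i eps S, [K,K] = -i eps S, [K,S] = i eps K for all index pairs."""
    ok = True
    for i in idx:
        for j in idx:
            eS = sum(eps3(i, j, k) * S[k] for k in idx)
            eK = sum(eps3(i, j, k) * K[k] for k in idx)
            ok = ok and np.allclose(comm(S[i], S[j]), 1j * eS)
            ok = ok and np.allclose(comm(K[i], K[j]), -1j * eS)
            ok = ok and np.allclose(comm(K[i], S[j]), 1j * eK)
    return bool(ok)
```

Calling `check_algebra()` returns `True`. The minus sign in the boost-boost commutator is what distinguishes the Lorentz group from a four-dimensional rotation group: two boosts in different directions do not commute and their commutator is a rotation.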

















So far we have considered only the homogeneous Lorentz transformation. However,
the reader should be aware that the Minkowski metric Eq. (3.16) is invariant under
a more general transformation known as the Poincaré transformation:

$$x'^{\mu} = L^{\mu}{}_{\nu}\, x^{\nu} + a^{\mu}, \qquad x' = Lx + a, \qquad L = [L^{\mu}{}_{\nu}] = \frac{\partial x'^{\mu}}{\partial x^{\nu}} \tag{3.83}$$

If we write the transformation as T(L, a), it satisfies

$$T(L', a')\,T(L, a) = T(L'L,\; L'a + a') \tag{3.84}$$

and makes a Poincaré (or inhomogeneous Lorentz) group. If we consider an
infinitesimal transformation, the transformation matrix can be written as



The infinitesimal translation can be expressed as

$$f(x^{\mu} - \epsilon^{\mu}) = f(x^{\mu}) - \epsilon^{\mu}\partial_{\mu} f(x^{\mu}) = \left[1 + i\epsilon_{\mu}(i\partial^{\mu})\right] f(x^{\mu}) = (1 + i\epsilon_{\mu}P^{\mu})\, f(x^{\mu})$$

$$P^{\mu} = i\partial^{\mu} = i\,(\partial_{0}, -\nabla) \tag{3.86}$$

The homogeneous Lorentz transformation is already given by Eq. (3.75). Therefore
the inhomogeneous Lorentz transformation operator can be written as

$$f(L^{-1}x - a) = \left(1 - \frac{i}{2}\,\omega_{\mu\nu}M^{\mu\nu} + i\epsilon_{\mu}P^{\mu}\right) f(x) \tag{3.87}$$

The finite transformation is obtained as usual by successive operation and is given
by

$$U(L, a) = \exp\left[-\frac{i}{2}\,\omega_{\mu\nu}M^{\mu\nu} + i a_{\mu}P^{\mu}\right] \tag{3.88}$$
Fields are transformed by the Poincaré group operation as

$$\psi'(x) = D(L, a)\,\psi(L^{-1}x - a) = U(L, a)\,\psi(x) \tag{3.89}$$

where $M^{\mu\nu}$ now includes the spin operator. Definitions Eq. (3.86) and Eq. (3.75)
show that the operators $P^{\mu}$ and $M^{\mu\nu}$ satisfy the relations

$$[P^{\mu}, P^{\nu}] = 0 \tag{3.90a}$$

$$[M^{\mu\nu}, P^{\rho}] = i\,(g^{\nu\rho}P^{\mu} - g^{\mu\rho}P^{\nu}) \tag{3.90b}$$

$$[M^{\mu\nu}, M^{\rho\sigma}] = i\,(g^{\nu\rho}M^{\mu\sigma} - g^{\mu\rho}M^{\nu\sigma} - g^{\nu\sigma}M^{\mu\rho} + g^{\mu\sigma}M^{\nu\rho}) \tag{3.90c}$$

Note that the last commutation relation is the same as Eq. (3.74), but re-expressed in
terms of $M^{\mu\nu}$ instead of $L_{i}$ and $K_{i}$. They constitute a closed Lie algebra, as they
should to make a group, and thus are generators of the Poincaré group. From
Eq. (3.75) and Eq. (3.90), we see that the momentum as well as the angular
momentum operators, $J = (M^{23}, M^{31}, M^{12})$, commute with the Hamiltonian ($H = P^{0}$)
and hence are constants of motion, but the Lorentz boost operators
$K = (M^{01}, M^{02}, M^{03})$ are not. We can make two Casimir operators that commute with
all the generators:

$$m^{2} = P_{\mu}P^{\mu} \tag{3.91a}$$

$$W_{\mu}W^{\mu} = -J(J+1)\,m^{2}, \qquad W_{\mu} = -\frac{1}{2}\,\epsilon_{\mu\nu\rho\sigma}M^{\nu\rho}P^{\sigma} \tag{3.91b}$$

$W_{\mu}$ is called the Pauli-Lubanski operator. Equation (3.91) can be used to define the
mass and spin of a particle. It can be verified that

$$[P^{\mu}, W^{\nu}] = 0 \tag{3.92}$$

Therefore we can diagonalize the mutually commuting operators simultaneously,

$$P^{\mu},\; W_{3},\; P^{2},\; W^{2} \tag{3.93}$$

and use their eigenvalues to specify a one-particle state. Particle states may be
classified by the value of $m^{2}$ and $P^{\mu}$ as follows:

1. $P^{2} = m^{2} > 0$: ordinary massive particle

2. $P^{2} = m^{2} = 0$: photon, graviton, etc.

3. $P^{2} < 0$: tachyon or virtual particle

Only particles in categories 1 and 2 are physical particles and we consider only
them. For particles with $m^{2} > 0$, we can also specify their spin S. In their rest
frame, $P^{\mu} = (m, 0, 0, 0)$, $W_{i}$ becomes

$$W_{i} = -\frac{1}{2}\,\epsilon_{i\nu\rho\sigma}J^{\nu\rho}P^{\sigma} = -\frac{m}{2}\,\epsilon_{ijk0}J^{jk} = mS_{i} \tag{3.95}$$
and the state can also be specified by its spin components. However, for particles
with $m^{2} = 0$, including photons and gravitons, which always fly at the speed of
light, the notion of rotation in the rest frame cannot be applied. As

$$P_{\mu}P^{\mu} = W_{\mu}W^{\mu} = P_{\mu}W^{\mu} = 0$$

$P^{\mu}$ and $W^{\mu}$ are orthogonal and both are on the light cone. So if $W^{\mu}$ is a real vector,
it means they are proportional:

$$W^{\mu} = \lambda P^{\mu} \tag{3.97}$$

Therefore, the particle state with $m^{2} = 0$ is characterized by one number, which
is the proportionality constant $W^{\mu}/P^{\mu}$ and has the dimension of angular momentum. It is
called helicity ($\lambda = J_{3}$ along the momentum direction). With the parity operation
(if conserved), the helicity can be flipped. It can be shown that $J_{3} = \pm|J|$. The
photon has J = 1 and its interaction preserves parity. Therefore the photon can
have $J_{3} = \pm 1$, but a state with $J_{3} = 0$ is not allowed, which means the photon is a
quantum of a transverse wave. A massless particle can thus have only two
states, specified by its helicity. For the case of a spin 1/2 massless particle, which
is known as the Weyl particle, we shall give an explicit expression of the state in
Sect. 4.1.2.

In summary, in the framework of special relativity, a particle state can be
specified by its energy-momentum, its spin value s, and spin component $s_{z}$. If the
particle is massive, $s_{z}$ has 2s + 1 components, but if it is massless, $s_{z}$ has
only two components.



4 

Dirac Equation 



4.1

Relativistic Schrödinger Equation

4.1.1 

Dirac Matrix 

We have seen in Chap. 2, that the relativistic wave equation encounters problems, 
such as negative probability and negative energy, if we try to interpret the wave 
function as a probability amplitude. The difficulties can be avoided by interpret- 
ing it as an equation for a (classical) field. Historically, a relativistic version of the 
Schrodinger equation without negative probability was first obtained by Dirac. It 
also turned out to be the basic wave equation for the spin 1/2 particle. Since quarks 
and leptons, the fundamental particles of matter, have spin 1/2, we have to under- 
stand the structure of the spin 1/2 field before we quantize the Dirac fields. For the 
moment, we follow Dirac's argument to derive a relativistic Schrodinger equation. 
His heuristic argument helps in understanding the physical meaning of otherwise 
mathematical equations. Dirac's logic was as follows. 

1. The negative probability originates in the fact that the Klein-Gordon equa- 
tion contains second time derivatives. Therefore, the equation must be a linear 
function of the first time derivative. Relativistic covariance requires that space 
derivatives have to be linear, too. 

2. The wave function $\psi$ should have more than one component, because the
equation of motion generally includes second derivatives and they can only be
included by introducing coupled multicomponent equations. 1) If we define the
probability as

$$\rho = \sum_{r}\psi_{r}^{*}\psi_{r} = \psi^{\dagger}\psi,$$

$\rho$ stays positive definite.

3. The Klein-Gordon equation (i.e. the Einstein relation $E^{2} = p^{2} + m^{2}$) has to be
satisfied.

Applying (1) and (2), the equation must be of the form

$$i\frac{\partial\psi}{\partial t} = (\alpha \cdot p + \beta m)\,\psi \tag{4.3}$$

Here, $\alpha_{i}$ and $\beta$ are matrices, p is a differential operator but becomes a c-number
when we apply it to a plane wave $\psi(x) = Ae^{-ipx} = Ae^{-iEt + ip \cdot x}$. Since any wave
can always be decomposed into plane waves, we shall use (E, p) interchangeably
between differential operators and c-numbers unless there is confusion between
the two:

$$p_{\mu} = (E, -p) \Leftrightarrow i\partial_{\mu}, \qquad p^{\mu} = (E, p) \Leftrightarrow i\partial^{\mu} = \left(i\frac{\partial}{\partial t},\; -i\nabla\right) \tag{4.4}$$

1) This can be understood, for instance, by considering the equation $\partial^{2}q/\partial t^{2} = -\omega^{2}q$, which includes a
second derivative. By introducing $q_{1} = q$, $q_{2} = -i(\partial q/\partial t)/\omega$, it can be converted to two
simultaneous equations, $dq_{1}/dt = i\omega q_{2}$, $dq_{2}/dt = i\omega q_{1}$. In matrix form, they can be
combined to a single linear equation:
$\dfrac{d}{dt}\begin{bmatrix} q_{1} \\ q_{2} \end{bmatrix} = \begin{bmatrix} 0 & i\omega \\ i\omega & 0 \end{bmatrix}\begin{bmatrix} q_{1} \\ q_{2} \end{bmatrix}$

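The Schrödinger equation (4.3) can be expressed in the form

$$E\psi = (\alpha \cdot p + \beta m)\,\psi \tag{4.5}$$

Statement (3) requires

$$E^{2} = p^{2} + m^{2} = \sum_{i,j=1}^{3}\frac{1}{2}(\alpha_{i}\alpha_{j} + \alpha_{j}\alpha_{i})\,p^{i}p^{j} + m\sum_{i=1}^{3}(\beta\alpha_{i} + \alpha_{i}\beta)\,p^{i} + \beta^{2}m^{2} \tag{4.6a}$$

$$\therefore\quad \alpha_{i}^{2} = \beta^{2} = 1, \qquad \alpha_{i}\alpha_{j} + \alpha_{j}\alpha_{i} = 0 \ (i \neq j), \qquad \beta\alpha_{i} + \alpha_{i}\beta = 0 \tag{4.6b}$$

Matrices that satisfy Eq. (4.6b) are called Dirac matrices. Their dimension has to be
an even number.

Proof:

(a) Eigenvalues of $\alpha_{i}$, $\beta$ are ±1.

$$\because\quad \alpha_{i}\phi = \lambda\phi \;\rightarrow\; \alpha_{i}^{2}\phi = \phi = \lambda^{2}\phi, \quad \therefore\ \lambda = \pm 1 \tag{4.7}$$

(b) $\alpha_{i}$, $\beta$ are traceless.

$$\because\quad \mathrm{Tr}[\alpha_{i}] = \mathrm{Tr}[\alpha_{i}\beta^{2}] = \mathrm{Tr}[\beta\alpha_{i}\beta] = -\mathrm{Tr}[\alpha_{i}\beta^{2}] = -\mathrm{Tr}[\alpha_{i}] \tag{4.8}$$

The second equality follows because a cyclic order change does not affect the trace
value and the third equality uses the anticommutativity of $\alpha_{i}$ and $\beta$. The trace is the
sum of the eigenvalues, so (a) and (b) together restrict the dimension to be an even number. □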
4.1.2

Weyl Spinor

For $m = 0$, $\alpha_{i} = \pm\sigma_{i}$ satisfy the Dirac condition (4.6b), where $\sigma_{i}$ are the Pauli matrices:

$$i\frac{\partial\psi}{\partial t} = \pm\sigma \cdot (-i\nabla)\,\psi \tag{4.9}$$

$$\sigma_{1} = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad
\sigma_{2} = \begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}, \quad
\sigma_{3} = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}
\tag{4.10}$$

This particular form is called the Weyl equation. We shall consider a plane-wave
solution of the type $\psi(x) \sim u(p)\,e^{-ipx}$, where u(p) is a two-component wave
function. We distinguish two solutions $\phi_{L}$ and $\phi_{R}$, corresponding to different signs of
the Weyl equation (4.9). They satisfy

$$E\phi_{L} = -\sigma \cdot p\,\phi_{L}, \qquad E\phi_{R} = +\sigma \cdot p\,\phi_{R} \tag{4.11a}$$

$$\frac{\sigma \cdot p}{|p|}\phi_{L} = -\frac{E}{|p|}\phi_{L}, \qquad \frac{\sigma \cdot p}{|p|}\phi_{R} = +\frac{E}{|p|}\phi_{R} \tag{4.11b}$$

$$h = \frac{\sigma \cdot p}{|p|} = \sigma \cdot \hat{p} \tag{4.12}$$

is a spin projection operator in the direction of motion and is called a helicity
operator; $\hat{p}$ is a unit vector in the direction of p. Classically, the helicity eigenstate with
h = (+) represents a moving state rotating clockwise 2) and the particle is said to
be right handed. Similarly, the helicity (−) state is left handed.
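The helicity operator for an arbitrary direction of motion can be built explicitly from the Pauli matrices. The sketch below assumes the usual polar-angle parametrization of the unit vector, (sin θ cos φ, sin θ sin φ, cos θ); the function name `helicity_operator` is illustrative.

```python
import numpy as np

# Pauli matrices
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def helicity_operator(theta, phi):
    """h = sigma . p_hat for p_hat pointing in the direction (theta, phi)."""
    n = (
        np.sin(theta) * np.cos(phi),
        np.sin(theta) * np.sin(phi),
        np.cos(theta),
    )
    return sum(ni * si for ni, si in zip(n, sigma))


theta, phi = 0.8, 2.1   # arbitrary sample direction
h = helicity_operator(theta, phi)
eigenvalues, eigenvectors = np.linalg.eigh(h)   # ascending order: -1, then +1
```

The eigenvalues are -1 and +1 for every direction, and h squares to the identity, which is the two-component counterpart of the statement that the helicity can only be (+) or (−).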

Problem 4.1 

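Let p be in the direction $(\theta, \phi)$. Then the helicity operator is expressed as

$$h = \sigma \cdot \hat{p} = \begin{bmatrix} \cos\theta & \sin\theta\, e^{-i\phi} \\ \sin\theta\, e^{i\phi} & -\cos\theta \end{bmatrix} \tag{4.13}$$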

Show that eigenstates %± of the helicity operator are expressed as 






As m = 0, \p\/E = ±1. Choosing %— for L gives 



2) The orbital; 

viewed frorr 
attributes. 
In other words, the solution $\phi_{L}$ to the Weyl equation is a pure helicity state of a
particle but flipped for its antiparticle (E < 0). The exact definition of an antiparticle
will be given in the next section. We shall use it to denote the negative energy
particle. $\phi_{L}$ does not contain a $\chi_{+}$ component for the particle and no $\chi_{-}$ component
for the antiparticle. We have now justified the use of L/R for the subscript. We call
it the chirality (−) state, an extended helicity concept applicable to the antiparticle.
It will be discussed in more detail in Sect. 4.3.5. $\phi_{R}$ is defined similarly. $\phi_{L}$ and $\phi_{R}$
are solutions of different equations and hence are particles independent of each
other. They are called Weyl particles.

However, the two equations (4.11a), and hence $\phi_{L}$ and $\phi_{R}$, interchange under a parity
transformation:

$$x \leftrightarrow -x, \qquad p \leftrightarrow -p \tag{4.16}$$

Therefore the solutions cannot be applied to particles that have strong or
electromagnetic interactions, as they are known to conserve parity. The Weyl solution
could describe the neutrino, which interacts only weakly apart from the universal
gravity. 3) To describe particles in the real world, we have to combine the two
solutions. Let us extend $\psi$ from a two-component to a four-component wave function and
define

$$\psi = \begin{bmatrix} \phi_{L} \\ \phi_{R} \end{bmatrix}, \qquad
\alpha = \begin{bmatrix} -\sigma & 0 \\ 0 & \sigma \end{bmatrix}, \qquad
\beta = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}
\tag{4.17}$$

$\beta$ is a matrix to exchange $\phi_{R}$ and $\phi_{L}$. The above $\alpha$ and $\beta$ satisfy the Dirac condition
(4.6b) and so the equation using $\alpha$ and $\beta$ defined above satisfies all the
requirements for describing relativistic particles having finite mass. It is called the Dirac
equation. Note that the way to determine $\alpha$ and $\beta$ is not unique. Different $\alpha'$ and
$\beta'$ defined by a transformation matrix T

$$\alpha' = T\alpha T^{-1}, \qquad \beta' = T\beta T^{-1} \tag{4.18}$$

satisfy the same Dirac conditions. The matrix form in Eq. (4.17) is called the Weyl
representation. Another representation commonly used in many textbooks is the
Pauli-Dirac representation, which is given by

$$\alpha = \begin{bmatrix} 0 & \sigma \\ \sigma & 0 \end{bmatrix}, \qquad
\beta = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}
\tag{4.19}$$
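The Dirac conditions and the equivalence of representations can be checked numerically. The sketch below assumes the standard block forms: α = diag(−σ, σ) with off-diagonal β for the Weyl representation, and off-diagonal α with β = diag(1, −1) for the Pauli-Dirac representation. The particular matrix T used to connect them is one possible choice made for illustration, not one given in the text.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]


def block(a, b, c, d):
    """4x4 matrix assembled from four 2x2 blocks."""
    return np.block([[a, b], [c, d]])


# Weyl representation (assumed standard form)
alpha_weyl = [block(-s, Z2, Z2, s) for s in sigma]
beta_weyl = block(Z2, I2, I2, Z2)

# Pauli-Dirac representation (assumed standard form)
alpha_pd = [block(Z2, s, s, Z2) for s in sigma]
beta_pd = block(I2, Z2, Z2, -I2)


def satisfies_dirac_condition(alpha, beta):
    """alpha_i^2 = beta^2 = 1 and all distinct pairs anticommute."""
    one = np.eye(4)
    ok = np.allclose(beta @ beta, one)
    for i, a in enumerate(alpha):
        ok = ok and np.allclose(a @ beta + beta @ a, 0)
        for j, b in enumerate(alpha):
            ok = ok and np.allclose(a @ b + b @ a, 2 * (i == j) * one)
    return bool(ok)


# One unitary T (hypothetical choice) mapping the Weyl form to the Pauli-Dirac form.
T = block(I2, I2, -I2, I2) / np.sqrt(2)
T_inv = T.conj().T
```

Both sets of matrices pass `satisfies_dirac_condition`, all of them are traceless, and conjugating the Weyl matrices with `T` reproduces the Pauli-Dirac ones, in line with the statement that the physics does not depend on the representation.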

The physics content does not depend on which representation is used, but it is
easier to proceed with an explicit representation. In this book we use the Weyl
representation unless otherwise noted because it diagonalizes the energy-momentum
term and is more suitable for describing relativistic ($E, p \gg m$) particles. The
Pauli-Dirac representation diagonalizes the mass term and is convenient for
description in the nonrelativistic energy region. Let us proceed. The Dirac equation
is given by

$$i\frac{\partial\psi}{\partial t} = (-i\alpha \cdot \nabla + \beta m)\,\psi \;\Leftrightarrow\; E\psi = [\alpha \cdot p + \beta m]\,\psi \tag{4.20}$$

3) However, the phenomenon of neutrino oscillation, discovered in 1998 [357], is evidence of a
finite mass of the neutrino, hence the possibility of the neutrino being a Weyl particle

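Now we try to construct a probability current by assigning

$$j^{\mu}(x) = (\rho(x), j(x)) \tag{4.21a}$$

$$\rho(x) = \psi^{\dagger}(x)\psi(x), \qquad j(x) = \psi^{\dagger}(x)\,\alpha\,\psi(x) \tag{4.21b}$$

Using the Dirac equation (4.20)

$$\frac{\partial\rho}{\partial t} = \frac{\partial\psi^{\dagger}}{\partial t}\psi + \psi^{\dagger}\frac{\partial\psi}{\partial t} = -i\left\{-(H\psi)^{\dagger}\psi + \psi^{\dagger}H\psi\right\}$$
$$= -i\left[-\left\{(-i\alpha \cdot \nabla + \beta m)\psi\right\}^{\dagger}\psi + \psi^{\dagger}(-i\alpha \cdot \nabla + \beta m)\psi\right] \tag{4.22}$$
$$= -\nabla \cdot (\psi^{\dagger}\alpha\psi) = -\nabla \cdot j$$

so the current defined above respects the equation of continuity. Since $\rho$ is always
positive definite, the interpretation of $\psi$ as a probability amplitude wave function
is possible. Dirac seems to have succeeded in constructing a relativistic version of
the Schrödinger equation.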

Problem 4.2 

Consider Eq. (4.20) as expressed in the Heisenberg picture. Using the Heisenberg
equation of motion

$$\frac{dO}{dt} = i[H, O] \tag{4.23}$$

and setting O = x, show that $dx/dt = \alpha$.

Therefore $\alpha$ can be considered as a velocity operator. This also means $j^{\mu} = (\rho, j)$
can be interpreted as a classical four-component current $(\rho, \rho v)$.

Problem 4.3

Prove that $H = \alpha \cdot p + \beta m$ does not commute with L but does with J, where

$$L = x \times p = x \times (-i\nabla), \qquad J = L + S = L + \frac{1}{2}\Sigma, \qquad \Sigma = \begin{bmatrix} \sigma & 0 \\ 0 & \sigma \end{bmatrix} \tag{4.24}$$

S satisfies the commutation relation of the angular momentum. It can be
considered as a four-component version of the spin operator with s = 1/2.
Problem 4.4

Show that the four-component helicity operator $h = \Sigma \cdot \hat{p}$ commutes with the
Dirac Hamiltonian.

Therefore, the helicity eigenstate is also an energy eigenstate of the Hamiltonian, 
and its quantum number can be used to specify the quantum state of the spin 1/2 
particle. 
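That the helicity, but not an arbitrary spin component, is conserved can be seen from a quick numerical commutator check. The sketch below assumes the Weyl representation with α = diag(−σ, σ), off-diagonal β, and Σ = diag(σ, σ); the momentum and mass values are arbitrary sample numbers.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

# Weyl representation (assumed standard form)
alpha = [np.block([[-s, Z2], [Z2, s]]) for s in sigma]
beta = np.block([[Z2, I2], [I2, Z2]])
Sigma = [np.block([[s, Z2], [Z2, s]]) for s in sigma]


def dirac_hamiltonian(p, m):
    """H = alpha . p + beta m for a c-number momentum p."""
    return sum(pi * a for pi, a in zip(p, alpha)) + m * beta


def helicity(p):
    """h = Sigma . p_hat."""
    n = np.asarray(p, dtype=float) / np.linalg.norm(p)
    return sum(ni * s for ni, s in zip(n, Sigma))


def commutator(a, b):
    return a @ b - b @ a


p = np.array([0.3, -1.2, 0.5])   # sample momentum, not along z
m = 0.7                          # sample mass
H = dirac_hamiltonian(p, m)
```

The commutator of `helicity(p)` with `H` vanishes, whereas the commutator of `Sigma[2]` with `H` does not when the momentum has components transverse to the z axis.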

4.1.3 

Interpretation of the Negative Energy 

Spin of the Dirac Particle 

There still remains the negative energy problem, but first we will try to understand
what kind of freedom the wave function possesses. In the limit of $p \to 0$, the Dirac
equation, Eq. (4.20), becomes

$$E\psi = i\frac{\partial\psi}{\partial t} = m\begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}\psi \tag{4.25}$$

Solutions are easy to find:



where $\xi_{r}$ is a two-component spin wave function

$\psi_{\pm,r}$ are eigenstates of the Hamiltonian as well as the four-component spin
operator S defined in Eq. (4.24). Namely,



$$S_{3}\,\psi_{\pm,+} = \frac{1}{2}\,\psi_{\pm,+}, \qquad S_{3}\,\psi_{\pm,-} = -\frac{1}{2}\,\psi_{\pm,-} \tag{4.28b}$$

We now have proved that the Dirac equation describes a particle of spin 1/2, but 
also has a freedom corresponding to the sign of the energy. The negative energy 
state seems unavoidable. 
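The doubling of states by the sign of the energy shows up directly in the spectrum of the Dirac Hamiltonian. A minimal sketch, assuming the Weyl representation with α = diag(−σ, σ) and off-diagonal β; the mass and momentum values are arbitrary.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]
alpha = [np.block([[-s, Z2], [Z2, s]]) for s in sigma]   # assumed Weyl form
beta = np.block([[Z2, I2], [I2, Z2]])


def dirac_spectrum(p, m):
    """Sorted eigenvalues of H = alpha . p + beta m."""
    H = sum(pi * a for pi, a in zip(p, alpha)) + m * beta
    return np.linalg.eigvalsh(H)


m = 0.511                           # sample mass
p = np.array([0.4, 0.0, -0.9])      # sample momentum
E = np.sqrt(p @ p + m**2)

at_rest = dirac_spectrum(np.zeros(3), m)   # eigenvalues -m, -m, +m, +m
moving = dirac_spectrum(p, m)              # eigenvalues -E, -E, +E, +E
```

Each sign of the energy appears twice, once for each spin orientation, so the four components correspond to two spin states times two signs of the energy.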

Consequences of the Dirac Equation 

Dirac applied his equation to the problem of energy levels in the hydrogen atom.
By solving the Dirac equation including a static Coulomb potential, he obtained
the energy spectrum given in Fig. 4.1 and explained the fine and hyperfine
structure of the energy levels, which were all degenerate according to the nonrelativistic
Schrödinger equation. 4) He also investigated the property of the electron in a
magnetic field and succeeded in explaining the g = 2 value of the moment, which is
described in Sect. 4.3. By his great success, he must have convinced himself of the
correctness of his equation. Then the negative energy puzzle must correspond to
some physical reality and its apparently contradictory property should have some
explanation. The contradiction stems from the fact that if a particle is in a negative
energy state, it can lose further energy by emitting photons and fall down without
limit, and hence cannot be stable.

4) Actually Dirac did not calculate the energy levels of hydrogen when he published the paper on
his equation. Later, in his Oppenheimer memorial talk in 1969, he confessed that he did not do
it because he was afraid his equation might be disproved by detailed calculations. The author is
grateful to Professor T. Kubota for pointing out this historically interesting anecdote.

Figure 4.1 Low-lying energy spectrum of the hydrogen atom. Degenerate states with
the principal quantum number n are separated by spin-orbit coupling (fine structure)
and spin-spin coupling (hyperfine structure). The Lamb shift is due to a higher order
QED effect. The diagram is not drawn to scale. Note that the hyperfine splitting of
$1S_{1/2}$ corresponds to an energy splitting of $5.89 \times 10^{-6}$ eV = 1420 MHz = 21 cm radio
wave, which is extensively used in radio astronomy to probe the hydrogen abundance in
the universe.

Dirac considered that the vacuum must be a state where all the negative energy 
levels are occupied. Then, because of the Pauli exclusion principle, electrons cannot 
fall down to negative energy levels and thus become stable. However, the concept 
opened a new possibility. When an energy greater than twice the electron mass is 
given to the vacuum, an electron in the negative-energy sea is excited to a positive 
energy level and a hole is created in the sea (Fig. 4.2a). When an electric field is 
applied, a negative energy electron next to the hole moves in to fill the hole and 
creates another hole where it was before (Fig. 4.2b). The process is repeated and the 
net effect is that the hole moves in the opposite direction to the electron. It behaves 
as an ordinary particle with positive energy, but having charge and momentum 
opposite to the electron. Its spin component is also opposite, because when the 
hole is filled, the quantum number is that of the vacuum. Thus, Dirac predicted 
the existence of the antiparticle and that a pair creation of a particle and antiparticle 
from vacuum will result if enough energy is injected. Thus materialization of pure 
energy became a possibility, as was already implicit in Einstein's equation $E = mc^{2}$.

Figure 4.2 (a) A hole is created by exciting a negative-energy electron to a positive
energy level. (b) The hole moves in the opposite direction to the electron.



Figure 4.3 Anderson's discovery of the 
positron in a Wilson cloud chamber [21]. 
The incident charged particle in the cosmic 
rays leaves a curved track in the magnetic 
field. The particle loses energy after passing 
through a 9 mm thick lead plate, leading to a 
larger curvature (smaller radius). Therefore 
it has passed through the plate from below. 



From the orientation of the magnetic field and 
the direction of the bend, the charge of the 
particle is determined to be positive. From the 
difference of the curvature before and after the 
passage of the plate, the energy loss, hence 
the mass, of the particle can be deduced. It 
was concluded that it had much lighter mass 
than the proton. Photo from [22]. 



Dirac considered first that the antiparticle might be the proton, but it soon be- 
came clear that the antiparticle must have the same mass, same spin, but opposite 
spin component as well as the opposite magnetic moment. In 1932, C.D. Anderson
discovered the positron [21] (see Fig. 4.3), a positively charged particle that has the 
same mass as the electron, in cosmic rays using a cloud chamber and confirmed 
Dirac's theory. 



Particles Traveling in the Opposite Time Direction 

Although the existence of the antiparticle was confirmed experimentally, Dirac's hole theory cannot be applied to bosons, for which the Pauli exclusion principle does not apply.5) To avoid the use of the Pauli exclusion principle, let us take a minute to consider how an electron with negative energy behaves in the hole theory. Suppose an electron and an antiparticle having energy-momentum $p_1$ and $q_1$ are scattered into a state with $p_2$ and $q_2$ (Fig. 4.4a). The process can be expressed by

$p_1 + q_1 \to p_2 + q_2$    (4.29)

Figure 4.4 The antiparticle is (a) a hole ...



In the hole theory, a hole with $q_1$ became a hole with $q_2$, as depicted in Figure 4.4a, but this is a reinterpretation of an event where an electron with $-q_2$ was scattered into an electron with $-q_1$. Writing the equation in terms of the original electron

$p_1 + (-q_2) \to p_2 + (-q_1)$    (4.30)



Here the scattering of the electron with $(-q_2)$ precedes the filling of the hole $(-q_1)$. Equation (4.29) and Eq. (4.30) are equivalent mathematically. What actually happens is Eq. (4.29), but we can consider it as Eq. (4.30), with the negative-energy particle traveling backward in time. The interpretation is that when a negative-energy particle of energy-momentum $-p^\mu = (-E, -\mathbf{p})$, $E > 0$, travels backward in time, it is observed as an antiparticle of positive energy-momentum $p^\mu$ traveling in the normal time direction, as is depicted in Figure 4.4b. The arrow indicates the time direction.

The interpretation is valid for bosons, where no Pauli exclusion principle can be applied. Of course, in reality, there are no backward-traveling particles. This is an example of how one has to make a physically correct interpretation of a mathematical equation that exhibits an apparent contradiction. Feynman and Stückelberg abandoned the idea of the negative-energy sea and only applied the concept of backward traveling to the negative-energy particle.

5) The concept of holes is useful and is routinely used in today's condensed matter physics. In 
the ground state, at zero absolute temperature, all the electrons occupy states from the lowest 
level up to the Fermi level, which may be called the vacuum state. Then, at finite temperature, 
some electrons are excited to above the Fermi level and holes are created. Many phenomena are 
successfully described using particle-hole ir 



Let us consider a wave field instead of a wave function, obeying the spinless Klein-Gordon equation for simplicity. The wave with negative frequency, i.e. $p^\mu = (-E, -\mathbf{p})$, $E > 0$, can be written as

$e^{-i\{(-E)(-t) - (-\mathbf{p})\cdot(-\mathbf{x})\}} = e^{-i(Et - \mathbf{p}\cdot\mathbf{x})}$    (4.31)

This means that a wave with negative frequency, i.e. $p^\mu = (-E, -\mathbf{p})$, $E > 0$, traveling backward in time as well as in space gives a wave traveling in the same direction as that with positive frequency. A current created by a plane wave field $\phi = N e^{-ip\cdot x}$ is expressed as

$j^\mu = ie(\phi^*\partial^\mu\phi - \partial^\mu\phi^*\,\phi) \sim e|N|^2 p^\mu = e|N|^2(E, \mathbf{p})$    (4.32)

The plane wave with negative frequency $p^\mu = (-E, -\mathbf{p})$ gives a current

$j^\mu = e|N|^2(-|E|, -\mathbf{p}) = -e|N|^2(|E|, \mathbf{p})$    (4.33)

which is the same current as the positive energy current, only the charge is opposite. If we interpret $\phi$ not as a relativistic probability amplitude but as a classical wave field, the negative energy simply means negative frequency and presents no difficulty in its interpretation.

The concept of backward time traveling, though unnecessary once quantized 
field theory is introduced, brings in a new concept about scattering phenomena. 

Consider a particle A proceeding in time. It will behave as described in Fig. 4.5a, 
where kinks denote interactions. In the conventional way of thinking, the particle 
never travels backward in time. Using the new concept of backward travel for a 




Figure 4.5 If backward time travel is allowed, a particle-antiparticle pair appears. (a) Ordinary scattering. (b) When backward travel is allowed, a scattering backwards in time occurs. From the observational point of view, an antiparticle B and a particle C appear at time $t = t_2$. The antiparticle and the original particle A annihilate each other and disappear at time $t = t_3$. (c) Within an interval allowed by the Heisenberg uncertainty principle, any number of particles can appear.

negative-energy particle, a process like Fig. 4.5b is allowed. Observationally no particle travels backward in time. What really happens is that an antiparticle B and a particle C appear at time $t = t_2$, and subsequently the antiparticle B annihilates with the original particle A and disappears at time $t = t_3$. The particle C takes over the part of A and proceeds onwards. A pair creation from vacuum has occurred. As a matter of fact, any number of particles can be created in a time interval allowed by the Heisenberg uncertainty principle. The charge has to be conserved, which means the backward-traveling particle must have the opposite charge.

What we have learned is, whether a sea of negative energy or backward travel 
in time is considered, a negative-energy state forces us to go from a one-particle 
description to a multiparticle world. A backward-traveling particle induces vacuum 
fluctuation. The vacuum is no longer a stable static state, but rather like boiling 
water, constantly creating and annihilating pairs of particles and antiparticles. 

4.1.4 

Lorentz-Covariant Dirac Equation 

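The Schrödinger equation involves special treatment of the time variable or energy and its form is not manifestly covariant. For a covariant treatment, we multiply both sides of Eq. (4.20) by $\beta$ and rearrange the equation to get the Dirac equation in a covariant form:

$(\gamma^\mu p_\mu - m)\psi = (\gamma^\mu i\partial_\mu - m)\psi(x) = 0$    (4.34)

where we have defined the Dirac $\gamma$ matrices by

$\gamma^0 = \beta = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad \boldsymbol{\gamma} = (\gamma^1, \gamma^2, \gamma^3) = \beta\boldsymbol{\alpha} = \begin{bmatrix} 0 & \boldsymbol{\sigma} \\ -\boldsymbol{\sigma} & 0 \end{bmatrix}$    (4.35)

They satisfy the following relations

$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}$    (4.36)

$\gamma^{0\dagger} = \gamma^0, \quad \boldsymbol{\gamma}^\dagger = -\boldsymbol{\gamma}$    (4.37)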
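The relations (4.36) and (4.37) can be spot-checked numerically. The following is a minimal sketch, assuming the Weyl-representation matrices of Eq. (4.35) and the metric diag(1, -1, -1, -1); the helper names (`block`, `matmul`, `dagger`, `close`) are illustrative and not part of the text.

```python
# Pauli matrices and small matrix helpers (plain nested lists, standard library only).
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def neg(a):
    return [[-x for x in row] for row in a]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


# Assumed Weyl-representation gamma matrices, following Eq. (4.35).
GAMMA = [block(Z2, I2, I2, Z2)] + [block(Z2, s, neg(s), Z2) for s in SIGMA]
METRIC = [1, -1, -1, -1]  # diagonal of g^{mu nu}


def dirac_condition_holds():
    """Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu} for all mu, nu."""
    for mu in range(4):
        for nu in range(4):
            ab = matmul(GAMMA[mu], GAMMA[nu])
            ba = matmul(GAMMA[nu], GAMMA[mu])
            anti = [[ab[i][j] + ba[i][j] for j in range(4)] for i in range(4)]
            g = METRIC[mu] if mu == nu else 0
            target = [[2 * g * (i == j) for j in range(4)] for i in range(4)]
            if not close(anti, target):
                return False
    return True


def hermiticity_holds():
    """Check that gamma^0 is hermitian and gamma^k is anti-hermitian, Eq. (4.37)."""
    ok = close(dagger(GAMMA[0]), GAMMA[0])
    return ok and all(close(dagger(GAMMA[k]), neg(GAMMA[k])) for k in (1, 2, 3))


print(dirac_condition_holds(), hermiticity_holds())
```

A check of this kind does not replace the algebraic proof, but it catches sign or factor slips in a chosen representation quickly.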

Equation (4.36) is called the Dirac condition for gamma matrices. To obtain a conjugate form of the Dirac equation, we take the hermitian conjugate of Eq. (4.34) and multiply by $\gamma^0$ from the right. Using $\gamma^0\gamma^{\mu\dagger}\gamma^0 = \gamma^\mu$ we obtain

$\bar\psi(\gamma^\mu i\overleftarrow{\partial}_\mu + m) = 0, \quad \bar\psi = \psi^\dagger\gamma^0$    (4.38)

$\bar\psi$ is called the adjoint of $\psi$. The equation of continuity is now expressed as

$\partial_\mu j^\mu = \partial_\mu(\bar\psi\gamma^\mu\psi) = 0$    (4.39)



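The form suggests that the current $\bar\psi\gamma^\mu\psi$ is a Lorentz 4-vector. Indeed if we define $S^{\mu\nu}$ by

$S^{\mu\nu} = \frac{i}{4}[\gamma^\mu, \gamma^\nu], \quad S^{\mu\nu} = -S^{\nu\mu}$    (4.40a)

$S^{ij} = \frac{1}{2}\epsilon_{ijk}\begin{bmatrix} \sigma_k & 0 \\ 0 & \sigma_k \end{bmatrix} = \frac{1}{2}\epsilon_{ijk}\Sigma^k, \quad S^{0i} = \frac{i}{2}\begin{bmatrix} -\sigma_i & 0 \\ 0 & \sigma_i \end{bmatrix} = \frac{i}{2}\alpha_i$    (4.40b)

they satisfy the commutation relations Eq. (3.74) for Lorentz group generators. Therefore

$S = \exp\left(-\frac{i}{2}\omega_{\mu\nu}S^{\mu\nu}\right) = \exp(\boldsymbol{\alpha}\cdot\boldsymbol{\eta}/2 - i\boldsymbol{\Sigma}\cdot\boldsymbol{\theta}/2)$    (4.41)

is a Lorentz operator acting on $\psi(x)$. The above formula shows that the rotation operator is unitary but the boost operator is not.6) Using Eq. (4.41) and the Dirac condition Eq. (4.36) for the gamma matrices, we obtain

$S^{-1} = \gamma^0 S^\dagger \gamma^0$    (4.42)

A Lorentz-transformed spinor takes the form [see Eq. (3.69)]

$\psi(x) \to \psi'(x) = D(L)\psi(L^{-1}x)$    (4.43)

or equivalently, expressed in terms of $S(L)$,

$\psi(x) \to \psi'(x') = S(L)\psi(x), \quad \bar\psi(x) \to \bar\psi'(x') = \bar\psi(x)S^{-1}(L)$    (4.44)

A bit of algebra shows that $\bar\psi\psi$ and $\bar\psi\gamma^\mu\psi$ are a Lorentz scalar and a vector, respectively. Although $\psi$ itself is not a vector, its bilinear form with constant matrices sandwiched in between behaves like a vector (or a tensor); $\psi$ itself is called a spinor.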

Problem 4.5

For Lorentz boost in the x direction Eq. (4.41) becomes

$S(L_x) = \exp(\alpha_x\eta_x/2)$    (4.45)

Show that $\bar\psi\gamma^\mu\psi$ follows the same transformation law as given in Eq. (3.62) and that $\bar\psi\psi$ remains unchanged.

More detailed treatment of the spinor is given in Appendix A. Loosely speaking, we may say the spinor is like the square root of the vector. Since

$\bar\psi\gamma^\mu\psi \to \bar\psi'\gamma^\mu\psi' = \bar\psi S^{-1}\gamma^\mu S\psi = L^\mu{}_\nu\,\bar\psi\gamma^\nu\psi$    (4.46a)

6) This is related to the fact that the Lorentz group is not a compact group. The range of the 
realize states that transform as unitary representations of the Poincare group. 
we may simply write

$\gamma^\mu \to \gamma'^\mu = S^{-1}\gamma^\mu S = L^\mu{}_\nu\gamma^\nu$    (4.46b)

In this sense we also refer to $\gamma^\mu$ as a Lorentz 4-vector, but it is meaningful only when it is sandwiched between spinors.

Now we prove the Lorentz covariance of the Dirac equation. By a Lorentz transformation Eq. (4.34) should take the form

$(\gamma^\mu i\partial'_\mu - m)\psi'(x') = 0$    (4.47)

Inserting (4.44) and using

$\partial'_\mu = (L^{-1})^\rho{}_\mu\,\partial_\rho$    (4.48)

Eq. (4.47) becomes

$[\gamma^\mu (L^{-1})^\rho{}_\mu\, i\partial_\rho - m]\,S\psi(x) = 0$    (4.49)

Multiplying by $S^{-1}$ from the left and using Eq. (4.46b), we obtain

$[(S^{-1}\gamma^\mu S)(L^{-1})^\rho{}_\mu\, i\partial_\rho - m]\,\psi(x) = [L^\mu{}_\nu (L^{-1})^\rho{}_\mu\,\gamma^\nu i\partial_\rho - m]\,\psi(x) = 0$

$\therefore\ (\gamma^\mu i\partial_\mu - m)\psi(x) = 0$    (4.50)

Thus the Dirac equation has been proved to keep its form under the Lorentz transformation.
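The key step of the proof, Eq. (4.46b), can be illustrated for a boost along x. The sketch below assumes the Weyl-representation matrices of Eq. (4.35) and uses $\alpha_x = \gamma^0\gamma^1$ with $\alpha_x^2 = 1$, so that $\exp(\alpha_x\eta/2) = \cosh(\eta/2) + \alpha_x\sinh(\eta/2)$ as in Eq. (4.45). It checks that $S^{-1}\gamma^0 S$ and $S^{-1}\gamma^1 S$ are linear combinations of $\gamma^0$ and $\gamma^1$ with coefficients $\cosh\eta$ and $\sinh\eta$. Whether the signs agree with the convention of Eq. (3.62) is not checked here; all function names are hypothetical.

```python
import math

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA_X = [[0, 1], [1, 0]]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def combo(ca, a, cb, b):
    """Return the 4x4 matrix ca*a + cb*b."""
    return [[ca * a[i][j] + cb * b[i][j] for j in range(4)] for i in range(4)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


IDENTITY4 = block(I2, Z2, Z2, I2)
GAMMA0 = block(Z2, I2, I2, Z2)
GAMMA1 = block(Z2, SIGMA_X, [[0, -1], [-1, 0]], Z2)
ALPHA_X = matmul(GAMMA0, GAMMA1)  # alpha = gamma^0 gamma, so alpha_x squares to 1


def boost_pair(eta):
    """Return (S, S^-1) for S = exp(alpha_x * eta / 2)."""
    c, s = math.cosh(eta / 2), math.sinh(eta / 2)
    return combo(c, IDENTITY4, s, ALPHA_X), combo(c, IDENTITY4, -s, ALPHA_X)


def transformed(gamma, eta):
    """Return S^-1 gamma S for the boost with rapidity eta."""
    s_mat, s_inv = boost_pair(eta)
    return matmul(s_inv, matmul(gamma, s_mat))


eta = 0.7
ch, sh = math.cosh(eta), math.sinh(eta)
print(close(transformed(GAMMA0, eta), combo(ch, GAMMA0, sh, GAMMA1)))
print(close(transformed(GAMMA1, eta), combo(sh, GAMMA0, ch, GAMMA1)))
```

The half-angle in $S$ versus the full rapidity in the transformed $\gamma^\mu$ is the numerical counterpart of the remark that a spinor is like the square root of a vector.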

4.2 

Plane-Wave Solution 

Rewriting the four-component Dirac equation again in terms of two-component wave functions, we obtain

$i(\partial_0 - \boldsymbol{\sigma}\cdot\nabla)\phi_L(x) = m\phi_R(x)$    (4.51a)

$i(\partial_0 + \boldsymbol{\sigma}\cdot\nabla)\phi_R(x) = m\phi_L(x)$    (4.51b)

See Eq. (4.17), (4.34) and (4.35) to obtain the above equations. We see that the mass is the cause of mixing of $\phi_L$ and $\phi_R$.7) For plane-wave solutions, $\phi_L$ and $\phi_R$ can be expressed as

$\phi_L(x) = N\phi_L(p)e^{-ip\cdot x}$    (4.52a)

$\phi_R(x) = N\phi_R(p)e^{-ip\cdot x}$    (4.52b)

7) The reason why the two helicity states mix when the mass is finite can be understood as follows. The velocity of a particle with finite mass is always less than the speed of light c. Therefore it is possible to transform to a coordinate system that is flying faster than the particle. In that coordinate system, the particle is moving backwards but rotating in the same direction. Therefore the helicity status is inverted. The amplitude for mixing is proportional to ~ m/p. If m = 0, there is no such coordinate system and hence the helicity does not mix.

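They satisfy the following coupled equations:

$(p^0 + \boldsymbol{\sigma}\cdot\mathbf{p})\phi_L(p) = m\phi_R(p)$    (4.53a)

$(p^0 - \boldsymbol{\sigma}\cdot\mathbf{p})\phi_R(p) = m\phi_L(p)$    (4.53b)

From Eq. (4.53a), we can express $\phi_R$ in terms of $\phi_L$

$\phi_R = \frac{1}{m}(p^0 + \boldsymbol{\sigma}\cdot\mathbf{p})\phi_L$    (4.54)

Then inserting this equation into Eq. (4.53b), we obtain

$\frac{1}{m}(p^0 - \boldsymbol{\sigma}\cdot\mathbf{p})(p^0 + \boldsymbol{\sigma}\cdot\mathbf{p})\phi_L = \frac{1}{m}(p^{0\,2} - \mathbf{p}^2)\phi_L = m\phi_L$    (4.55)

Therefore, $\phi_L$ satisfies the Klein-Gordon equation. Similarly $\phi_R$ satisfies the Klein-Gordon equation too. To obtain the solution to Eq. (4.53), we assume $p^0 = E > 0$ and rewrite $E$, $\mathbf{p}$ in terms of rapidity. Using $E = m\gamma = m\cosh\eta$, $\mathbf{p} = m\beta\gamma\hat{\mathbf{p}} = m\sinh\eta\,\hat{\mathbf{p}}$ [see Eq. (3.65)], we have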

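$E \pm \boldsymbol{\sigma}\cdot\mathbf{p} = m(\cosh\eta \pm \sinh\eta\,\boldsymbol{\sigma}\cdot\hat{\mathbf{p}}) = m\exp(\pm\boldsymbol{\sigma}\cdot\boldsymbol{\eta})$    (4.56a)

$\eta = \frac{1}{2}\ln\frac{E + p}{E - p}, \quad \hat{\mathbf{p}} = \frac{\mathbf{p}}{|\mathbf{p}|}, \quad p = |\mathbf{p}|, \quad \boldsymbol{\eta} = \eta\hat{\mathbf{p}}$    (4.56b)

Note: When dealing with a transcendental function of matrices, we can always expand it in series. Thus

$\exp(\boldsymbol{\sigma}\cdot\boldsymbol{\eta}) = 1 + (\boldsymbol{\sigma}\cdot\boldsymbol{\eta}) + \frac{(\boldsymbol{\sigma}\cdot\boldsymbol{\eta})^2}{2!} + \frac{(\boldsymbol{\sigma}\cdot\boldsymbol{\eta})^3}{3!} + \cdots$    (4.57a)

Using

$(\boldsymbol{\sigma}\cdot\boldsymbol{\eta})^{2n} = \eta^{2n}\,\mathbf{1}, \quad (\boldsymbol{\sigma}\cdot\boldsymbol{\eta})^{2n+1} = \eta^{2n+1}(\boldsymbol{\sigma}\cdot\hat{\mathbf{p}})$    (4.57b)

and rearranging the series, we obtain

$\exp(\boldsymbol{\sigma}\cdot\boldsymbol{\eta}) = \left(1 + \frac{\eta^2}{2!} + \cdots\right) + \left(\eta + \frac{\eta^3}{3!} + \cdots\right)(\boldsymbol{\sigma}\cdot\hat{\mathbf{p}}) = \cosh\eta + \sinh\eta\,(\boldsymbol{\sigma}\cdot\hat{\mathbf{p}})$    (4.57c)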
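The series argument of Eq. (4.57) can be illustrated numerically. The sketch below sums the power series of $\exp(\boldsymbol{\sigma}\cdot\boldsymbol{\eta})$ for an arbitrarily chosen rapidity vector and compares it with the closed form $\cosh\eta + \sinh\eta\,(\boldsymbol{\sigma}\cdot\hat{\mathbf{p}})$. The function names and the sample vector are assumptions made for illustration only.

```python
import math


def matmul2(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


def sigma_dot(v):
    """Return sigma . v for a real 3-vector v as a 2x2 complex matrix."""
    x, y, z = v
    return [[z, x - 1j * y], [x + 1j * y, -z]]


def exp_series(m, terms=40):
    """Sum the power series 1 + m + m^2/2! + ... for a 2x2 matrix m."""
    result = [[1, 0], [0, 1]]
    term = [[1, 0], [0, 1]]
    for n in range(1, terms):
        term = matmul2(term, m)
        term = [[x / n for x in row] for row in term]  # term is now m^n / n!
        result = [[result[i][j] + term[i][j] for j in range(2)] for i in range(2)]
    return result


def closed_form(eta_vec):
    """Return cosh(eta) + sinh(eta) * (sigma . p_hat) for the rapidity vector eta_vec."""
    eta = math.sqrt(sum(c * c for c in eta_vec))
    p_hat = [c / eta for c in eta_vec]
    sp = sigma_dot(p_hat)
    return [
        [math.cosh(eta) * (i == j) + math.sinh(eta) * sp[i][j] for j in range(2)]
        for i in range(2)
    ]


def max_difference(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


eta_vec = (0.3, -0.4, 1.2)
print(max_difference(exp_series(sigma_dot(eta_vec)), closed_form(eta_vec)))
```

The printed number is the largest entry-wise deviation between the two expressions; it should be at the level of floating-point rounding.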
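Let us introduce four-component spin vectors $\sigma^\mu$, $\bar\sigma^\mu$ and dot products by

$\sigma^\mu = (1, \boldsymbol{\sigma}) = \bar\sigma_\mu, \quad \bar\sigma^\mu = (1, -\boldsymbol{\sigma}) = \sigma_\mu$    (4.58a)

$p\cdot\sigma = p^\mu\sigma_\mu = E - \boldsymbol{\sigma}\cdot\mathbf{p}$    (4.58b)

$p\cdot\bar\sigma = p^\mu\bar\sigma_\mu = E + \boldsymbol{\sigma}\cdot\mathbf{p}$    (4.58c)

Then multiplying by $\sqrt{p\cdot\sigma} = \sqrt{m}\exp(-\boldsymbol{\sigma}\cdot\boldsymbol{\eta}/2)$ on both sides of Eq. (4.53a) and using $(E - \boldsymbol{\sigma}\cdot\mathbf{p})(E + \boldsymbol{\sigma}\cdot\mathbf{p}) = E^2 - \mathbf{p}^2 = m^2$, we obtain

$\sqrt{p\cdot\bar\sigma}\,\phi_L = \sqrt{p\cdot\sigma}\,\phi_R$    (4.59)

which has solutions

$\phi_L = N\sqrt{p\cdot\sigma}\,\xi_r = N\sqrt{m}\exp\left(-\tfrac{1}{2}\boldsymbol{\sigma}\cdot\boldsymbol{\eta}\right)\xi_r$    (4.60a)

$\phi_R = N\sqrt{p\cdot\bar\sigma}\,\xi_r = N\sqrt{m}\exp\left(+\tfrac{1}{2}\boldsymbol{\sigma}\cdot\boldsymbol{\eta}\right)\xi_r$    (4.60b)

Combining the two two-component wave functions, we obtain a four-component solution

$u_r(p) = \begin{bmatrix} \phi_L \\ \phi_R \end{bmatrix} = N\begin{bmatrix} \sqrt{p\cdot\sigma}\,\xi_r \\ \sqrt{p\cdot\bar\sigma}\,\xi_r \end{bmatrix} = N\sqrt{m}\begin{bmatrix} \exp(-\tfrac{1}{2}\boldsymbol{\sigma}\cdot\boldsymbol{\eta})\,\xi_r \\ \exp(+\tfrac{1}{2}\boldsymbol{\sigma}\cdot\boldsymbol{\eta})\,\xi_r \end{bmatrix}$    (4.61a)

where $\xi_r$ (r = 1, 2) is a basis of two-component spin eigenstates, and N is a normalization constant taken as N = 1 because we adopt normalizations such that there are 2E particles per volume. (See Sect. 6.5.2 for detailed discussion.) Comparing with Eq. (4.41), we see the plane-wave solution $\psi(x) \sim u(p)e^{-ip\cdot x}$ is nothing but a Lorentz-boosted spinor from the rest frame. For spin functions, it is often convenient to use the helicity eigenstate $\chi_\pm$ instead of $\xi_r$. In that case, $p\cdot\sigma$ and $p\cdot\bar\sigma$ acting on them are pure c-numbers.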

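For the solution with $p^0 = -E < 0$, we use $e^{ip\cdot x} = e^{i(Et - \mathbf{p}\cdot\mathbf{x})}$ for a plane-wave expression instead of $e^{-ip\cdot x}|_{p^0 = -E} = e^{i(Et + \mathbf{p}\cdot\mathbf{x})}$ and invert the momentum together to represent a particle propagating in the same direction as that of positive energy. Replacing $(p^0 \pm \boldsymbol{\sigma}\cdot\mathbf{p}) \to -(E \pm \boldsymbol{\sigma}\cdot\mathbf{p})$ in Eq. (4.53), the solution becomes

$v_r(p) = N\begin{bmatrix} \sqrt{p\cdot\sigma}\,\eta_r \\ -\sqrt{p\cdot\bar\sigma}\,\eta_r \end{bmatrix}, \quad \eta_r = i\sigma_2\xi_r^*$    (4.61b)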



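where $\eta_r$, a flipped spin state, was chosen instead of $\xi_r$ for reasons which will later become clear (see Sect. 4.3.4). Sometimes, a somewhat different expression for $u_r(p)$ and $v_s(p)$ may be useful. Going back to Eq. (4.60) and using

$\cosh\frac{\eta}{2} \mp \sinh\frac{\eta}{2}\,\boldsymbol{\sigma}\cdot\hat{\mathbf{p}} = \cosh\frac{\eta}{2}\left(1 \mp \tanh\frac{\eta}{2}\,\boldsymbol{\sigma}\cdot\hat{\mathbf{p}}\right), \quad \cosh\frac{\eta}{2} = \sqrt{\frac{E + m}{2m}}, \quad \tanh\frac{\eta}{2} = \frac{p}{E + m}$    (4.62)

the plane-wave solutions are expressed as

$u_r(p) = \sqrt{\frac{E + m}{2}}\begin{bmatrix} \left(1 - \frac{\boldsymbol{\sigma}\cdot\mathbf{p}}{E + m}\right)\xi_r \\ \left(1 + \frac{\boldsymbol{\sigma}\cdot\mathbf{p}}{E + m}\right)\xi_r \end{bmatrix}, \quad v_r(p) = \sqrt{\frac{E + m}{2}}\begin{bmatrix} \left(1 - \frac{\boldsymbol{\sigma}\cdot\mathbf{p}}{E + m}\right)\eta_r \\ -\left(1 + \frac{\boldsymbol{\sigma}\cdot\mathbf{p}}{E + m}\right)\eta_r \end{bmatrix}$    (4.63)

These solutions satisfy the following relations:

$\bar u_r(p)u_s(p) = 2m\delta_{rs}, \quad u_r^\dagger(p)u_s(p) = 2E\delta_{rs}$    (4.64a)

$\bar v_r(p)v_s(p) = -2m\delta_{rs}, \quad v_r^\dagger(p)v_s(p) = 2E\delta_{rs}$    (4.64b)

$\bar u_r(p)v_s(p) = \bar v_r(p)u_s(p) = 0$    (4.64c)

$u_r^\dagger(p)v_s(-p) = v_r^\dagger(-p)u_s(p) = 0$    (4.64d)

Note: $u_r^\dagger(p)v_s(p) \neq 0, \quad v_r^\dagger(p)u_s(p) \neq 0$    (4.64e)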


Problem 4.6 
Prove Eq. (4.64). 
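As a numerical spot check (not a proof) of the positive-energy relations in Eq. (4.64a), the sketch below builds $u_r(p)$ from Eq. (4.61a) with N = 1 for a momentum along the z axis, where $\sqrt{p\cdot\sigma}$ and $\sqrt{p\cdot\bar\sigma}$ are diagonal. The choice of axis, the sample values of m and p, and the function names are assumptions made for illustration.

```python
import math


def u_spinor(m, pz, r):
    """u_r(p) of Eq. (4.61a) with N = 1, momentum pz along z; r = 0 (spin up) or 1 (spin down)."""
    energy = math.sqrt(m * m + pz * pz)
    xi = [1.0, 0.0] if r == 0 else [0.0, 1.0]
    root_minus = math.sqrt(energy - pz)
    root_plus = math.sqrt(energy + pz)
    # For p along z: sqrt(p.sigma) = diag(root_minus, root_plus),
    # and sqrt(p.sigma-bar) = diag(root_plus, root_minus).
    phi_l = [root_minus * xi[0], root_plus * xi[1]]
    phi_r = [root_plus * xi[0], root_minus * xi[1]]
    return phi_l + phi_r


def bar_product(a, b):
    """a-bar b = a^dagger gamma^0 b for real spinors; gamma^0 of Eq. (4.35) swaps the two halves."""
    return a[0] * b[2] + a[1] * b[3] + a[2] * b[0] + a[3] * b[1]


def dagger_product(a, b):
    """a^dagger b for real spinors."""
    return sum(x * y for x, y in zip(a, b))


m, pz = 0.5, 1.2
energy = math.sqrt(m * m + pz * pz)
u_up, u_down = u_spinor(m, pz, 0), u_spinor(m, pz, 1)
print(bar_product(u_up, u_up), 2 * m)          # u-bar u compared with 2m
print(dagger_product(u_up, u_up), 2 * energy)  # u-dagger u compared with 2E
print(bar_product(u_up, u_down))               # off-diagonal element
```

The two numbers on each of the first two lines should agree within rounding, and the last line should vanish.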

If we take the z-axis in the direction of the momentum, all the plane-wave solutions can be written down concisely.



Problem 4.7

Using c-number $p^\mu$, prove

$(\not{p} - m)u(p) = 0, \quad \bar u(p)(\not{p} - m) = 0$    (4.65a)

$(\not{p} + m)v(p) = 0, \quad \bar v(p)(\not{p} + m) = 0$    (4.65b)

$\not{p} = \gamma^\mu p_\mu$    (4.65c)

The notation $\not{p}$ is referred to as the Feynman slash.


Problem 4.8

Denoting the a-th component of $u_r(p)$ as $\{u_r(p)\}_a$, show that

$[\Lambda_+]_{ab} = \frac{1}{2m}\sum_r \{u_r(p)\}_a\{\bar u_r(p)\}_b = \left[\frac{\not{p} + m}{2m}\right]_{ab}, \quad [\Lambda_-]_{ab} = -\frac{1}{2m}\sum_r \{v_r(p)\}_a\{\bar v_r(p)\}_b = \left[\frac{-\not{p} + m}{2m}\right]_{ab}$    (4.66)

are projection operators for picking up positive and negative energy solutions respectively. Namely

$\Lambda_+ u_r(p) = u_r(p), \quad \Lambda_+ v_r(p) = 0, \quad \Lambda_- u_r(p) = 0, \quad \Lambda_- v_r(p) = v_r(p)$    (4.67)

The properties of projection operators are

$\Lambda_+ + \Lambda_- = 1, \quad \Lambda_+^2 = \Lambda_+, \quad \Lambda_-^2 = \Lambda_-, \quad \Lambda_\pm\Lambda_\mp = 0$    (4.68)

Equations (4.66) mean

$\sum_r u_r(p)\bar u_r(p) = \not{p} + m, \quad \sum_r v_r(p)\bar v_r(p) = \not{p} - m$    (4.69)

which will appear frequently in the calculation of the transition amplitude.



4.3 

Properties of the Dirac Particle 

4.3.1 

Magnetic Moment of the Electron 

To understand the nonrelativistic properties of the Dirac particle, let us derive a low-energy approximation of the Dirac equation. We apply it to the electron, which has electric charge $q = -e$. Setting $\psi = \begin{bmatrix} u_A \\ u_B \end{bmatrix}$, first we write down the equation in terms of two-component wave functions. We also make the substitution $p_\mu \to p_\mu + eA_\mu$ to include the electromagnetic interaction:

$\{\gamma^\mu(i\partial_\mu + eA_\mu) - m\}\psi(x) = \{(i\partial_0 + e\phi)\gamma^0 - \boldsymbol{\gamma}\cdot(-i\nabla + e\mathbf{A}) - m\}\psi(x)$

$= \begin{bmatrix} -m & i\partial_0 + e\phi - \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A}) \\ i\partial_0 + e\phi + \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A}) & -m \end{bmatrix}\begin{bmatrix} u_A \\ u_B \end{bmatrix} = 0$    (4.70a)

This is a set of two equations for the two-component spinors $u_A$, $u_B$:

$\{i\partial_0 + e\phi - \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})\}u_B = mu_A$    (4.70b)

$\{i\partial_0 + e\phi + \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})\}u_A = mu_B$    (4.70c)

In the high-energy limit, where the mass can be neglected, equations for $u_A$ and $u_B$ are disconnected and represent two independent particles ($u_A \sim \phi_L$, $u_B \sim \phi_R$). But in the low-energy limit, the difference disappears and $u_A \approx u_B$. For this reason we introduce8)

$u_+ = \frac{1}{\sqrt{2}}(u_A + u_B), \quad u_- = \frac{1}{\sqrt{2}}(u_B - u_A)$    (4.71)

with the expectation $u_- \to 0$ in the nonrelativistic limit. Then the Dirac equation becomes

$(i\partial_0 + e\phi)u_+ - \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})u_- = mu_+$    (4.72a)

$(i\partial_0 + e\phi)u_- - \boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})u_+ = -mu_-$    (4.72b)

Up to now the equation is exact. At this point we choose the $p^0 > 0$ solution and set

$u_+ = \varphi\,e^{-i(m+T)t}, \quad u_- = \chi\,e^{-i(m+T)t}$    (4.73)

Then we can make the substitution $i\partial_0 \to m + T$, where T now represents the kinetic energy. Substituting Eq. (4.73) in Eq. (4.72) and applying the nonrelativistic approximation $T, e\phi \ll m$, Eq. (4.72b) becomes

$\chi \simeq \frac{1}{2m}\boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})\varphi$    (4.74)

Substituting this expression in Eq. (4.72a), we obtain

$T\varphi = \left[-e\phi + \frac{1}{2m}\boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})\,\boldsymbol{\sigma}\cdot(-i\nabla + e\mathbf{A})\right]\varphi$    (4.75)

Then we make use of the relations

$(\boldsymbol{\sigma}\cdot\mathbf{C})(\boldsymbol{\sigma}\cdot\mathbf{D}) = \mathbf{C}\cdot\mathbf{D} + i\boldsymbol{\sigma}\cdot(\mathbf{C}\times\mathbf{D})$

$(-i\nabla + e\mathbf{A})\times(-i\nabla + e\mathbf{A})\varphi = -ie(\nabla\times\mathbf{A} + \mathbf{A}\times\nabla)\varphi$

$= -ie\left(((\nabla\times\mathbf{A}))\varphi + ((\nabla\varphi))\times\mathbf{A} + ((\mathbf{A}\times\nabla\varphi))\right) = -ie\mathbf{B}\varphi$    (4.76)

Here $((\cdots))$ means that the action of the nabla does not go beyond the double bracket and $\mathbf{B}$ is the magnetic field. After inserting Eq. (4.76) in Eq. (4.75) and rearranging, we obtain

$T\varphi = \left[\frac{1}{2m}(\mathbf{p} + e\mathbf{A})^2 - e\phi + \frac{e}{2m}\boldsymbol{\sigma}\cdot\mathbf{B}\right]\varphi, \quad \mathbf{p} = -i\nabla$    (4.77)

which is the Schrödinger equation including the electromagnetic potential plus an extra magnetic interaction term $-\boldsymbol{\mu}\cdot\mathbf{B}$. This means that the Dirac particle has a magnetic moment

$\boldsymbol{\mu} = -\frac{e}{2m}\boldsymbol{\sigma} = g\frac{q}{2m}\mathbf{s}, \quad \mathbf{s} = \frac{\boldsymbol{\sigma}}{2} \quad (g = 2)$    (4.78)

8) This is equivalent to adopting the Pauli-Dirac representation in Eq. (4.19).

Using the general expression for the magnetic moment $\boldsymbol{\mu} = g(q/2m)\mathbf{s}$, this means that the Dirac particle has a g-factor of 2. This explained the then mysterious value of g = 2 for the spin magnetic moment of the electron. This fact again defies the classical notion of a spin being the angular momentum of a self-rotating object. From classical electromagnetism, a circulating electric charge should have g = 1.

The classical magnetic moment of a circulating charge can be calculated as follows. The Maxwell equation gives the value of the magnetic moment $\mu = IS$ to a closed loop current I with inner area S. Assume the current is made by a particle with charge q circulating at radius r. Taking its velocity v and circulating period T, or the frequency $\nu = 1/T$,

$\mu = IS = q\nu\,\pi r^2 = \frac{qv}{2\pi r}\pi r^2 = \frac{q}{2m}(mvr) = \frac{q}{2m}l$    (4.79)

($l$ = angular momentum)

Deviation of the value g from 2 is called an anomalous magnetic moment. For instance, the proton and the neutron have magnetic moments of 2.79 and -1.91 (in units of the nuclear magneton $q/2m_N$) and their deviation from the standard value is large, which exposes the nonelementary nature of the particles. Even for the Dirac particle g - 2 could be nonzero if we take into account higher order effects (radiative corrections) of the quantum electromagnetic fields (see arguments in Sect. 8.2.2).

4.3.2 
Parity 

In Sect. 4.1.2, we saw that $\phi_R$ and $\phi_L$ are exchanged by parity operation and introduced the $\beta$ matrix to do the action. Here, we treat the parity operation of the four-component Dirac equation

$(\gamma^\mu i\partial_\mu - m)\psi(x) = 0$    (4.80)

By parity operation, the Dirac equation is transformed to

$(\gamma^\mu i\partial'_\mu - m)\psi'(x') = 0$    (4.81a)

$x^\mu = (x^0, \mathbf{x}) \to x'^\mu = (x^0, -\mathbf{x}) = x_\mu$    (4.81b)

$\psi(x) \to \psi'(x') = S\psi(x)$    (4.81c)

So if there is a matrix S that satisfies

$S^{-1}\gamma^0 S = \gamma^0, \quad S^{-1}\gamma^k S = -\gamma^k \quad (k = 1 \text{ to } 3)$    (4.82)

we can claim that the Dirac equation is parity reflection invariant. Obviously

$S = \eta_P\gamma^0$    (4.83)

satisfies the condition as expected. Here, $\eta_P$ is a phase factor with the absolute value of 1. Applying the parity operation to the plane-wave solutions (4.61a) and (4.61b), we obtain

$Su_r(-p) = \eta_P u_r(p)$    (4.84a)

$Sv_r(-p) = -\eta_P v_r(p)$    (4.84b)

which means that a particle and its antiparticle have negative relative parity. Usually $\eta_P = 1$ is assumed and hence the particle has positive parity and the antiparticle negative.

4.3.3 

Bilinear Form of the Dirac Spinor 

A hermitian bilinear form made of four-component Dirac spinors $T = \bar\psi\Gamma\psi$ has 16 independent components. We have seen $V^\mu = \bar\psi\gamma^\mu\psi$ behaves as a contravariant vector under proper Lorentz transformation. Under parity operation,

$V^{0\prime} = \bar\psi S^{-1}\gamma^0 S\psi = \bar\psi\gamma^0\gamma^0\gamma^0\psi = \bar\psi\gamma^0\psi = V^0$    (4.85a)

$\mathbf{V}' = \bar\psi\gamma^0\boldsymbol{\gamma}\gamma^0\psi = -\bar\psi\boldsymbol{\gamma}\psi = -\mathbf{V}$    (4.85b)

Therefore, $V^\mu$ is a polar vector. Introducing

$\gamma_\mu = g_{\mu\nu}\gamma^\nu, \quad \sigma_{\mu\nu} = g_{\mu\rho}g_{\nu\sigma}\sigma^{\rho\sigma}$    (4.86)

Equation (4.85) can be expressed as

$V^\mu \to V^{\mu\prime} = V_\mu$    (4.87)

All the 16 independent bilinear forms are classified according to their Lorentz transformation property and are summarized in Table 4.1.

Problem 4.9

Confirm the parity transformation property of each item in Table 4.1.

Problem 4.10

Using (4.63), prove that the matrix elements $\langle p'|\Gamma|p\rangle = \bar u(p')\Gamma u(p)$ in the nonrelativistic limit are given to the order $O(p/m)$ as shown in Table 4.2.

Table 4.1 Sixteen independent bilinear forms $\bar\psi\Gamma\psi$.

Name            Symbol   Bilinear form                         #   After parity transf.                  Parity
Scalar          S        $\bar\psi\psi$                        1   $\bar\psi\psi$                        +
Pseudoscalar    P        $i\bar\psi\gamma^5\psi$               1   $-i\bar\psi\gamma^5\psi$              -
Vector          V        $\bar\psi\gamma^\mu\psi$              4   $\bar\psi\gamma_\mu\psi$              -
Axial vector    A        $\bar\psi\gamma^5\gamma^\mu\psi$      4   $-\bar\psi\gamma^5\gamma_\mu\psi$     +
Tensor          T        $\bar\psi\sigma^{\mu\nu}\psi$         6   $\bar\psi\sigma_{\mu\nu}\psi$         +

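Table 4.2 Nonrelativistic limit of the matrix elements.

$\Gamma$                                              $\langle p'|\Gamma|p\rangle/2m$
$\gamma^5$                                            $-[\boldsymbol{\sigma}\cdot(\mathbf{p}' - \mathbf{p})]/(2m)$
$\boldsymbol{\gamma}$                                 $(\mathbf{p} + \mathbf{p}')/(2m) + [i\boldsymbol{\sigma}\times(\mathbf{p}' - \mathbf{p})]/(2m)$
$\gamma^5\gamma^0$                                    $-[\boldsymbol{\sigma}\cdot(\mathbf{p}' + \mathbf{p})]/(2m)$
$\sigma^{ij} = \frac{i}{2}[\gamma^i, \gamma^j]$       $\epsilon_{ijk}\sigma_k$
$\sigma^{k0} = \frac{i}{2}[\gamma^k, \gamma^0]$       $i\{(\mathbf{p}' - \mathbf{p})/(2m) + [i\boldsymbol{\sigma}\times(\mathbf{p}' + \mathbf{p})]/(2m)\}$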

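Using the fact that in the nonrelativistic limit $\mathbf{p} \approx m\mathbf{v}$, the above quantities can be expressed as

$\langle 1\rangle \sim 1, \quad \langle\gamma^0\rangle \sim 1, \quad \langle\gamma^\mu\rangle \sim (1, \mathbf{v}), \quad \langle\gamma^\mu\gamma^5\rangle \sim (\boldsymbol{\sigma}\cdot\mathbf{v}, \boldsymbol{\sigma})$    (4.88)

These expressions will help in understanding the physical interpretation or dynamical properties of $\langle\Gamma\rangle$.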

4.3.4 

Charge Conjugation 

Charge conjugation is the operation of exchanging a particle and its antiparticle. The fact that the signature of an antiparticle appears as the opposite sign in the phase of the wave suggests a connection with complex conjugate operation. The difference between the particle and its antiparticle manifests itself in the interaction with electromagnetic fields. Interaction with the electromagnetic field can be taken into account by the substitution

$p^\mu \to p^\mu - qA^\mu$    (4.89)

and that of the antiparticle by replacing q by -q. So if we can transform the field into one that satisfies the same equation of motion but with opposite charge, the operation is the charge conjugation. Let us start with the simplest example, the Klein-Gordon equation. Substituting Eq. (4.89) in the Klein-Gordon equation,

$[(\partial_\mu + iqA_\mu)(\partial^\mu + iqA^\mu) + m^2]\phi = 0$    (4.90)

Taking the hermitian conjugate of the above equation we have

$[(\partial_\mu - iqA_\mu)(\partial^\mu - iqA^\mu) + m^2]\phi^\dagger = 0$    (4.91)

Comparing Eq. (4.90) with Eq. (4.91), we see that if $\phi$ describes a particle, then $\phi^\dagger$ describes its antiparticle, thus confirming our original guess. It also means that if $\phi$ is a real field, it cannot have an antiparticle, or rather it is its own antiparticle. It follows that a real field describes a neutral particle. The converse is not true. If a particle has a quantum number other than charge that differentiates the particle from its antiparticle, for example strangeness, the field is complex.

For the case of the Dirac particle, it is a little more complicated since mixing of the components could result from the charge conjugation operation. Let us start from the four-component Dirac equation including the electromagnetic interaction:

$[\gamma^\mu(i\partial_\mu - qA_\mu) - m]\psi = 0$    (4.92)

If there exists $\psi^c$ that satisfies the equation

$[\gamma^\mu(i\partial_\mu + qA_\mu) - m]\psi^c = 0$    (4.93)

then $\psi^c$ is the charge conjugate of $\psi$. Taking the hermitian conjugate of Eq. (4.92), we obtain

$\psi^\dagger[\gamma^{\mu\dagger}(-i\overleftarrow{\partial}_\mu - qA_\mu) - m] = 0$    (4.94)

Multiplying by $\gamma^0$ from the right, and using the relation $\gamma^{\mu\dagger} = \gamma^0\gamma^\mu\gamma^0$, we obtain

$\bar\psi[-\gamma^\mu(i\overleftarrow{\partial}_\mu + qA_\mu) - m] = 0$    (4.95)

Now, taking the transpose of the above equation

$[-\gamma^{\mu T}(i\partial_\mu + qA_\mu) - m]\bar\psi^T = 0$    (4.96)

and multiplying by C from the left, we obtain

$[-C\gamma^{\mu T}C^{-1}(i\partial_\mu + qA_\mu) - m]C\bar\psi^T = 0$    (4.97)

So, if there is a matrix C that satisfies the relation

$C\gamma^{\mu T}C^{-1} = -\gamma^\mu$    (4.98)

we can define a charge conjugate operation and

$\psi^c = C\bar\psi^T$    (4.99)

satisfies the equation (4.93) for the antiparticle. Noting

$\gamma^{\mu T} = \gamma^\mu \ (\mu = 0, 2), \quad \gamma^{\mu T} = -\gamma^\mu \ (\mu = 1, 3)$    (4.100)

we can adopt

$C = i\gamma^2\gamma^0 = \begin{bmatrix} i\sigma_2 & 0 \\ 0 & -i\sigma_2 \end{bmatrix}$    (4.101)

$i\sigma_2$ flips the spin orientation. Therefore, the charge conjugation operation includes spin flipping, as conjectured in Sect. 4.1.3. This is why we adopted the flipped-spin solution for the negative energy wave function (4.61b). This C matrix has the properties

$C = C^* = -C^{-1} = -C^T = -C^\dagger$    (4.102)
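The defining relation (4.98) and the properties (4.102) can be spot-checked numerically. This is a minimal sketch assuming the Weyl-representation gamma matrices of Eq. (4.35) and $C = i\gamma^2\gamma^0$ from Eq. (4.101); the helper names are illustrative only.

```python
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def neg(a):
    return [[-x for x in row] for row in a]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def transpose(a):
    return [[a[j][i] for j in range(4)] for i in range(4)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


IDENTITY4 = block(I2, Z2, Z2, I2)
GAMMA = [block(Z2, I2, I2, Z2)] + [block(Z2, s, neg(s), Z2) for s in SIGMA]

# C = i gamma^2 gamma^0, Eq. (4.101); C_INV = -C is the inverse claimed in Eq. (4.102).
C = [[1j * x for x in row] for row in matmul(GAMMA[2], GAMMA[0])]
C_INV = neg(C)


def c_properties_hold():
    """Check C C^-1 = 1, C^T = -C, and that C has no imaginary part."""
    is_real = all(abs(complex(x).imag) < 1e-12 for row in C for x in row)
    return close(matmul(C, C_INV), IDENTITY4) and close(transpose(C), neg(C)) and is_real


def conjugation_relation_holds():
    """Check C gamma^{mu T} C^-1 = -gamma^mu for mu = 0..3, Eq. (4.98)."""
    return all(
        close(matmul(C, matmul(transpose(GAMMA[mu]), C_INV)), neg(GAMMA[mu]))
        for mu in range(4)
    )


print(c_properties_hold(), conjugation_relation_holds())
```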

The charge-conjugated Dirac wave function is given by

$\psi^c = C\bar\psi^T = i\gamma^2\gamma^0\gamma^{0T}\psi^* = i\gamma^2\psi^*$    (4.103a)

$\bar\psi^c = \psi^{c\dagger}\gamma^0 = (C\bar\psi^T)^\dagger\gamma^0 = \bar\psi^*C^\dagger\gamma^0 = \psi^T\gamma^{0*}C^\dagger\gamma^0 = -\psi^TC^{-1}$    (4.103b)

Naturally, an electromagnetic current should change its sign under the charge-conjugation operation. In quantum mechanics, it is a definition and the sign has to be changed by hand, but in quantum field theory it comes automatically.

$j^{\mu c} = q\bar\psi^c\gamma^\mu\psi^c = -q\psi^TC^{-1}\gamma^\mu C\bar\psi^T = q\psi^T\gamma^{\mu T}\bar\psi^T$

$= -q(\bar\psi\gamma^\mu\psi)^T = -q\bar\psi\gamma^\mu\psi = -j^\mu$    (4.104)

The minus sign in the first equation of the second line comes from the anticommutative nature of the fermion field, which we will show in Chap. 5. We can remove the transpose in the penultimate equation because the quantity in the parentheses is a 1 x 1 matrix. □

Problem 4.11

Prove $(u_r)^c = C\bar u_r^T = v_r$, $(v_r)^c = C\bar v_r^T = u_r$



4.3.5 

Chiral Eigenstates 



We define a chirality operator $\gamma^5$ by

$\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3 = -i\alpha_1\alpha_2\alpha_3 = \begin{bmatrix} -1 & 0 \\ 0 & 1 \end{bmatrix}$    (4.106)

which satisfies the relation

$(\gamma^5)^2 = 1, \quad \gamma^5\gamma^\mu + \gamma^\mu\gamma^5 = 0$    (4.107)

We also define chiral eigenstates, which play an important role in the weak interaction, by

$\psi_L = \frac{1}{2}(1 - \gamma^5)\psi, \quad \psi_R = \frac{1}{2}(1 + \gamma^5)\psi$    (4.108)

It is easily seen that $\psi_L$ and $\psi_R$ are eigenstates of the chirality operator $\gamma^5$

$\gamma^5\psi_L = -\psi_L, \quad \gamma^5\psi_R = +\psi_R$    (4.109)

In the Weyl representation, the two chiral eigenstates are expressed in terms of the two-component spinors $\phi_L$ and $\phi_R$ separately. In other representations, they are not necessarily separated, but the following arguments are equally valid. Multiplying Eq. (4.34) by $\gamma^5$ and using Eq. (4.107) we can obtain an equation for $\gamma^5\psi$. Then adding and subtracting it from Eq. (4.34), we obtain

$\gamma^\mu i\partial_\mu\psi_L = m\psi_R$    (4.110a)

$\gamma^\mu i\partial_\mu\psi_R = m\psi_L$    (4.110b)

which reproduces the four-component version of Eq. (4.53). Since $\gamma^5$ anticommutes with all the $\gamma^\mu$'s ($\mu$ = 0 to 3), it commutes with the Lorentz operator (4.41). Therefore the chiral eigenstate is invariant under the proper Lorentz transformation. Under the parity operation

$\psi(x) \to \psi'(x') = \psi'(t, -\mathbf{x}) = \gamma^0\psi(x)$    (4.111)

Since $\gamma^0$ anticommutes with $\gamma^5$, $\psi_L$ is converted to $\psi_R$ and vice versa. In the Weyl representation, $\phi_L$ and $\phi_R$ are the two chiral eigenstates.

Let us take a look at the spin properties of the plane-wave solution and how they are related to the Weyl solutions. We notice that in the high-energy or small-mass limit $p \to E$

$\lim_{p\to E} p\cdot\sigma = 2E\left(\frac{1 - \boldsymbol{\sigma}\cdot\hat{\mathbf{p}}}{2}\right), \quad \lim_{p\to E} p\cdot\bar\sigma = 2E\left(\frac{1 + \boldsymbol{\sigma}\cdot\hat{\mathbf{p}}}{2}\right)$    (4.112)

They are just projection operators to pick up the helicity $\mp$ state. This means $\phi_L$ and $\phi_R$ approach the Weyl solution in the high-energy limit ($p \to E$ or $m \to 0$). For $p \neq E$, $\sqrt{p\cdot\sigma}$, $\sqrt{p\cdot\bar\sigma}$ preferentially pick up helicity $\mp$ components. To determine the helicity expectation value of $\phi_L$, $\phi_R$, apply them to an unpolarized helicity state, i.e. $\xi = \chi_+ + \chi_-$:

$\phi_L \propto \sqrt{p\cdot\sigma}\,(\chi_+ + \chi_-) = \sqrt{E + p}\,\chi_- + \sqrt{E - p}\,\chi_+ \equiv a\chi_- + b\chi_+$    (4.113)

We note that $\phi_L$ is primarily a negative helicity state, i.e. left-handed, but contains a small $\sim O(m/p)$ component of the opposite helicity state. The expectation value of the helicity is given by

$\langle h(\phi_L)\rangle = \frac{\phi_L^\dagger h\phi_L}{\phi_L^\dagger\phi_L} = \frac{-|a|^2 + |b|^2}{|a|^2 + |b|^2} = -\frac{p}{E} = -v$    (4.114)

Similarly, we obtain $\langle h(\phi_R)\rangle = +v$. For antiparticles, whose wave function is represented by $v(p)$, the helicity is inverted. The chirality is a Lorentz-extended version of the helicity operator, with its handedness inverted for antiparticles. Knowledge of the chiral characteristics comes in very handy in understanding weak-interaction phenomenology. Note, however, the chiral states are not the eigenstates of the equation of motion, because $\gamma^5$ does not commute with the Hamiltonian. Helicity states are the Hamiltonian eigenstates, but they change their state by interactions. We have also seen in Eq. (4.110) that the two chiral states mix when the mass term exists in the Hamiltonian. But the chirality is Lorentz invariant and is conserved in vector interactions, which include the strong, weak and electromagnetic interactions.
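The helicity expectation value of $\phi_L$ discussed above can be evaluated for concrete numbers. The sketch below assumes $\phi_L \propto \sqrt{p\cdot\sigma}(\chi_+ + \chi_-)$, so that the helicity $-$ and $+$ components carry weights $\sqrt{E + p}$ and $\sqrt{E - p}$, and compares the result with $-v = -p/E$. The function name and sample values are illustrative assumptions.

```python
import math


def helicity_expectation_left(m, p):
    """Helicity expectation value of phi_L for mass m and momentum magnitude p."""
    energy = math.sqrt(m * m + p * p)
    a = math.sqrt(energy + p)  # weight of the helicity -1 component chi_-
    b = math.sqrt(energy - p)  # weight of the helicity +1 component chi_+
    return (-a * a + b * b) / (a * a + b * b)


for m, p in [(1.0, 0.1), (1.0, 1.0), (1.0, 100.0), (0.0, 1.0)]:
    velocity = p / math.sqrt(m * m + p * p)
    print(m, p, helicity_expectation_left(m, p), -velocity)
```

For a slow particle the two helicities are nearly equally populated, while for $m \to 0$ the expectation value tends to $-1$, which is the Weyl limit described in the text.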
Chirality Conservation in the Vector Interaction

We prove that chirality is conserved in vector-type interactions before and after interaction. The photon, for instance, couples to currents

$\bar\psi_{L(R)}\gamma^\mu\psi_{R(L)} = \psi^\dagger\left(\frac{1 \mp \gamma^5}{2}\right)\gamma^0\gamma^\mu\left(\frac{1 \pm \gamma^5}{2}\right)\psi = \psi^\dagger\gamma^0\gamma^\mu\left(\frac{1 \mp \gamma^5}{2}\right)\left(\frac{1 \pm \gamma^5}{2}\right)\psi = 0$    (4.115)

$\bar\psi\gamma^\mu\psi = (\bar\psi_L + \bar\psi_R)\gamma^\mu(\psi_L + \psi_R) = \bar\psi_L\gamma^\mu\psi_L + \bar\psi_R\gamma^\mu\psi_R$    (4.116)

In the high-energy limit, where the mass can be neglected, chirality is equivalent to helicity except for the sign, hence helicity is conserved, too.
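The statement that a vector current does not connect opposite chiralities can be illustrated numerically. The sketch below assumes the Weyl-representation gamma matrices of Eq. (4.35), builds $\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3$, projects an arbitrarily chosen spinor onto its chiral parts, and checks that the cross terms $\bar\psi_L\gamma^\mu\psi_R$ vanish while the full current is the sum of the LL and RR pieces, as in Eq. (4.116). The sample spinor and all function names are hypothetical.

```python
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def neg(a):
    return [[-x for x in row] for row in a]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)] for i in range(4)]


def matvec(a, v):
    return [sum(a[i][j] * v[j] for j in range(4)) for i in range(4)]


GAMMA = [block(Z2, I2, I2, Z2)] + [block(Z2, s, neg(s), Z2) for s in SIGMA]
_prod = matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3]))
GAMMA5 = [[1j * x for x in row] for row in _prod]  # gamma^5 = i gamma^0 gamma^1 gamma^2 gamma^3


def project(sign, psi):
    """Return (1 + sign * gamma^5) / 2 applied to psi; sign = -1 gives psi_L, +1 gives psi_R."""
    g5psi = matvec(GAMMA5, psi)
    return [(psi[i] + sign * g5psi[i]) / 2 for i in range(4)]


def current(a, mu, b):
    """Return a-bar gamma^mu b = a^dagger gamma^0 gamma^mu b."""
    gb = matvec(matmul(GAMMA[0], GAMMA[mu]), b)
    return sum(complex(x).conjugate() * y for x, y in zip(a, gb))


psi = [0.3 + 0.1j, -1.2j, 0.7, 0.4 - 0.5j]  # arbitrary sample spinor
psi_l, psi_r = project(-1, psi), project(+1, psi)

for mu in range(4):
    cross = current(psi_l, mu, psi_r)
    total = current(psi, mu, psi)
    diagonal = current(psi_l, mu, psi_l) + current(psi_r, mu, psi_r)
    print(mu, abs(cross), abs(total - diagonal))
```

Both printed magnitudes should vanish within rounding for every $\mu$.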

Note that an interaction that respects the gauge principle is obtained by the replacement $\partial_\mu \to \partial_\mu + iqA_\mu$. Namely, the interaction Lagrangian has the form $-qj^\mu A_\mu = -q\bar\psi\gamma^\mu\psi A_\mu$. It is good to remember that the fundamental interaction in the Standard Model is of vector type and preserves chirality.

Comment 4.1 In high-energy experiments we are mainly interested in an energy range from 1 GeV to 1 TeV = 1000 GeV. The mass of the u, d, s quarks and that of the leptons (except the $\tau$) are less than ~ 100 MeV (see Table 2.2). They can, for all practical purposes, be regarded as massless particles. For them the chiral eigenstates are essentially equivalent to the helicity eigenstates (flipped for their antiparticles) and hence are eigenstates of the Hamiltonian, too. The c, b quarks and the $\tau$ lepton have mass in the few gigaelectronvolts range, and in many cases they can be treated similarly. Only for the top quark, whose mass is ~ 171 GeV, is the effect of the mass sizable. This is why we use the Weyl representation, which diagonalizes the chiral operator $\gamma^5$, instead of the Dirac representation, which diagonalizes the mass term $\beta$. For historical reasons, the latter has been commonly used in most textbooks. It is convenient only in treating phenomena of relativistic origin in the nonrelativistic energy range.



Quantum numbers that differentiate particles from antiparticles include electric charge, isospin, strangeness and lepton/baryon number.

Examples  $e^+ / e^-$ (electric charge, lepton number)
          $n / \bar n$ (magnetic moment, baryon number)
          $K^0 / \bar K^0$ (strangeness)

$\pi^0$ and the photon do not have such quantum numbers, and thus they cannot be distinguished from their antiparticles. Now, consider neutrinos. They belong to the lepton family and are considered to have lepton numbers just as charged leptons do. The lepton numbers 1 and -1 can be assigned to the neutrino and antineutrino, respectively. Experimental observations seem to respect lepton number conservation in neutrino reactions. For instance, $\nu_\mu$ and $\bar\nu_\mu$ are produced through the decay of pions:

$\pi^+ \to \mu^+ + \nu_\mu, \quad \pi^- \to \mu^- + \bar\nu_\mu$    (4.117)

and when they collide with nucleons, they produce $\mu^-$ (lepton number 1) and $\mu^+$ (lepton number -1)

$\nu_\mu + n \to \mu^- + p, \quad \bar\nu_\mu + p \to \mu^+ + n$    (4.118)

in such a way as to conserve the lepton number. A $\nu_\mu$ does not produce a $\mu^+$ if it collides with a proton. However, the weak interaction acts only on particles with negative chirality.9) If the particle's mass is zero, it is a Weyl particle and is in a pure helicity (-) state, while the antiparticle is in a pure helicity (+) state. What was considered as lepton number conservation may have been nothing but a test for helicity conservation, although experimentally there is no evidence against lepton number conservation as far as the neutrino is concerned.

Let us assume that the neutrino is a Majorana particle, a name given to a fermion that has no distinct antiparticle, i.e. that is its own antiparticle. Conditions for the Majorana particle are

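1. It satisfies the Dirac equation.
   $(\gamma^\mu i\partial_\mu - m)N(x) = 0$

2. The particle and its antiparticle are identical.
   $N^c = N$

In the Weyl representation, we can put $\psi_L = \begin{bmatrix} \xi \\ 0 \end{bmatrix}$ and $\psi_R = \begin{bmatrix} 0 \\ \eta \end{bmatrix}$. Then define $N_1$ and $N_2$ as follows:

$N_1 = \psi_L + (\psi_L)^c = \begin{bmatrix} \xi \\ -i\sigma_2\xi^* \end{bmatrix}$    (4.120a)

$N_2 = \psi_R + (\psi_R)^c = \begin{bmatrix} i\sigma_2\eta^* \\ \eta \end{bmatrix}$    (4.120b)

They satisfy $N_1^c = N_1$, $N_2^c = N_2$. Applying the Dirac equation to $N_1$ and rewriting the equation for two-component spinors, we obtain

$(\partial_0 - \boldsymbol{\sigma}\cdot\nabla)\xi = -m(\sigma_2\xi^*)$
$(\partial_0 + \boldsymbol{\sigma}\cdot\nabla)(\sigma_2\xi^*) = m\xi$    (4.122)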
re W^ is exchanged. The neutral current 
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The two equations are not independent. The second equation can be derived from 
the first. It is indeed an equation for a two-component spinor. Ni and N 2 have only 
two independent components despite their seemingly four-component structure. 
Similarly we can obtain the equation for N 2 : 

V ' (4.123) 

(d -a-V)(o 2V *) = -m? ] 

From the above argument, we see $\xi$ and $\eta$ satisfy two independent equations. In
the limit of $m \to 0$, the equations become identical to the Weyl equation (4.9).
If $m \neq 0$, the deviation from the Weyl solution is small for relativistic particles
($m \ll E$). $\xi$ is in a chirality minus state and is predominantly left-handed. Similarly,
$\eta$ is predominantly right-handed. Therefore, we call $\xi$ ($\eta$) a left (right)-handed
Majorana particle. Since they satisfy independent equations, their mass need not
be the same in general ($m_L \neq m_R$).



Note, the Majorana particle is a quantum mechanical concept and there is no
equivalent in classical mechanics. If we consider $\xi$ or $\eta$ as expressed by pure
classical numbers, it is straightforward to show that Equation (4.122) has no
solution. This can be seen by inserting $\xi^T = (a, b)$ and $p = 0$ and solving
the equation explicitly for $a$ and $b$. Another way to see this is to note that the
Lagrangian to derive the mass term should contain a term like $\xi^\dagger i\sigma_2\xi^* =
i\,\xi_i^*(\sigma_2)_{ij}\,\xi_j^*$. However, $\sigma_2$ is antisymmetric and it would vanish if $\xi$ were an
ordinary c-number field. In quantum field theory, $\xi$ is an anticommuting field
operator and the contradiction does not appear.



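Let us see the effect of charge conjugation on the left- and right-handed Majorana
neutrinos. After decomposing the four-component spinors $N_1$ and $N_2$ into $N_L$ and
$N_R$ using the projection operators $(1 \mp \gamma^5)/2$

$$N_L = \tfrac{1}{2}(1-\gamma^5)N , \qquad N_R = \tfrac{1}{2}(1+\gamma^5)N \tag{4.124}$$

we apply the charge conjugation operation to them:

$$(N_L)^c = C\,\overline{N_L}^{\,T} = i\gamma^2\gamma^0\,\overline{N_L}^{\,T} = i\gamma^2\,\tfrac{1}{2}(1-\gamma^5)N^* = \tfrac{1}{2}(1+\gamma^5)\,i\gamma^2 N^* = \tfrac{1}{2}(1+\gamma^5)N^c$$

$$\therefore\quad (N_L)^c = (N^c)_R = N_R \tag{4.125a}$$

$$(N_R)^c = (N^c)_L = N_L \tag{4.125b}$$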
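
The chirality flip in Eq. (4.125) can be checked numerically. The following is a minimal sketch, assuming the Weyl (chiral) representation with $\psi = (\psi_L, \psi_R)$, $\gamma^5 = \mathrm{diag}(-1,-1,1,1)$ and $\psi^c = i\gamma^2\psi^*$; the function names `charge_conjugate` and `majorana_from_left` and the sample spinor are illustrative choices, not notation from the text.

```python
import numpy as np

# Pauli matrices and 2x2 building blocks
s1 = np.array([[0, 1], [1, 0]], dtype=complex)
s2 = np.array([[0, -1j], [1j, 0]], dtype=complex)
s3 = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)

# Gamma matrices in the Weyl representation (assumed convention)
g0 = np.block([[Z2, I2], [I2, Z2]])
g1, g2, g3 = (np.block([[Z2, s], [-s, Z2]]) for s in (s1, s2, s3))
g5 = 1j * g0 @ g1 @ g2 @ g3          # comes out as diag(-1, -1, 1, 1)

P_L = (np.eye(4) - g5) / 2           # chirality projectors
P_R = (np.eye(4) + g5) / 2


def charge_conjugate(psi):
    """psi^c = i gamma^2 psi^*, valid here because gamma^0 is real and symmetric."""
    return 1j * g2 @ np.conj(psi)


def majorana_from_left(xi):
    """Build N_1 = psi_L + (psi_L)^c from a two-component spinor xi."""
    psi_L = np.concatenate([np.asarray(xi, dtype=complex), np.zeros(2, dtype=complex)])
    return psi_L + charge_conjugate(psi_L)


xi = np.array([0.3 + 0.1j, -0.7 + 0.4j])   # arbitrary sample spinor
N = majorana_from_left(xi)
print("N^c == N        :", np.allclose(charge_conjugate(N), N))
print("(N_L)^c == N_R  :", np.allclose(charge_conjugate(P_L @ N), P_R @ N))
```

The lower two components of `N` reproduce $-i\sigma_2\xi^*$, as in Eq. (4.120a).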

The Majorana particle changes its handedness by the charge conjugation operation.
No distinction exists between a particle and its antiparticle. The handedness can be
exchanged either by the parity or the charge conjugation operation. From now on,
we adopt the convention that $\psi^c_L = (\psi^c)_L$, to denote the left-handed antiparticle for the
Dirac particle and the left-handed particle for the Majorana particle.

Figure 4.6 Three possibilities for the neutrino: (a) four-component Dirac neutrino
($m \neq 0$), (b) two-component Weyl neutrino ($m = 0$) and (c) two-component Majorana
neutrino ($m \neq 0$). P and C are the parity and charge-conjugation operations,
respectively. L is a Lorentz boost. The neutrino states transform to each other by
the designated operation.



Since $i\sigma_2\xi^*$ ($-i\sigma_2\eta^*$) satisfies the same equation as $\eta$ ($\xi$), it is a right (left)-
handed particle. Show that if $m_L = m_R$ then the chirality L(R) component of
$\psi = (N_1 + iN_2)/\sqrt{2}$ satisfies Eq. (4.51).

Therefore the Dirac particle is equivalent to two Majorana particles having the
same mass. Actually, having the same mass guarantees that the Lagrangian has
an extra symmetry, namely invariance under the phase transformation ($\psi \to \psi' =
e^{-i\alpha}\psi$, $\psi^\dagger \to \psi'^\dagger = e^{i\alpha}\psi^\dagger$), and a conserved particle number according
to Noether's theorem (see Sect. 5.2).

We summarize relative relations of the Dirac, Majorana and Weyl particles in 
Figure 4.6. 

How Can We Tell the Difference Between the Dirac and Majorana Neutrinos? 

If the neutrino is a Majorana particle, lepton number is not conserved. The simplest
reaction that can test it irrespective of the helicity state is a neutrinoless ($0\nu$)
double beta decay, in which a nucleus emits two electrons with no neutrinos:

$$0\nu\beta\beta\ \text{decay:}\quad (A, Z) \to (A, Z+2) + e^- + e^- \tag{4.126}$$

A double beta decay that emits two neutrinos ($2\nu$) (to conserve the lepton number)
appears as a higher order weak interaction and has been observed:

$$2\nu\beta\beta\ \text{decay:}\quad (A, Z) \to (A, Z+2) + e^- + e^- + \bar\nu_e + \bar\nu_e$$

Another clue to distinguish the Majorana particle from the Dirac particle is the
fact that the former cannot have a magnetic moment. This is because by charge
conjugation the magnetic moment changes its sign but the particle itself remains
as it is. However, if there is more than one kind of Majorana particle they can
transform into each other through magnetic interaction. This feature is sometimes
applied to solar or cosmic magnetic field phenomena to test the neutrino property.
A charged Dirac particle can have a magnetic moment, as already described for
the case of the electron. A neutral Dirac particle can also have a magnetic moment
through high-order intermediate states and also an electric dipole moment if time
reversal invariance is violated. 10) According to the Standard Model calculation, the
magnetic moment of the Dirac neutrino is [151]

$$\mu_\nu \simeq \frac{3 e G_F m_\nu}{8\sqrt{2}\,\pi^2} \approx 3\times 10^{-19}\left(\frac{m_\nu}{1\ \mathrm{eV}}\right)\mu_B$$

which is far below the observable limit with the present technology. However, if a
new interaction exists which induces a large magnetic moment, it may be exposed
by improved experiments.






5 

Field Quantization 



Before we quantize fields, we need to understand the Lagrangian formalism and
derivation of invariants when the Lagrangian respects a certain symmetry. Important
invariants are the energy-momentum and the electric charge, whose underlying
symmetries are translational invariance in space-time and ga



5.1.1 

Equations of Motion 

In solving a particle motion in classical mechanics, we usually start from Newton's 
equation of motion, where the path of the particle motion is traced by successively 
connecting infinitesimal changes at each point. There is another approach based 
on the action principle, where all the possible paths between two points are inves- 
tigated and the one with the minimum action is selected. The two approaches are 
equivalent in classical mechanics. In quantum mechanics, however, because of the 
wave nature of a particle, the path chosen by the least action is the most proba- 
ble, but other paths are possible, too. The deviation from the most probable path 
is very small, of the order determined by Planck's constant, but it is quite sizable 
in the microcosm extending from molecules and atoms down to elementary par- 
ticles, i.e. in the world where quantum mechanics applies. A good example is the 
double slit interference phenomenon of an electron beam. Therefore, in our study 
of elementary particles, a logical choice is to start from the action principle. 

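In the action principle, a particle's path from a point $q_1 = q(t_1)$ to $q_2 = q(t_2)$ is
such that it minimizes the "action $S$" defined by an integral of the Lagrangian $L$

$$S = \int_{t_1}^{t_2} L(q, \dot q)\,dt \tag{5.1a}$$

$$L = K - V = \tfrac{1}{2}m\dot q^2 - V(q) \tag{5.1b}$$

where $K$ is the kinetic energy of the particle and $V$ is some potential in which it
moves. The action is a functional of both the position $q$ and the velocity $\dot q$ of the
particle, which, in turn, are functions of time $t$. Fixing the positions $q_1$ and $q_2$,
we change the path $q(t)$ by an infinitesimal amount $\delta q = \delta q(t)$; then the small
change in $S$ (variation $\delta S$) should vanish if the action is at the minimum:

$$\delta S = \int_{t_1}^{t_2} dt\left[\frac{\partial L}{\partial q}\delta q + \frac{\partial L}{\partial \dot q}\delta\dot q\right] = \left[\frac{\partial L}{\partial \dot q}\delta q\right]_{t_1}^{t_2} + \int_{t_1}^{t_2} dt\,\delta q\left[\frac{\partial L}{\partial q} - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right)\right] = 0$$

We have used $\delta\dot q = \delta(dq/dt) = d(\delta q)/dt$. Since $\delta q(t_1) = \delta q(t_2) = 0$ and $\delta q$ is
arbitrary

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q}\right) - \frac{\partial L}{\partial q} = 0$$

which is called the Euler-Lagrange equation. One advantage of the Lagrangian formalism
is that the equation holds for a general coordinate system. Applying it to a
harmonic oscillator where $V(q) = m\omega^2 q^2/2$, the Lagrangian becomes

$$L = \tfrac{1}{2}m\dot q^2 - \tfrac{1}{2}m\omega^2 q^2$$

The Euler-Lagrange equation gives

$$\ddot q = -\omega^2 q \tag{5.4}$$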
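
The statement that the classical path extremizes the action can be illustrated numerically for the harmonic oscillator. This is a minimal sketch, assuming $m = \omega = 1$, a midpoint discretization of the action, and a time interval shorter than half a period (for which the stationary path is a true minimum); the helper names `action`, `classical_path` and `perturbed` are hypothetical.

```python
import math


def action(path, dt, m=1.0, omega=1.0):
    """Discretized action: sum over segments of (K - V) * dt, midpoint rule."""
    total = 0.0
    for q0, q1 in zip(path[:-1], path[1:]):
        v = (q1 - q0) / dt
        q_mid = 0.5 * (q0 + q1)
        total += (0.5 * m * v * v - 0.5 * m * omega**2 * q_mid**2) * dt
    return total


def classical_path(n, t_total, omega=1.0):
    """Sample the solution q(t) = sin(omega t) of q'' = -omega^2 q on n segments."""
    dt = t_total / n
    return [math.sin(omega * k * dt) for k in range(n + 1)], dt


def perturbed(path, eps, mode=1):
    """Add eps * sin(mode * pi * k / n); the perturbation vanishes at both end points."""
    n = len(path) - 1
    return [q + eps * math.sin(mode * math.pi * k / n) for k, q in enumerate(path)]


path, dt = classical_path(1000, 1.0)
S0 = action(path, dt)
print("classical action:", S0)
for eps in (0.01, -0.01):
    print("eps =", eps, " S - S0 =", action(perturbed(path, eps), dt) - S0)
```

Perturbations of either sign raise the action, and the increase scales as the square of `eps`, which is what a vanishing first variation implies.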



5.1.2 

Hamiltonian Formalism 

The Hamiltonian formalism is another way to derive equations of motion equivalent
to the action principle and is particularly useful in deriving quantum mechanical
equations of motion. In mechanics of a point particle, the Lagrangian uses $q$
and $\dot q$, while the Hamiltonian formalism chooses to use $q$ and its canonical conjugate
momentum defined by

$$p = \frac{\partial L}{\partial \dot q} \tag{5.6}$$

as two independent variables. $q$ and $p$ are called canonical variables. Then

$$dL = \frac{\partial L}{\partial q}dq + \frac{\partial L}{\partial \dot q}d\dot q = \frac{\partial L}{\partial q}dq + p\,d\dot q$$

If we define the Hamiltonian by

$$H \equiv p\dot q - L \tag{5.8}$$

then

$$dH = d(p\dot q - L) = -\frac{\partial L}{\partial q}dq + \dot q\,dp = -\dot p\,dq + \dot q\,dp \tag{5.9}$$

We used the Euler-Lagrange equation in deriving the last equation. From the above
relation, it follows that

$$\dot q = \frac{\partial H}{\partial p} , \qquad \dot p = -\frac{\partial H}{\partial q} \tag{5.10}$$

Equations (5.9) and (5.10) mean the Hamiltonian is a constant of motion if all the
time dependences are in $p$ and $q$, namely if it does not have explicit time dependence.
For a harmonic oscillator

$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2 q^2 \tag{5.11}$$

$$\dot q = \frac{p}{m} , \quad \dot p = -m\omega^2 q \;\Rightarrow\; \ddot q = -\omega^2 q \tag{5.12}$$

Eq. (5.12) reproduces the equation of motion for the harmonic oscillator Eq. (5.4).
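
As a numerical illustration of Eqs. (5.10)-(5.12), Hamilton's equations for the harmonic oscillator can be integrated step by step and the constancy of $H$ checked. This is a sketch only: the leapfrog (velocity Verlet) scheme, the step size and the function names are choices made here, not part of the text.

```python
import math


def hamiltonian(q, p, m=1.0, omega=1.0):
    """H = p^2 / 2m + m omega^2 q^2 / 2, Eq. (5.11)."""
    return p * p / (2.0 * m) + 0.5 * m * omega**2 * q * q


def leapfrog(q, p, dt, steps, m=1.0, omega=1.0):
    """Integrate q' = dH/dp = p/m and p' = -dH/dq = -m omega^2 q."""
    for _ in range(steps):
        p -= 0.5 * dt * m * omega**2 * q   # half kick
        q += dt * p / m                    # drift
        p -= 0.5 * dt * m * omega**2 * q   # half kick
    return q, p


steps = 6283
dt = 2.0 * math.pi / steps                 # one full period for omega = 1
q, p = leapfrog(1.0, 0.0, dt, steps)
print("q, p after one period:", q, p)
print("energy drift:", hamiltonian(q, p) - hamiltonian(1.0, 0.0))
```

After one period the oscillator returns to its starting point and the energy is unchanged up to the small discretization error of the integrator.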

5.1.3 

Equation of a Field 

Let us extend the action to a system of $N^3$ particles oscillating independently that
are distributed uniformly in a large but finite volume $V = L^3$, each oscillator occupying
a volume $\Delta V = (L/N)^3$. This is expressed as

$$S = \int dt \sum_r L(q_r, \dot q_r) = \int dt \sum_r \left(\tfrac{1}{2}m_r\dot q_r^2 - \tfrac{1}{2}m_r\omega_r^2 q_r^2\right) \tag{5.13}$$

Taking $N^3$ independent variations for $q_r$, we obtain $N^3$ Euler-Lagrange equations:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_r}\right) - \frac{\partial L}{\partial q_r} = 0 \tag{5.14}$$

In the continuous limit where the spacing $L/N$ goes to zero, the collection of oscillators
is called a "field". Instead of numbering by $r$, we may use the space coordinate
$\mathbf{x} = (x, y, z)$ to number the oscillators. The nomenclature "field" is by
definition some quantity defined at all space positions. In this book we restrict
its usage to mean an infinite collection of oscillators distributed over all space. To
make a connection between the field and discrete oscillators, the field $\phi(t, \mathbf{x})$ may
be represented by its average value over $\Delta V$ at the $r$-th point, and $q_r$ may be associated with
$\phi(\mathbf{x})$ scaled with the volume $\Delta V$:

many-particle system $\to$ field
$q_r \to \phi(\mathbf{x})$
numbering by $r$ $\to$ numbering by $\mathbf{x}$ (5.15)

$\phi(\mathbf{x})$ need not necessarily be space oscillators. It may represent any oscillating
physical object, such as the electromagnetic field. Therefore the action is represented
as

$$S = \int dt\left[\int d^3x\,\mathcal{L}\big(\phi(\mathbf{x}), \dot\phi(\mathbf{x})\big)\right] \equiv \int d^4x\,\mathcal{L}\big(\phi(x), \partial_t\phi(x)\big) \tag{5.16}$$

In the last equation $x$ is used to denote the four space-time coordinates instead of $\mathbf{x}$,
as $\phi$ is a function of time, too. $\mathcal{L}$ is called the Lagrangian density. In field theory, the
action appears as a 4-integral of the Lagrangian density, $S = \int dt\,L = \int d^4x\,\mathcal{L}$; we
may call it simply the Lagrangian where there is no confusion. The field is an infinite
number of some physical objects oscillating independently at position $\mathbf{x}$. If we
want a field whose oscillation propagates in space, namely a wave, each oscillator
has to have some connections with its neighbors. Remember the mechanical model
of the field considered in Sect. 2.2.5. Receiving force from or exerting force on
the neighboring oscillators was the cause of wave propagation. In the continuous
limit, that means the Lagrangian density depends on the space derivatives of the field,
and Eq. (5.16) should include a space differential:

$$\mathcal{L}\big(\phi(x), \dot\phi(x)\big) \to \mathcal{L}\big(\phi(x), \partial_\mu\phi(x)\big) , \qquad \partial_\mu = (\partial_0, \nabla) \tag{5.17}$$

where $\partial_t$ is replaced by $\partial_0 = (1/c)(\partial/\partial t)$ to make the dimension of the four-differential
equal. Taking an infinitesimal variation of the action

$$\delta S = \int d^4x\left[\frac{\delta\mathcal{L}}{\delta\phi}\delta\phi + \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi)}\delta(\partial_\mu\phi)\right]$$

and partial integration of the second term gives surface integrals, including $\delta\phi(x)$.
In a mechanical system with finite degrees of freedom, the boundary condition
was $\delta q_r(t_1) = \delta q_r(t_2) = 0$. For the field we require that $\delta\phi(x) = 0$ also at some
boundary point, which is usually chosen at infinity. Then the surface integral vanishes
and

$$\partial_\mu\left(\frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi)}\right) - \frac{\delta\mathcal{L}}{\delta\phi} = 0$$


This is the Euler-Lagrange equation for the field. An obvious advantage of the Lagrangian
formalism is that the Lagrangian density is a Lorentz scalar and easy to
handle. The symmetry structure is also easy to see. The Lagrangian density is chosen
to give known equations of the field or is sometimes given a priori from some
symmetry principle. Those that produce the real or complex Klein-Gordon equations,
the Maxwell equations of the electromagnetic field, and the Dirac equation are
given by

$$\mathcal{L}_{\rm KG} = \tfrac{1}{2}\left(\partial_\mu\phi\,\partial^\mu\phi - m^2\phi^2\right) \quad \text{real field} \tag{5.20a}$$

$$\mathcal{L}_{\rm KG} = \partial_\mu\phi^\dagger\,\partial^\mu\phi - m^2\phi^\dagger\phi \quad \text{complex field} \tag{5.20b}$$

$$\mathcal{L}_{\rm em} = -\tfrac{1}{4}F_{\mu\nu}F^{\mu\nu} = \tfrac{1}{2}\left(\mathbf{E}^2 - \mathbf{B}^2\right) \quad \text{electromagnetic field} \tag{5.20c}$$

$$\mathcal{L}_{\rm Dirac} = \bar\psi\left(i\gamma^\mu\partial_\mu - m\right)\psi \quad \text{Dirac field 1)} \tag{5.20d}$$



Using the Euler-Lagrange equations, derive the equations of motion from the La- 
grangian Eq. (5.20). 



Massive Vector Field 

For comparison, we give here the Lagrangian of a massive vector field to realize
some special features of the massless electromagnetic field. The Lagrangian of a
massive vector field is given by analogy to the electromagnetic Lagrangian by

$$\mathcal{L}_{\rm Proca} = -\tfrac{1}{4}f_{\mu\nu}f^{\mu\nu} + \tfrac{1}{2}m^2 W_\mu W^\mu - g\,W_\mu j^\mu \tag{5.22a}$$

$$f^{\mu\nu} = \partial^\mu W^\nu - \partial^\nu W^\mu \tag{5.22b}$$

The equation of motion is given by

$$\partial_\mu f^{\mu\nu} + m^2 W^\nu = g\,j^\nu \tag{5.23}$$

which is called the Proca equation. If $j^\mu$ is a conserved current, then by taking the
divergence of Eq. (5.23), we can derive an equation

$$\partial_\mu W^\mu = 0 \tag{5.24}$$

Then Eq. (5.23) is equivalent to the following two equations:

$$\left(\partial_\mu\partial^\mu + m^2\right)W^\nu = g\,j^\nu , \qquad \partial_\mu W^\mu = 0 \tag{5.25}$$

Equation (5.25) shows that the number of independent degrees of freedom of the
Proca field is three, and when quantized, it represents a particle with spin 1. In
the case of the electromagnetic field (or an $m = 0$ vector field in general), unlike the
massive field, the condition Eq. (5.24) does not come out automatically and has to
be imposed as a gauge condition.
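
As a short worked step, the divergence of the Proca equation (5.23) can be written out explicitly. This sketch uses only the antisymmetry of $f^{\mu\nu}$ and current conservation, and it makes visible why the argument requires $m \neq 0$:

```latex
\begin{aligned}
\partial_\nu\left(\partial_\mu f^{\mu\nu} + m^2 W^\nu\right) &= g\,\partial_\nu j^\nu \\
\partial_\nu\partial_\mu f^{\mu\nu} &= 0
  \qquad \text{since } f^{\mu\nu} = -f^{\nu\mu} \text{ and } \partial_\nu\partial_\mu \text{ is symmetric} \\
\partial_\nu j^\nu &= 0
  \qquad \text{(conserved current)} \\
\Longrightarrow\quad m^2\,\partial_\nu W^\nu &= 0 ,
  \qquad \text{hence } \partial_\nu W^\nu = 0 \text{ for } m \neq 0 .
\end{aligned}
```

For $m = 0$ the last line is satisfied identically and gives no constraint on $\partial_\nu W^\nu$.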

1) Sometimes a symmetrized version is used:

$$\mathcal{L}_{\rm Dirac} = \tfrac{1}{2}\left[\left(-i\partial_\mu\bar\psi\,\gamma^\mu\psi - m\bar\psi\psi\right) + \bar\psi\left(\gamma^\mu i\partial_\mu - m\right)\psi\right] \tag{5.21}$$

It differs from Eq. (5.20d) by a surface integral.

Hamiltonian Formalism for Fields

The conjugate momentum field is defined just like particle momentum Eq. (5.6):

$$\pi(x) = \frac{\delta\mathcal{L}}{\delta\dot\phi} \tag{5.26}$$

Likewise, the Hamiltonian density for the field can be obtained using the same
formula as Eq. (5.8)

$$\mathcal{H}(x) = \pi(x)\dot\phi(x) - \mathcal{L} \tag{5.27}$$

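The Hamiltonian is a constant of motion as is the case for point particle mechanics,

$$\frac{dH}{dt} = \frac{d}{dt}\int d^3x\,\mathcal{H} = 0 \tag{5.28}$$

Proof:

$$\dot{\mathcal{H}} = \frac{d}{dt}\left(\pi\dot\phi - \mathcal{L}\right) = \dot\pi\dot\phi + \pi\ddot\phi - \left(\frac{\partial\mathcal{L}}{\partial t} + \frac{\delta\mathcal{L}}{\delta\phi}\dot\phi + \frac{\delta\mathcal{L}}{\delta\dot\phi}\ddot\phi + \frac{\delta\mathcal{L}}{\delta(\partial_k\phi)}\partial_k\dot\phi\right) \tag{5.29a}$$

$$\dot\pi\dot\phi = \frac{\partial}{\partial t}\left(\frac{\delta\mathcal{L}}{\delta\dot\phi}\right)\dot\phi = \left[-\partial_k\left(\frac{\delta\mathcal{L}}{\delta(\partial_k\phi)}\right) + \frac{\delta\mathcal{L}}{\delta\phi}\right]\dot\phi \tag{5.29b}$$

where we have used the Euler-Lagrange equation to obtain the last equality in
Eq. (5.29b). Partially integrating the first term under the space integral gives

$$\int d^3x\,\dot\pi\dot\phi = \int d^3x\left[\frac{\delta\mathcal{L}}{\delta(\partial_k\phi)}\partial_k\dot\phi + \frac{\delta\mathcal{L}}{\delta\phi}\dot\phi\right] \tag{5.29c}$$

Because of this equality, the first term of Eq. (5.29a) is canceled by the second and
fourth terms in large parentheses in Eq. (5.29a) and the second term by the third
term in large parentheses. Then

$$\dot{\mathcal{H}} = -\frac{\partial\mathcal{L}}{\partial t} \tag{5.29d}$$

Therefore unless $\mathcal{L}$ has an explicit dependence on $t$, 2) the time variation of the
Hamiltonian density vanishes. □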

The Hamiltonian formalism is convenient for quantization. But as we shall see 
soon, it is the 00 component of a Lorentz tensor and hence more complicated from 
the Lorentz covariance point of view than a simple scalar Lagrangian. 

2) Usually this happens when a time-dependent external force acts on the system. Remember
mechanical energy conservation under the effect of a potential.

5.1.4

Noether's Theorem

We have seen that the energy of a field is a conserved quantity. But a question aris- 
es immediately: are there any other conserved quantities? There is a convenient 
and systematic way of deriving conserved quantities known as Noether's theorem. 
It shows that there is a constant of motion for each continuous symmetry trans- 
formation that keeps the Lagrangian invariant. Then observed selection rules can 
be rephrased in terms of symmetry requirements of the Lagrangian and serve as 
useful guides for introducing new interaction terms when developing new theories.
Let us assume the Lagrangian is a function of the fields $\phi_r$ and $\partial_\mu\phi_r$, where $r$
represents some internal degree of freedom, and that it has no explicit dependence
on the coordinates $x^\mu$. This is the case when the fields do not couple with an external
source, i.e. if the system is closed. Then for some infinitesimal variation of the
field $\phi_r$, the variation of the Lagrangian is given by

$$\delta\mathcal{L} = \frac{\delta\mathcal{L}}{\delta\phi_r}\delta\phi_r + \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta(\partial_\mu\phi_r) \tag{5.30}$$

As $\delta(\partial_\mu\phi_r) = \partial_\mu\delta\phi_r$, the above equation can be rewritten as

$$\delta\mathcal{L} = \left[\frac{\delta\mathcal{L}}{\delta\phi_r} - \partial_\mu\left(\frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\right)\right]\delta\phi_r + \partial_\mu\left(\frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta\phi_r\right) \tag{5.31}$$

The first term vanishes because of the Euler-Lagrange equation. So if the Lagrangian
is invariant for some continuous transformation of $\phi_r$, the second term
must vanish, which defines a conserved current $J^\mu$. This is called the Noether current
and satisfies the continuity equation

$$J^\mu = \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta\phi_r , \qquad \partial_\mu J^\mu = 0 \tag{5.32}$$

The meaning of the conserved current is that the space integral of the 0th component
does not change with time, i.e.

$$Q = \int d^3x\,J^0(x) , \qquad \frac{dQ}{dt} = 0 \tag{5.33}$$

Proof:

$$\frac{dQ}{dt} = \frac{d}{dt}\int d^3x\,J^0 = \int d^3x\,\partial_0 J^0 = -\int d^3x\,\nabla\cdot\mathbf{J} = -\int_S d\sigma\,J_n = 0$$

where in going to the fourth equality we have used Gauss's theorem to change
the volume integral to a surface integral, and the last equality follows from the
boundary condition that the fields vanish at infinity. □

Conserved Charge

As an example, let us derive a charged current for a complex field. The Lagrangian
given by Eq. (5.20b)

$$\mathcal{L}_{\rm KG} = \partial_\mu\phi^\dagger\,\partial^\mu\phi - m^2\phi^\dagger\phi \tag{5.35}$$

is invariant under continuous phase transformation

$$\phi \to e^{-iq\alpha}\phi , \qquad \phi^\dagger \to \phi^\dagger e^{iq\alpha} \tag{5.36}$$

where $\alpha$ is a continuous variable but constant in the sense it does not depend on $x$.
The phase transformation has nothing to do with the space-time coordinates and
is an example of internal symmetry, referred to as $U(1)$ symmetry. Then taking

$$\delta\phi_1 = \delta\phi = -iq\,\delta\alpha\,\phi , \qquad \delta\phi_2 = \delta\phi^\dagger = iq\,\delta\alpha\,\phi^\dagger \tag{5.37}$$

the Noether current is expressed as

$$J^\mu = iq\left[\phi^\dagger\partial^\mu\phi - (\partial^\mu\phi^\dagger)\phi\right]\delta\alpha \tag{5.38}$$

Removing $\delta\alpha$ we have reproduced the conserved current [Eq. (2.41)]. The conserved
charge is expressed as

$$Q = -iq\int d^3x \sum_r\left[\pi_r(x)\phi_r(x) - \pi_r^\dagger(x)\phi_r^\dagger(x)\right] \tag{5.39a}$$

$$\phantom{Q} = iq\int d^3x\left[\phi^\dagger\dot\phi - \dot\phi^\dagger\phi\right] \tag{5.39b}$$

The first expression is a more general form, while the second one is specific to the
complex Klein-Gordon field.



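The Lagrangian density of the Dirac field is made up of combinations of $\psi(x)$, $\bar\psi(x)$
and is invariant under the gauge transformation

$$\psi \to e^{-i\alpha}\psi , \qquad \bar\psi \to \bar\psi\,e^{i\alpha}$$

Prove that the Noether current corresponding to infinitesimal transformations
$\delta\psi = -i\,\delta\alpha\,\psi$, $\delta\bar\psi = i\,\delta\alpha\,\bar\psi$ is given by

$$J^\mu = \bar\psi\gamma^\mu\psi$$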

Energy-Momentum Tensor 

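We consider here translational invariance under an infinitesimal displacement of
the coordinate

$$x^\mu \to x'^\mu = x^\mu + \delta x^\mu$$

Then

$$\delta\phi_r = \phi_r(x^\mu + \delta x^\mu) - \phi_r(x) = \partial_\mu\phi_r\,\delta x^\mu \tag{5.43a}$$

$$\delta(\partial_\mu\phi_r) = \partial_\mu\phi_r(x^\nu + \delta x^\nu) - \partial_\mu\phi_r(x) = \partial_\mu\partial_\nu\phi_r\,\delta x^\nu \tag{5.43b}$$

If the Lagrangian does not depend explicitly on $x^\mu$, the variation can be expressed
in two ways:

$$\delta\mathcal{L}_1 = \partial_\mu\mathcal{L}\,\delta x^\mu \tag{5.44a}$$

$$\delta\mathcal{L}_2 = \frac{\delta\mathcal{L}}{\delta\phi_r}\delta\phi_r + \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta(\partial_\mu\phi_r) \tag{5.44b}$$

In calculating the action variation, unlike the internal transformation where the
variation of the field is evaluated at the same space-time, we have to take into account
the volume change in the integral, too, i.e.

$$\delta S = \int d^4x'\,\mathcal{L}(x') - \int d^4x\,\mathcal{L}(x) = \int d^4x\,\delta\mathcal{L} + \int (d^4x' - d^4x)\,\mathcal{L} \tag{5.45}$$

The second term can be calculated using the Jacobian of the transformation, but
here we rely on the fact that $\delta S = 0$, which means it is equal to minus the first
term. Therefore,

$$0 = \int d^4x\left[\frac{\delta\mathcal{L}}{\delta\phi_r}\delta\phi_r + \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta(\partial_\mu\phi_r) - \partial_\mu\mathcal{L}\,\delta x^\mu\right] \tag{5.46}$$

Inserting Eq. (5.43) into the first term of Eq. (5.46), the integrand becomes

$$\left[\frac{\delta\mathcal{L}}{\delta\phi_r} - \partial_\mu\left(\frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\right)\right]\partial_\nu\phi_r\,\delta x^\nu + \partial_\mu\left[\frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\partial_\nu\phi_r - \delta^\mu_\nu\mathcal{L}\right]\delta x^\nu \tag{5.47a}$$

The first term vanishes because of the Euler-Lagrange equation. Thus we have
obtained a conserved tensor called the energy-momentum tensor:

$$\partial_\mu T^{\mu\nu} = 0 \tag{5.47b}$$

$$T^{\mu\nu} = \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\partial^\nu\phi_r - g^{\mu\nu}\mathcal{L} \tag{5.47c}$$

The name is given because

$$T^{00} = \frac{\delta\mathcal{L}}{\delta(\partial_0\phi_r)}\partial^0\phi_r - \mathcal{L} \tag{5.48}$$

agrees with the Hamiltonian density given in Eq. (5.27). Its space integral is a
constant of motion and reconfirms Eq. (5.28). Then, by Lorentz covariance, $P^i =
\int d^3x\,T^{0i} = \int d^3x\,\mathcal{P}^i$ are momenta of the field. They are also constants of motion.

Problem 5.3

Using the Jacobian prove that



Calculate the energy momentum tensor of the scalar field whose Lagrangian is
given by Eq. (5.20a) and show that it is symmetric with respect to suffixes $\mu$ and $\nu$.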

We can derive the angular momentum tensor by requiring the invariance under
an infinitesimal Lorentz transformation given in Eq. (3.79)

$$\delta x^\mu = \epsilon^\mu{}_\nu x^\nu \tag{5.50a}$$

$$\delta\phi(x) = \epsilon^\mu{}_\nu x^\nu\,\partial_\mu\phi(x) \tag{5.50b}$$

Problem 5.5

(a) Show the corresponding angular momentum tensor is given by

$$M^{\rho,\mu\nu} = T^{\rho\nu}x^\mu - T^{\rho\mu}x^\nu \tag{5.51}$$

(b) Show that

$$\partial_\rho M^{\rho,\mu\nu} = T^{\mu\nu} - T^{\nu\mu} \tag{5.52}$$

Hint: Replace the second term in Eq. (5.47a) by $\delta x^\nu \to \epsilon^\nu{}_\mu x^\mu$ and use $\epsilon_{\nu\mu} = -\epsilon_{\mu\nu}$.

Problem 5.5 shows that $T^{\mu\nu} = T^{\nu\mu}$ is a necessary condition for the angular
momentum tensor to be conserved. However, the expression for $T^{\mu\nu}$ in Eq. (5.47c)
is not manifestly symmetric and indeed it is not always symmetric. We note that the
energy momentum tensor itself is not a measurable quantity but its space integral
is. So there is the freedom to add a total derivative. We may add a term of the form

$$\partial_\lambda f^{\lambda\mu\nu} , \qquad f^{\lambda\mu\nu} = -f^{\mu\lambda\nu} \tag{5.53}$$

so that

$$\partial_\mu\partial_\lambda f^{\lambda\mu\nu} = 0 \tag{5.54}$$

We consider a new energy momentum tensor

$$T^{\mu\nu} \to T^{\mu\nu} + \partial_\lambda f^{\lambda\mu\nu} \tag{5.55}$$

This new energy momentum tensor is conserved and we can always choose this
extra term to make it symmetric in $\mu$ and $\nu$.

There is another reason to have a symmetric energy momentum tensor. In general
relativity, the symmetric gravitational field or curvature tensor $R_{\mu\nu}$ couples to
the energy momentum tensor through Einstein's equations:

$$R_{\mu\nu} - \frac{1}{2}g_{\mu\nu}R = \frac{8\pi G}{c^4}T_{\mu\nu} \tag{5.56}$$

The Ricci tensor $R_{\mu\nu}$ and the metric tensor $g_{\mu\nu}$ are both symmetric and $T_{\mu\nu}$ should
be symmetric, too.

The Complex Field 

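We now apply the energy momentum tensor formula to Eq. (5.20b) and derive the
Hamiltonian and the momentum operators expressed in terms of the fields:

$$T^{\mu\nu} = \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi)}\partial^\nu\phi + \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi^\dagger)}\partial^\nu\phi^\dagger - g^{\mu\nu}\mathcal{L}$$

$$\begin{aligned}\mathcal{H} = T^{00} &= \sum_r \frac{\delta\mathcal{L}}{\delta(\partial_0\phi_r)}\partial^0\phi_r - \mathcal{L}\\ &= \partial^0\phi^\dagger\,\dot\phi + \dot\phi^\dagger\,\partial^0\phi - \left(\partial_\mu\phi^\dagger\partial^\mu\phi - m^2\phi^\dagger\phi\right)\\ &= \dot\phi^\dagger\dot\phi + \nabla\phi^\dagger\cdot\nabla\phi + m^2\phi^\dagger\phi\end{aligned} \tag{5.57a}$$

$$\mathcal{P}^i = T^{0i} = \sum_r \frac{\delta\mathcal{L}}{\delta(\partial_0\phi_r)}\partial^i\phi_r , \qquad \boldsymbol{\mathcal{P}} = -\left(\dot\phi^\dagger\nabla\phi + \nabla\phi^\dagger\dot\phi\right) \tag{5.57b}$$

$$H = \int d^3x\,\mathcal{H} = \int d^3x\left[\dot\phi^\dagger\dot\phi + \nabla\phi^\dagger\cdot\nabla\phi + m^2\phi^\dagger\phi\right] \tag{5.58a}$$

$$\mathbf{P} = \int d^3x\,\boldsymbol{\mathcal{P}} = -\int d^3x\left[\dot\phi^\dagger\nabla\phi + \nabla\phi^\dagger\dot\phi\right] \tag{5.58b}$$

(1) Prove that $H$ and $\mathbf{P}$ are conserved quantities by explicitly differentiating them.

(2) Prove that the Hamiltonian density $\mathcal{H}$ and the momentum density $\boldsymbol{\mathcal{P}}$ satisfy
the continuity equation, namely

$$\frac{\partial\mathcal{H}}{\partial t} + \nabla\cdot\boldsymbol{\mathcal{P}} = 0 \tag{5.59}$$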



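Problem 5.7

Using the Lagrangian of the electromagnetic field, Eq. (5.20c), show

$$\mathcal{H} = \tfrac{1}{2}\left(\mathbf{E}^2 + \mathbf{B}^2\right) \tag{5.60a}$$

$$\boldsymbol{\mathcal{P}} = \mathbf{E}\times\mathbf{B} \tag{5.60b}$$

Problem 5.8

Using the Lagrangian of the Dirac field, Eq. (5.20d), show

$$H = \int d^3x\,\bar\psi\left(-i\boldsymbol\gamma\cdot\nabla + m\right)\psi = \int d^3x\,\psi^\dagger\,i\frac{\partial}{\partial t}\psi \tag{5.61a}$$

$$\mathbf{P} = \int d^3x\,\psi^\dagger(-i\nabla)\psi \tag{5.61b}$$

$$Q = q\int d^3x\,\psi^\dagger\psi \tag{5.61c}$$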
Quantization Scheme 

There are several ways to make the transition from classical mechanics to quantum 
mechanics. Probably the most powerful method is that of the path integral, which 
deals with the action, taking all possible paths into account. Historically, howev- 
er, the Hamiltonian formalism was used first and has been extensively used since 
then. The path integral method is a powerful tool for treating the formal aspects 
of the theory, and indispensable in discussing non-Abelian gauge theories, but if 
one is interested in quick practice at calculating transition rates, the conventional 
Hamiltonian formalism is more easily accessible using the accumulated knowledge
of quantum mechanics. Since the path integral uses mathematical methods that
require some acquaintance before realistic problems can be solved, we
shall defer the study of the path integral method until Chap. 10.

5.2.1 

Heisenberg Equation of Motion 

To derive the quantum mechanical representation of the equation of motion that is
suitable for the treatment of the harmonic oscillator, we start from the Schrödinger
equation

$$i\frac{\partial}{\partial t}|\psi(t)\rangle = H|\psi(t)\rangle \tag{5.62}$$

Here, we have used Dirac's notation to describe state vectors in Hilbert space on
which operators in quantum mechanics operate. The formal solution to the equation
is expressed as

$$|\psi(t)\rangle = e^{-iHt}|\psi(0)\rangle \tag{5.63}$$

where we have suppressed the space coordinate dependence. The Hamiltonian in
the Schrödinger equation contains operators as functions of canonical variables
$q$, $p$ in the form $q \to x$, $p \to -i\nabla$, which are time independent. The time
dependence is in the state vector. This is called the Schrödinger picture. In the
following, we attach a subscript S to quantities in the Schrödinger picture and a
subscript H to the Heisenberg picture that we describe now. We define state vectors
and operators in the Heisenberg picture by

$$|\psi_H\rangle = e^{iH_S t}|\psi_S\rangle = |\psi_S(0)\rangle \tag{5.64a}$$

$$O_H = e^{iH_S t}\,O_S\,e^{-iH_S t} \tag{5.64b}$$

In the Heisenberg picture, all the time dependence is in the operators and the
state vectors do not change with time. From Eq. (5.64b), we can derive the time
dependence of the Heisenberg operators:

$$\frac{dO_H}{dt} = i[H_S, O_H] \tag{5.65}$$

Putting $O_S = H_S$, Eq. (5.64b) shows $H_H = H_S$ and from Eq. (5.65) we see that the
Hamiltonian does not change with time, i.e. it is a conserved quantity. 3) From now
on, we omit the subscript of the Hamiltonian and simply write it as $H$. Then the
equation that the Heisenberg operators satisfy is written as

$$\frac{dO_H}{dt} = i[H, O_H] \tag{5.66}$$

which is called the Heisenberg equation of motion. Its relativistic version can be
expressed as

$$\partial_\mu O_H = i[P_\mu, O_H] \tag{5.67}$$

This is still a quantum mechanical equation. We can prove that Eq. (5.67) is also
valid in quantum field theory, namely if the operators $P_\mu$ and $O_H$ are functions of
quantized fields. We shall prove this later in Sect. 9.1.1.
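
The Heisenberg equation of motion, Eq. (5.66), can be checked numerically for a small matrix model. This is a sketch under stated assumptions: a two-level Hermitian Hamiltonian chosen arbitrarily, natural units ($\hbar = 1$), and a finite-difference time derivative; `evolve_heisenberg` and `commutator` are hypothetical helper names.

```python
import numpy as np


def evolve_heisenberg(O_s, H, t):
    """O_H(t) = exp(iHt) O_S exp(-iHt) for Hermitian H, via its eigendecomposition."""
    w, V = np.linalg.eigh(H)
    U = V @ np.diag(np.exp(1j * w * t)) @ V.conj().T   # exp(+iHt)
    return U @ O_s @ U.conj().T


def commutator(A, B):
    return A @ B - B @ A


# Arbitrary two-level example (illustrative values only)
H = np.array([[1.0, 0.5], [0.5, -1.0]], dtype=complex)
O_s = np.array([[0.0, 1.0], [1.0, 0.0]], dtype=complex)

t, h = 0.7, 1e-5
# Central finite difference for dO_H/dt
lhs = (evolve_heisenberg(O_s, H, t + h) - evolve_heisenberg(O_s, H, t - h)) / (2 * h)
rhs = 1j * commutator(H, evolve_heisenberg(O_s, H, t))
print("dO_H/dt == i[H, O_H]:", np.allclose(lhs, rhs, atol=1e-6))
print("H_H == H_S          :", np.allclose(evolve_heisenberg(H, H, t), H))
```

The second check reflects the remark that putting $O_S = H_S$ gives $H_H = H_S$, so the Hamiltonian itself does not evolve.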

Problem 5.9 

Derive the space part of Eq. (5.67). 

We can make the transition from classical mechanics to quantum mechanics by
the following steps:

(1) Regard the classical variables as quantum operators

(2) Impose the commutation relation $[q_i, p_j] = i\delta_{ij}$

(3) Then calculate $\partial_\mu O_H = i[P_\mu, O_H]$

The advantage of the Heisenberg picture is that it makes the correspondence
with classical mechanics apparent.

3) See Eq. (5.29d) for a caveat.

5.2.2

Quantization of the Harmonic Oscillator



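We start with the quantization of the harmonic oscillator, a prescription described
in any standard textbook of quantum mechanics. The Hamiltonian is given by
Eq. (5.11). Using the Heisenberg equation of motion together with the canonical
commutation relation

$$[q, p] = i \tag{5.68}$$

we obtain

$$\dot q = i[H, q] = \frac{p}{m} , \qquad \dot p = i[H, p] = -m\omega^2 q \tag{5.69}$$

which is the same equation as Eq. (5.12), but $q$ and $p$ are now operators. Now we
introduce operators defined by

$$q = \sqrt{\frac{1}{2m\omega}}\,(a + a^\dagger) , \qquad p = -i\sqrt{\frac{m\omega}{2}}\,(a - a^\dagger) \tag{5.70}$$

The canonical commutation relation (5.68) becomes

$$[a, a^\dagger] = 1 \tag{5.71}$$

The Hamiltonian in terms of $a$, $a^\dagger$ is given as

$$H = \tfrac{1}{2}\omega\left(a^\dagger a + a a^\dagger\right) = \omega\left(a^\dagger a + \tfrac{1}{2}\right) \tag{5.72}$$

Inserting the new variables into the Heisenberg equation gives

$$\dot a = -i\omega a , \qquad \dot a^\dagger = i\omega a^\dagger \tag{5.73a}$$

$$\therefore\quad a = a_0\,e^{-i\omega t} , \qquad a^\dagger = a_0^\dagger\,e^{i\omega t} \tag{5.73b}$$

We rename $a_0$ and $a_0^\dagger$ as $a$ and $a^\dagger$ and consider their properties. As will be shown
in the following, eigenvalues of the operator $N = a^\dagger a$ are either zero or positive
integers. Hence the energy intervals of the harmonic oscillator are all the same
($\Delta E = \hbar\omega$).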

Problem 5.10 

Using the Hamiltonian (5.72) and the commutator (5.71), derive Eq. (5.73a). 

The Number Operator 

We prove that the operator $N = a^\dagger a$ has non-negative integer eigenvalues and can be
considered as the number operator. It satisfies the commutation relations

$$[N, a] = -a , \qquad [N, a^\dagger] = a^\dagger \tag{5.74}$$

Let us consider an eigenstate of the operator $N$ with eigenvalue $n$:

$$N|n\rangle = n|n\rangle \tag{5.75}$$

Using the commutator (5.74), we see

$$N a^\dagger|n\rangle = a^\dagger(1 + N)|n\rangle = (n+1)\,a^\dagger|n\rangle \tag{5.76a}$$

$$N a|n\rangle = a(N - 1)|n\rangle = (n-1)\,a|n\rangle \tag{5.76b}$$

which means that $a^\dagger|n\rangle$, $a|n\rangle$ are also eigenstates of $N$ with eigenvalues $(n \pm 1)$.
Therefore, $a^\dagger$, $a$ are operators that change the eigenvalue by $+1$ and $-1$, respectively.
Set the normalization condition as

$$\langle n|n\rangle = 1 , \qquad a|n\rangle = c_n|n-1\rangle \tag{5.77a}$$

$$n = \langle n|N|n\rangle = \langle n|a^\dagger a|n\rangle = |c_n|^2\langle n-1|n-1\rangle = |c_n|^2 \;\to\; c_n = \sqrt{n} \tag{5.77b}$$

We have chosen the phase of $c_n$ to make it real. Equation (5.76) means that the
matrix elements of $a^\dagger$, $a$ are nonzero only between $|n\rangle$ and $|n \pm 1\rangle$:

$$\langle n-1|a|n\rangle = \langle n|a^\dagger|n-1\rangle = \sqrt{n} \tag{5.78}$$

Repeated operation of $a$ on $|n\rangle$ would eventually make the eigenvalue negative. Because
of Eq. (5.77b), the state cannot have negative eigenvalues. To avoid contradiction,
there must exist a state $|0\rangle$ that satisfies

$$a|0\rangle = 0 \tag{5.79}$$

We call it the ground state or the vacuum state. Equation (5.78) tells us that any
state can be constructed from $|0\rangle$ by multiplying by $a^\dagger$:

$$|n\rangle = \frac{(a^\dagger)^n}{\sqrt{n!}}|0\rangle , \qquad N|n\rangle = n|n\rangle \tag{5.80a}$$

$$H|n\rangle = \omega\left(a^\dagger a + \tfrac{1}{2}\right)|n\rangle = \omega\left(n + \tfrac{1}{2}\right)|n\rangle \tag{5.80b}$$

The value of the energy is given by a non-negative integer multiple of $\omega$ plus
some fixed constant.
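
The algebra of $a$ and $a^\dagger$ can be made concrete with finite matrices in the number basis. The sketch below assumes a truncation to the states $|0\rangle, \dots, |\mathrm{dim}-1\rangle$, which is an approximation introduced here: the commutator $[a, a^\dagger] = 1$ then holds everywhere except in the last diagonal entry. The function name `ladder_operators` and the values of `dim` and `omega` are illustrative.

```python
import numpy as np


def ladder_operators(dim):
    """Truncated matrices for a and a^dagger in the basis |0>, ..., |dim-1>."""
    # a|n> = sqrt(n)|n-1>, so sqrt(n) sits on the first superdiagonal
    a = np.diag(np.sqrt(np.arange(1, dim)), k=1)
    return a, a.conj().T


dim, omega = 6, 2.0
a, a_dag = ladder_operators(dim)
N = a_dag @ a                                   # number operator
H = omega * (N + 0.5 * np.eye(dim))             # Eq. (5.72)

print("eigenvalues of N:", np.diag(N))
print("energies        :", np.linalg.eigvalsh(H))
print("[a, a^dagger] diagonal:", np.diag(a @ a_dag - a_dag @ a))
# Heisenberg equation for a: i[H, a] should equal -i * omega * a
print("da/dt = -i omega a:", np.allclose(1j * (H @ a - a @ H), -1j * omega * a))
```

The energies come out equally spaced by `omega`, and the last entry of the commutator diagonal deviates from 1 purely as a truncation artifact.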

The fact that all the energy intervals of a harmonic oscillator are equal becomes
important when we quantize the field. At this point we make an important
reinterpretation. We do not consider that the energy is $n$ multiples of $\omega$ but that $n$
particles with energy $\omega$ are created. Then $N = a^\dagger a$ represents a number operator
of the particle and the Hamiltonian $H = \omega a^\dagger a$ represents the energy sum of $n$
particles. $a$ and $a^\dagger$ can be interpreted as the annihilation and creation operators
of the particle. The constant term $\omega/2$ in the Hamiltonian is called the zero-point
energy. For the moment we ignore it because the reference energy can always be
shifted and is hence unobservable. We shall come back to it in the last section of this
chapter.

Quantization of the Fermion

We have derived the equation for a harmonic oscillator in two ways: A using $q$, $p$,
and B using $a$, $a^\dagger$. Method A uses Eq. (5.11) as the Hamiltonian and the commutator
Eq. (5.68) while method B uses Eq. (5.72) and Eq. (5.71). They are connected
by the relation Eq. (5.70). Whichever we choose as the starting point, we can derive
the other. Therefore, both methods are equivalent. However, we notice that if we
choose B as the starting point, the commutator Eq. (5.71) is not a unique choice.
If, instead of the commutator, we require anticommutator relations, we have

$$[a, a^\dagger]_+ \equiv \{a, a^\dagger\} \equiv a a^\dagger + a^\dagger a = 1 \tag{5.81a}$$

$$\{a, a\} = \{a^\dagger, a^\dagger\} = 0 \tag{5.81b}$$

We can equally obtain the harmonic oscillator using the Heisenberg equation.

Using the Hamiltonian Eq. (5.72) and the anticommutator Eq. (5.81), prove that
the Heisenberg equation of motion gives the same harmonic oscillator equation
Eq. (5.73a) for $a$, $a^\dagger$.

In this case, however, the eigenvalues of the number operator are limited only to
1 and 0:

$$N^2 = a^\dagger a\,a^\dagger a = a^\dagger(1 - a^\dagger a)a = a^\dagger a = N \tag{5.82a}$$

$$\therefore\quad N = 1, 0 \tag{5.82b}$$

The eigenvectors are $|1\rangle$ and $|0\rangle$. Besides,

$$N a^\dagger|0\rangle = a^\dagger a\,a^\dagger|0\rangle = a^\dagger(1 - a^\dagger a)|0\rangle = a^\dagger|0\rangle \tag{5.83a}$$

$$\therefore\quad a^\dagger|0\rangle = |1\rangle \tag{5.83b}$$

Similarly,

$$a|1\rangle = |0\rangle , \qquad a|0\rangle = 0 , \qquad a^\dagger|1\rangle = 0 \tag{5.83c}$$

The last equation shows that $a^\dagger$ behaves like an annihilation operator acting on
$|1\rangle$, because there is no $n = 2$ state. In summary, $a$, $a^\dagger$ can be interpreted as the
annihilation or creation operator of a particle; only one particle can occupy a state.
Such a particle is called a fermion. Expression of the Hamiltonian in terms of $q$, $p$
using the relation Eq. (5.70) gives

$$H = i\omega\,qp \tag{5.84}$$

which has no simple interpretation in classical mechanics and is, therefore, a purely
quantum mechanical object. Expressed in terms of $a$, $a^\dagger$

$$H = \omega\left(a^\dagger a - \tfrac{1}{2}\right) \tag{5.85}$$

which has a logical physical interpretation.

Particles that satisfy the commutation relation are said to obey Bose-Einstein 
statistics and are called bosons, while those that satisfy the anticommutation re- 
lation obey Fermi-Dirac statistics. Any number of bosons can occupy a state, but 
only one fermion can enter a state. 
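The algebra of Eqs. (5.81)-(5.83) can be checked with an explicit matrix representation. The sketch below is only an illustration: the 2x2 matrices, the basis ordering (index 0 for $|0\rangle$, index 1 for $|1\rangle$) and the helper names are choices made here, not notation from the text.

```python
# Illustrative 2x2 representation of one fermionic mode.
# Basis ordering (an assumption of this sketch): index 0 -> |0>, index 1 -> |1>.

def matmul(x, y):
    """Multiply two square matrices stored as nested lists."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(x, y):
    """Element-wise sum of two square matrices."""
    n = len(x)
    return [[x[i][j] + y[i][j] for j in range(n)] for i in range(n)]

def anticommutator(x, y):
    """{x, y} = xy + yx."""
    return add(matmul(x, y), matmul(y, x))

A = [[0, 1],
     [0, 0]]        # annihilation: a|1> = |0>, a|0> = 0
A_DAG = [[0, 0],
         [1, 0]]    # creation: a†|0> = |1>, a†|1> = 0

N = matmul(A_DAG, A)   # number operator a†a

if __name__ == "__main__":
    print("{a, a†} =", anticommutator(A, A_DAG))   # identity, Eq. (5.81a)
    print("{a, a}  =", anticommutator(A, A))       # zero, Eq. (5.81b)
    print("N       =", N)                          # diagonal with entries 0 and 1
    print("N^2 == N:", matmul(N, N) == N)          # Eq. (5.82a)
```

Because $N$ is diagonal with entries 0 and 1, the occupation number of the mode can only be 0 or 1, which is the exclusion property stated above.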



Quantization of Fields 

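How can we quantize fields? In Eq. (5.15), we have seen that the field is a collection
of oscillators $q_r$ distributed over all space $\mathbf{x}$, and that $\mathbf{x}$ has
been chosen for numbering instead of $r$. To make the transition from discrete $r$ to
continuous $\mathbf{x}$, let us divide the space into rectangular boxes of volume
$\Delta V = \Delta x \Delta y \Delta z$ and number them by $r$. Define $q_r$ as the field
averaged over $\Delta V$ at point $\mathbf{x} = \mathbf{x}_r$ multiplied by $\Delta V$,
i.e. $q_r(t) = \phi(t, \mathbf{x}_r)\Delta V$. Then the field Lagrangian can be rewritten
in terms of $q_r$:

$$ L = \frac{1}{2}\int d^3x\,[\dot\phi^2 - (\nabla\phi)^2 - m^2\phi^2] = \int d^3x\,\mathcal{L} = \sum_r \Delta V\,\mathcal{L}_r = \sum_r \frac{\dot q_r^2}{2\Delta V} - (\text{terms independent of } \dot q_r) \tag{5.86} $$

We define the conjugate momentum by

$$ p_r = \frac{\partial L}{\partial \dot q_r} = \Delta V\,\frac{\partial \mathcal{L}_r}{\partial(\dot\phi(\mathbf{x}_r)\Delta V)} = \frac{\partial \mathcal{L}_r}{\partial \dot\phi(\mathbf{x}_r)} \equiv \pi(\mathbf{x}_r) \tag{5.87} $$

Then the Hamiltonian is given by

$$ H = \sum_r p_r \dot q_r - L = \sum_r \pi(\mathbf{x}_r, t)\dot\phi(\mathbf{x}_r)\Delta V - \sum_r \mathcal{L}_r \Delta V \xrightarrow{\Delta V \to 0} \int d^3x\,\mathcal{H} \tag{5.88a} $$

$$ \mathcal{H} = \pi(\mathbf{x}, t)\dot\phi(t, \mathbf{x}) - \frac{1}{2}[\dot\phi^2 - (\nabla\phi)^2 - m^2\phi^2] \tag{5.88b} $$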

which reproduces the field Hamiltonian in the continuous limit. Since we have
replaced the field by a collection of discrete oscillators, the quantization can be
performed in a quantum mechanical way:

$$ [q_r, p_s] = i\delta_{rs}, \qquad [q_r, q_s] = [p_r, p_s] = 0 \tag{5.89} $$

The equal time commutator of the field can be obtained by replacing $q_r$, $p_r$ by
$\phi(t, \mathbf{x}_r)\Delta V$, $\pi(t, \mathbf{x}_r)$:

$$ [\phi(t, \mathbf{x}_r), \pi(t, \mathbf{x}_s)] = i\frac{\delta_{rs}}{\Delta V} \xrightarrow{\Delta V \to 0} [\phi(t, \mathbf{x}), \pi(t, \mathbf{y})] = i\delta^3(\mathbf{x} - \mathbf{y}) \tag{5.90} $$
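The step from $\delta_{rs}/\Delta V$ to $\delta^3(\mathbf{x} - \mathbf{y})$ in Eq. (5.90) can be made concrete with a small numerical sketch. The one-dimensional grid, the sample function and the helper names below are assumptions chosen for illustration; the point is only that $\delta_{rs}/\Delta V$ picks out the value of a sampled function when summed with weight $\Delta V$, while its peak height grows like $1/\Delta V$.

```python
def discrete_delta(r, s, cell_volume):
    """Kronecker delta divided by the cell volume: lattice stand-in for delta(x_r - x_s)."""
    return (1.0 if r == s else 0.0) / cell_volume

def smear(samples, s, cell_volume):
    """Riemann sum of f(x_r) * delta(x_r - x_s) * dV over all cells."""
    return sum(f_r * discrete_delta(r, s, cell_volume) * cell_volume
               for r, f_r in enumerate(samples))

if __name__ == "__main__":
    for cell_volume in (0.5, 0.1, 0.01):
        n_cells = int(round(1.0 / cell_volume))
        # f(x) = x^2 sampled at the cell positions x_r = r * dV (hypothetical test function)
        samples = [(r * cell_volume) ** 2 for r in range(n_cells)]
        s = n_cells // 2
        print("dV =", cell_volume,
              "peak =", discrete_delta(s, s, cell_volume),
              "sum =", smear(samples, s, cell_volume),
              "f(x_s) =", samples[s])
```

As the cell shrinks the peak diverges but the weighted sum keeps returning $f(x_s)$, which is the defining property of the delta function.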

In summary, it is reasonable to assume that the quantization of the fields can be
achieved by

1. interpreting the classical fields as quantum field operators in the Heisenberg
picture, and

2. imposing equal time commutation relations of the fields and their conjugate
momentum fields

$$ [\varphi_s(x), \pi_t(y)]_{t_x = t_y} = i\delta_{st}\,\delta^3(\mathbf{x} - \mathbf{y}) \tag{5.91a} $$

$$ [\varphi_s(x), \varphi_t(y)]_{t_x = t_y} = [\pi_s(x), \pi_t(y)]_{t_x = t_y} = 0 \tag{5.91b} $$

where the indices $s$, $t$ specify additional internal degrees of freedom. We have
used $x$ to denote the four-dimensional space-time coordinate, i.e. $x = (t, \mathbf{x})$.
The conjugate momentum field is defined by

$$ \pi_s(x) = \frac{\partial \mathcal{L}}{\partial \dot\varphi_s(x)} \tag{5.91c} $$

In this book, however, we shall adopt an alternative method: we quantize plane waves
with specific wave number $\mathbf{k}$ and frequency $\omega$. Because any wave field can be
(Fourier) decomposed in terms of waves that are harmonic oscillators, the method
we learned in quantum mechanics is directly applicable. Both methods are equivalent,
but the latter probably makes it easier to grasp the physical picture behind the
mathematical formula, although once one is used to the method, the former may
be easier to handle.

5.3.1 
Complex Fields 

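We start from a spin 0 complex scalar field that obeys the Klein-Gordon equation.
It has two internal degrees of freedom and can be expressed in an alternative form:

$$ \varphi = \frac{\phi_1 + i\phi_2}{\sqrt{2}}, \qquad \varphi^\dagger = \frac{\phi_1 - i\phi_2}{\sqrt{2}} \tag{5.92} $$

Here, we treat $\varphi$ and $\varphi^\dagger$ as two independent fields. The Lagrangian is
already given by Eq. (5.20b) and the equation of motion can be derived from the
Euler-Lagrange equation. The energy, momentum and other conserved physical quantities
were given in Eq. (5.58). 4)

$$ H = \int d^3x \left[\frac{\partial\varphi^\dagger}{\partial t}\frac{\partial\varphi}{\partial t} + (\nabla\varphi^\dagger\cdot\nabla\varphi) + m^2\varphi^\dagger\varphi\right] \tag{5.93a} $$

$$ \mathbf{P} = -\int d^3x \left[\frac{\partial\varphi^\dagger}{\partial t}\nabla\varphi + \nabla\varphi^\dagger\frac{\partial\varphi}{\partial t}\right] \tag{5.93b} $$

$$ Q = iq\int d^3x \left[\varphi^\dagger\frac{\partial\varphi}{\partial t} - \frac{\partial\varphi^\dagger}{\partial t}\varphi\right] \tag{5.93c} $$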

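Since the Klein-Gordon equation is a linear wave equation, its solution can be expanded
in terms of waves having wave number $\mathbf{k}$:

$$ \varphi(x) = \sum_k q_k(t)\,e^{i\mathbf{k}\cdot\mathbf{x}} \;\to\; V\int\frac{d^3k}{(2\pi)^3}\,q_k(t)\,e^{i\mathbf{k}\cdot\mathbf{x}} \tag{5.94} $$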



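The continuous limit is written for later reference, but in this chapter we stick to
the discrete sum to aid intuitive understanding of the quantization procedure. Applying
the Klein-Gordon equation, we obtain

$$ [\partial_\mu\partial^\mu + m^2]\varphi = \sum_k (\ddot q_k + \omega^2 q_k)\,e^{i\mathbf{k}\cdot\mathbf{x}} = 0, \qquad \omega = \sqrt{\mathbf{k}^2 + m^2} \tag{5.95} $$

which shows that the Klein-Gordon field is a collection of an infinite number of
harmonic oscillators indexed by the wave number $\mathbf{k}$. Those with different
$\mathbf{k}$ are in different oscillation modes, and hence independent of each other.
We separate $q_k$ into positive and negative frequency parts 6) and write

$$ \varphi(x) = \sum_k \frac{1}{\sqrt{2\omega V}}\left[a_k e^{-i\omega t + i\mathbf{k}\cdot\mathbf{x}} + c_k e^{i\omega t + i\mathbf{k}\cdot\mathbf{x}}\right] \tag{5.96} $$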



where $V$ is the normalization volume for the plane wave. 5) Inserting Eq. (5.96) into
Eq. (5.93), and using the orthogonal relation (refer to Eq. 6.73)

$$ \frac{1}{V}\int d^3x\, e^{i(\mathbf{k} - \mathbf{k}')\cdot\mathbf{x}} = \delta_{kk'} \tag{5.97} $$

we obtain

$$ H = \sum_k \omega\,[a_k^\dagger a_k + c_k^\dagger c_k] \tag{5.98a} $$

$$ \mathbf{P} = \sum_k \mathbf{k}\,[a_k^\dagger a_k - c_k^\dagger c_k] \tag{5.98b} $$

$$ Q = \sum_k q\,[a_k^\dagger a_k - c_k^\dagger c_k] \tag{5.98c} $$

4) The wording "electric charge" is used here for conserved charge-like quantum numbers, for
example, strangeness.

5) Later we will set $V = 1$, but we retain it here. As long as $V$ is finite, we should use a
discrete sum; even in the continuous limit, think of $V$ as large but finite. See Sect. 6.6.2 for
detailed discussion.

6) By convention we call $e^{-i\omega t}$ positive frequency and $e^{+i\omega t}$ negative frequency.
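As a numerical cross-check of the oscillator picture in Eqs. (5.95) and (5.96), the sketch below verifies that a single mode amplitude made of a positive- and a negative-frequency part satisfies $\ddot q + \omega^2 q = 0$ once $\omega = \sqrt{\mathbf{k}^2 + m^2}$. The function names, the finite-difference step and the sample values are illustrative assumptions.

```python
import cmath
import math

def omega(k, m):
    """Dispersion relation for wave vector k = (kx, ky, kz) and mass m (natural units)."""
    return math.sqrt(sum(c * c for c in k) + m * m)

def q_mode(t, w, a, c):
    """Mode amplitude: a*exp(-i w t) (positive frequency) + c*exp(+i w t) (negative frequency)."""
    return a * cmath.exp(-1j * w * t) + c * cmath.exp(1j * w * t)

def oscillator_residual(t, w, a, c, h=1e-4):
    """Central-difference estimate of q'' + w^2 q; it should vanish up to discretization error."""
    second = (q_mode(t + h, w, a, c) - 2 * q_mode(t, w, a, c)
              + q_mode(t - h, w, a, c)) / h ** 2
    return second + w * w * q_mode(t, w, a, c)

if __name__ == "__main__":
    w = omega((3.0, 0.0, 0.0), 4.0)       # sample values: |k| = 3, m = 4
    print("omega =", w)
    print("|q'' + w^2 q| =", abs(oscillator_residual(0.3, w, 1.0, 0.5j)))
```

Both coefficients $a$ and $c$ solve the same oscillator equation; which of them lowers and which raises the energy is decided only after quantization.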

Then we impose

$$ [a_k, a_{k'}^\dagger] = \delta_{kk'}, \qquad [a_k, a_{k'}] = [a_k^\dagger, a_{k'}^\dagger] = 0 \tag{5.99a} $$

$$ [c_k, c_{k'}^\dagger] = \delta_{kk'}, \qquad [c_k, c_{k'}] = [c_k^\dagger, c_{k'}^\dagger] = 0 \tag{5.99b} $$

$$ [a_k, c_{k'}^\dagger] = [a_k^\dagger, c_{k'}] = [a_k, c_{k'}] = [a_k^\dagger, c_{k'}^\dagger] = 0 \tag{5.99c} $$

Here, $(a_k^\dagger, a_k)$ can be interpreted as creation and annihilation operators of a
particle with energy-momentum $(E = \hbar\omega, \mathbf{p} = \hbar\mathbf{k})$. However, if we
apply the same rule to the negative frequency part, $(c_k^\dagger, c_k)$ creates or annihilates
a particle with negative energy. This is not allowed since there should be no particles with
negative energy. Let us examine more carefully what effects $c_k^\dagger$ and $c_k$ produce
when operating on the state vector. As before, we define the number operator $N$ by

$$ N = \sum_k (a_k^\dagger a_k + c_k^\dagger c_k) \tag{5.100} $$

Using the commutation relation Eq. (5.99), we obtain the relations

$$ [N, a_k^\dagger] = a_k^\dagger, \qquad [N, a_k] = -a_k \tag{5.101a} $$

$$ [N, c_k^\dagger] = c_k^\dagger, \qquad [N, c_k] = -c_k \tag{5.101b} $$

which shows that $a_k^\dagger$, $c_k^\dagger$ can be interpreted as creation and $a_k$, $c_k$
as annihilation operators of particles with momentum $\mathbf{k}$. Now we apply the
Heisenberg equation to the field $\varphi$ and have it operate on an energy eigenstate
$|E\rangle$:

$$ \dot\varphi|E\rangle = i[H, \varphi]|E\rangle, \qquad H|E\rangle = E|E\rangle \tag{5.102} $$

Compare the coefficients of $e^{-i\omega t}$ and $e^{+i\omega t}$ on the two sides of the first
equation. Using the relation (5.101), we obtain

$$ -i\omega\, a_k|E\rangle = i[H, a_k]|E\rangle = i(H - E)\,a_k|E\rangle $$

$$ i\omega\, c_k|E\rangle = i[H, c_k]|E\rangle = i(H - E)\,c_k|E\rangle $$

$$ \therefore\ H a_k|E\rangle = (E - \omega)\,a_k|E\rangle \tag{5.103a} $$

$$ H c_k|E\rangle = (E + \omega)\,c_k|E\rangle \tag{5.103b} $$

$a_k$, the coefficient of the positive-frequency waves, reduces the number of particles
by 1 and also reduces the energy of the state $|E\rangle$ by $\omega$. So it is a legitimate
annihilation operator. But $c_k$, the coefficient of the negative-frequency wave term,
actually increases the energy of the state by $\omega$. A logical interpretation is then that
$c_k$ creates a particle and should be considered as a creation operator. We then redefine
$c_{-k} \to b_k^\dagger$, $c_{-k}^\dagger \to b_k$ and the commutation relations as

$$ b_k = c_{-k}^\dagger, \qquad b_k^\dagger = c_{-k} \tag{5.104a} $$

$$ [b_k, b_{k'}^\dagger] = \delta_{kk'}, \qquad [b_k, b_{k'}] = [b_k^\dagger, b_{k'}^\dagger] = 0 \tag{5.104b} $$

$$ [a_k, b_{k'}^\dagger] = [a_k, b_{k'}] = [a_k^\dagger, b_{k'}^\dagger] = 0 \tag{5.104c} $$

The reason for adopting $-\mathbf{k}$ is to make the second term in Eq. (5.98b) positive so
that the particle has positive momentum as well as positive energy. The newly defined
field is given in Eq. (5.106). In terms of the new creation/annihilation operators, the
energy, momentum and charge operators are rewritten as

$$ H = \sum_k \omega\,[a_k^\dagger a_k + b_k^\dagger b_k + 1] \tag{5.105a} $$

$$ \mathbf{P} = \sum_k \mathbf{k}\,[a_k^\dagger a_k + b_k^\dagger b_k + 1] \tag{5.105b} $$

$$ Q = \sum_k q\,[a_k^\dagger a_k - b_k^\dagger b_k - 1] \tag{5.105c} $$

The constant term appears because $c_{-k}^\dagger c_{-k} = b_k b_k^\dagger = 1 + b_k^\dagger b_k$.
The above expression tells us that $b_k$ is an annihilation operator of a particle having the
energy-momentum $(\omega, \mathbf{k})$, but opposite charge, which we call an antiparticle.
Note that we often encounter a statement that the antiparticle has negative energy. From
the above discussion, the reader should be convinced that there are no particles with
negative energy. The antiparticle has positive energy, too.
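The energy-lowering and energy-raising behaviour in Eq. (5.103) can be illustrated with a single oscillator mode in a truncated number basis. This is a sketch under stated assumptions: the truncation size `DIM`, the frequency `W`, the dropped zero-point term and all helper names are choices made here for illustration. The raising operator plays the role that the text assigns to $c_k = b_{-k}^\dagger$.

```python
import math

DIM = 6     # number of Fock levels kept (truncation is only an illustration device)
W = 2.0     # mode frequency in arbitrary units

def lowering(dim):
    """Annihilation operator in the basis |0>, ..., |dim-1>: a|n> = sqrt(n)|n-1>."""
    return [[math.sqrt(j) if j == i + 1 else 0.0 for j in range(dim)]
            for i in range(dim)]

def transpose(m):
    return [list(row) for row in zip(*m)]

def apply(m, v):
    """Matrix times column vector."""
    return [sum(m[i][j] * v[j] for j in range(len(v))) for i in range(len(m))]

def energy(n):
    """Eigenvalue of H = W * a†a on |n> (constant zero-point term dropped)."""
    return W * n

def hamiltonian(v):
    """Apply H = W * a†a, which is diagonal in the number basis."""
    return [energy(n) * v[n] for n in range(len(v))]

def basis(n, dim=DIM):
    return [1.0 if i == n else 0.0 for i in range(dim)]

A = lowering(DIM)
A_DAG = transpose(A)

if __name__ == "__main__":
    state = basis(3)                       # |E> with E = 3W
    lowered = apply(A, state)              # proportional to |2>
    raised = apply(A_DAG, state)           # proportional to |4>
    print("H a|E>  =", hamiltonian(lowered), " (E - W) a|E>  =",
          [(energy(3) - W) * x for x in lowered])
    print("H a†|E> =", hamiltonian(raised), " (E + W) a†|E> =",
          [(energy(3) + W) * x for x in raised])
```

An operator that shifts the energy up by $\omega$ must be counted as a creation operator, which is exactly why the negative-frequency coefficient is renamed $b^\dagger$.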



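Summary of Field Quantization

Field Expansion

$$ \varphi(x) = \sum_k \frac{1}{\sqrt{2\omega V}}\left[a_k e^{-ik\cdot x} + b_k^\dagger e^{ik\cdot x}\right] \tag{5.106} $$

Commutation Relations

$$ [a_k, a_{k'}^\dagger] = \delta_{kk'}, \qquad [b_k, b_{k'}^\dagger] = \delta_{kk'} \tag{5.107a} $$

$$ [a_k, a_{k'}] = [a_k^\dagger, a_{k'}^\dagger] = 0, \qquad [b_k, b_{k'}] = [b_k^\dagger, b_{k'}^\dagger] = 0 \tag{5.107b} $$

$$ [a_k, b_{k'}^\dagger] = [a_k^\dagger, b_{k'}] = [a_k, b_{k'}] = [a_k^\dagger, b_{k'}^\dagger] = 0 \tag{5.107c} $$

Energy, Momentum and Charge Operators

$$ H = \sum_k \omega\,[a_k^\dagger a_k + b_k^\dagger b_k + 1] \tag{5.108a} $$

$$ \mathbf{P} = \sum_k \mathbf{k}\,[a_k^\dagger a_k + b_k^\dagger b_k + 1] \tag{5.108b} $$

$$ Q = \sum_k q\,[a_k^\dagger a_k - b_k^\dagger b_k - 1] \tag{5.108c} $$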

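Using relations in the box, it is a straightforward calculation to prove

$$ [\varphi(x), \pi(y)]_{t_x = t_y} = i\delta^3(\mathbf{x} - \mathbf{y}) \tag{5.109a} $$

$$ [\varphi(x), \varphi(y)]_{t_x = t_y} = [\pi(x), \pi(y)]_{t_x = t_y} = 0 \tag{5.109b} $$

$$ \pi(x) = \frac{\partial\mathcal{L}}{\partial\dot\varphi(x)} = \dot\varphi^\dagger(x) \tag{5.109c} $$

Decomposing the complex field and its conjugate momentum in terms of annihilation
and creation operators, prove Eq. (5.109).

Note: As the conjugate momenta of $\varphi$ and $\varphi^\dagger$ are

$$ \pi_\varphi = \frac{\partial\mathcal{L}}{\partial\dot\varphi} = \dot\varphi^\dagger, \qquad \pi_{\varphi^\dagger} = \frac{\partial\mathcal{L}}{\partial\dot\varphi^\dagger} = \dot\varphi \tag{5.110} $$

the commutators for $[\varphi, \pi_\varphi]$ and $[\varphi^\dagger, \pi_{\varphi^\dagger}]$ are
equivalent in the sense that one is the hermitian conjugate of the other. Therefore there
are no additional commutators to Eq. (5.109).

Equation (5.106) may be inverted as

$$ a_k = i\int d^3x\, e_k^*(x)\overleftrightarrow{\partial_t}\varphi(x), \qquad a_k^\dagger = -i\int d^3x\, e_k(x)\overleftrightarrow{\partial_t}\varphi^\dagger(x) \tag{5.111a} $$

$$ b_k = i\int d^3x\, e_k^*(x)\overleftrightarrow{\partial_t}\varphi^\dagger(x), \qquad b_k^\dagger = -i\int d^3x\, e_k(x)\overleftrightarrow{\partial_t}\varphi(x) \tag{5.111b} $$

where $e_k(x) = e^{-ik\cdot x}/\sqrt{2\omega V}$ and

$$ A(x)\overleftrightarrow{\partial_t}B(x) \equiv A(x)\,\partial_t B(x) - \partial_t A(x)\,B(x) \tag{5.112} $$

Quantization of the fields can also be achieved by imposing the commutation
relations Eq. (5.109) on the fields and their conjugate momentum fields. Then the
commutation relations for the creation and annihilation operators Eq. (5.107) can
be proved by using Eq. (5.111) and the commutation relations Eq. (5.109). The two
methods of quantization are mathematically equivalent and one can be derived
from the other.

Starting from the field commutators Eq. (5.109) and using Eq. (5.111), prove the
commutation relations for the creation and annihilation operators Eq. (5.107).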


5.3.2 
Real Field 

The expression for the real field is obtained by setting
$\varphi = \varphi^\dagger = \phi/\sqrt{2}$ and inserting it into Eq. (5.93), giving

$$ H = \frac{1}{2}\int d^3x\,[\dot\phi^2 + (\nabla\phi\cdot\nabla\phi) + m^2\phi^2] \tag{5.113a} $$

$$ \mathbf{P} = -\int d^3x\,(\dot\phi\,\nabla\phi) \tag{5.113b} $$

The complex scalar field is actually two real scalar fields of the same mass added
together. Inserting $\varphi = (\phi_1 + i\phi_2)/\sqrt{2}$,
$\varphi^\dagger = (\phi_1 - i\phi_2)/\sqrt{2}$, the energy-momentum of $\varphi$ and that of
$\phi_1$, $\phi_2$ can be converted into each other. The difference is that no
charge operator can be defined for the real field, which is readily seen by looking at
Eq. (5.93c). Therefore the real field describes electrically neutral particles, but the
converse is not true. For instance, $K^0$ and $\bar{K}^0$ are electrically neutral, but have the
"strangeness" quantum number to distinguish them and hence are described by
the complex field.

Fock Space

As can be seen from Eq. (5.106), $\varphi$ connects states differing in particle number by 1.
In particular, a matrix element between the vacuum and a one-particle system appears
frequently. Defining $|k\rangle = \sqrt{2\omega}\,a_k^\dagger|0\rangle$ 7)

$$ a_k|0\rangle = \langle 0|a_k^\dagger = 0, \qquad \langle 0|a_k|k\rangle = \langle k|a_k^\dagger|0\rangle = \sqrt{2\omega} \tag{5.114a} $$

$$ \langle 0|\varphi(x)|k\rangle = \frac{1}{\sqrt{V}}e^{-ik\cdot x}, \qquad \langle k|\varphi^\dagger(x)|0\rangle = \frac{1}{\sqrt{V}}e^{ik\cdot x} \tag{5.114b} $$

$$ k\cdot x = \omega t - \mathbf{k}\cdot\mathbf{x} \tag{5.114c} $$



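Problem 5.14
Show that

$$ |E\rangle = |k_1, k_2, \cdots, k_m, q_1, q_2, \cdots, q_n\rangle = \sqrt{2\omega_{k_1}}\,a_{k_1}^\dagger \cdots \sqrt{2\omega_{k_m}}\,a_{k_m}^\dagger\,\sqrt{2\omega_{q_1}}\,b_{q_1}^\dagger \cdots \sqrt{2\omega_{q_n}}\,b_{q_n}^\dagger|0\rangle \tag{5.115} $$

is a state with $m$ particles and $n$ antiparticles whose total energy and momentum
are given by

$$ E = \omega_{k_1} + \cdots + \omega_{k_m} + \omega_{q_1} + \cdots + \omega_{q_n} \tag{5.116a} $$

$$ \mathbf{P} = \mathbf{k}_1 + \cdots + \mathbf{k}_m + \mathbf{q}_1 + \cdots + \mathbf{q}_n \tag{5.116b} $$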

A state with any number of particles represents a Fock vector, and all Fock vectors 
constitute a Fock space. 
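The bookkeeping behind Eq. (5.116) is simple enough to sketch in code: each particle or antiparticle with momentum $\mathbf{k}$ contributes $\omega_k = \sqrt{\mathbf{k}^2 + m^2}$ to the energy and $\mathbf{k}$ to the momentum. The function name and the tuple representation of momenta are assumptions of this sketch; the vacuum constant is taken as already subtracted.

```python
import math

def fock_state_energy_momentum(mass, particle_momenta, antiparticle_momenta):
    """Total (E, P) of a Fock state labelled by particle and antiparticle 3-momenta.

    Both species carry positive energy sqrt(|k|^2 + m^2); natural units, zero-point term dropped.
    """
    momenta = list(particle_momenta) + list(antiparticle_momenta)
    energy = sum(math.sqrt(sum(c * c for c in k) + mass ** 2) for k in momenta)
    momentum = tuple(sum(k[i] for k in momenta) for i in range(3))
    return energy, momentum

if __name__ == "__main__":
    # One particle and one antiparticle of mass 3 (illustrative numbers).
    e_tot, p_tot = fock_state_energy_momentum(
        3.0, [(4.0, 0.0, 0.0)], [(0.0, -4.0, 0.0)])
    print("E =", e_tot, "P =", p_tot)
```

Note that the antiparticle enters with the same positive sign of energy as the particle; only its charge differs.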

Summary The relativistic wave function suffered from negative-energy particles.
But if it is reinterpreted as the classical field and quantized again, the difficulty
disappears. Because of this, the quantization of the field is often called second
quantization. The quantized field $\varphi$ is no longer a wave function but an operator
that acts on the Fock vectors (states in the Heisenberg picture). It has wave
characteristics in its expansion in terms of $e^{-ik\cdot x}$ and also a particle nature, in
that the coefficients $a_k$, $a_k^\dagger$ can be interpreted as the square root of the
particle number.

5.3.3 
Dirac Field 

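From the Lagrangian of the Dirac field Eq. (5.20d), using Noether's theorem we can
derive expressions for the Hamiltonian etc.:

$$ H = \int d^3x\,\psi^\dagger\left(\boldsymbol{\alpha}\cdot\frac{\nabla}{i} + \beta m\right)\psi = \int d^3x\,\psi^\dagger\, i\frac{\partial\psi}{\partial t} \tag{5.117a} $$

$$ \mathbf{P} = \int d^3x\,\psi^\dagger(-i\nabla)\psi \tag{5.117b} $$

$$ q j^\mu = q\bar\psi\gamma^\mu\psi = q\psi^\dagger(1, \boldsymbol{\alpha})\psi \;\to\; Q = q\int d^3x\, j^0 = q\int d^3x\,\psi^\dagger\psi \tag{5.117c} $$

We expand the field in terms of plane-wave solutions of the Dirac equation (see
Sect. 4.2). Writing down the Dirac fields as

$$ \psi = \sum_{p,r}\frac{1}{\sqrt{2E_pV}}\left[a_{p,r}u_r(p)e^{-ip\cdot x} + b_{p,r}^\dagger v_r(p)e^{ip\cdot x}\right] \tag{5.118a} $$

$$ \bar\psi = \sum_{p,r}\frac{1}{\sqrt{2E_pV}}\left[a_{p,r}^\dagger\bar u_r(p)e^{ip\cdot x} + b_{p,r}\bar v_r(p)e^{-ip\cdot x}\right] \tag{5.118b} $$

7) This is a relativistic normalization that we will adopt. The details will be described in Sect. 6.5.2.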

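$H$, $\mathbf{P}$, $Q$ can be rewritten as

$$ H = \sum_{p,r} E_p\,(a_{p,r}^\dagger a_{p,r} - b_{p,r}b_{p,r}^\dagger) \tag{5.119a} $$

$$ \mathbf{P} = \sum_{p,r}\mathbf{p}\,(a_{p,r}^\dagger a_{p,r} - b_{p,r}b_{p,r}^\dagger) \tag{5.119b} $$

$$ Q = \sum_{p,r} q\,(a_{p,r}^\dagger a_{p,r} + b_{p,r}b_{p,r}^\dagger) \tag{5.119c} $$

Notice the difference from and similarity to Eq. (5.98). Here we note that if we apply
the same logic as we followed after Eq. (5.98), we cannot use the commutation
relations like those of Eq. (5.104), because they produce negative energies, as is
shown by the second term of the Hamiltonian. But if we impose the following
anticommutators:

$$ \{a_{p,r}, a_{p',r'}^\dagger\} = \{b_{p,r}, b_{p',r'}^\dagger\} = \delta_{pp'}\delta_{rr'} $$

$$ \{a_{p,r}, a_{p',r'}\} = \{a_{p,r}^\dagger, a_{p',r'}^\dagger\} = \{b_{p,r}, b_{p',r'}\} = \{b_{p,r}^\dagger, b_{p',r'}^\dagger\} = 0 \tag{5.120} $$

$$ \{a_{p,r}, b_{p',r'}\} = \{a_{p,r}, b_{p',r'}^\dagger\} = \{a_{p,r}^\dagger, b_{p',r'}\} = \{a_{p,r}^\dagger, b_{p',r'}^\dagger\} = 0 $$

the energy becomes positive definite and we obtain physically consistent
interpretations:

$$ H = \sum_{p,r} E_p\,(a_{p,r}^\dagger a_{p,r} + b_{p,r}^\dagger b_{p,r} - 1) \tag{5.121a} $$

$$ \mathbf{P} = \sum_{p,r}\mathbf{p}\,(a_{p,r}^\dagger a_{p,r} + b_{p,r}^\dagger b_{p,r} - 1) \tag{5.121b} $$

$$ Q = \sum_{p,r} q\,(a_{p,r}^\dagger a_{p,r} - b_{p,r}^\dagger b_{p,r} + 1) \tag{5.121c} $$

Therefore, the Dirac field must obey Fermi-Dirac statistics.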
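The step from Eq. (5.119a) to Eq. (5.121a) hinges on how the product $b\,b^\dagger$ is reordered. As a short worked comparison (a sketch for a single mode, with the labels $p, r$ kept fixed), the two possible quantization rules give opposite signs for the antiparticle number term:

```latex
\begin{align*}
% Commutator [b, b^\dagger] = 1: the antiparticle term stays negative
-\,b_{p,r}\,b_{p,r}^\dagger &= -\,b_{p,r}^\dagger b_{p,r} - 1
  && \text{if } [b_{p,r}, b_{p,r}^\dagger] = 1, \\
% Anticommutator \{b, b^\dagger\} = 1: the antiparticle term becomes positive
-\,b_{p,r}\,b_{p,r}^\dagger &= +\,b_{p,r}^\dagger b_{p,r} - 1
  && \text{if } \{b_{p,r}, b_{p,r}^\dagger\} = 1 .
\end{align*}
```

Only the second line makes each antiparticle raise the energy by $E_p$, which is the content of the statement that the Dirac field requires anticommutators.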




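Problem 5.15

Show that Eq. (5.118) can be inverted to give

$$ a_{p,r} = \int d^3x\,\bar u_r(p)\,e_p^*(x)\,\gamma^0\psi(x), \qquad a_{p,r}^\dagger = \int d^3x\,\bar\psi(x)\,\gamma^0 u_r(p)\,e_p(x) \tag{5.122a} $$

$$ b_{p,r} = \int d^3x\,\bar\psi(x)\,\gamma^0 v_r(p)\,e_p^*(x), \qquad b_{p,r}^\dagger = \int d^3x\,\bar v_r(p)\,e_p(x)\,\gamma^0\psi(x) \tag{5.122b} $$

where $e_p(x) = e^{-ip\cdot x}/\sqrt{2E_pV}$.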



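The equal time anticommutators of the Dirac field are given as follows. The conjugate
field is defined by

$$ \pi_r = \frac{\partial\mathcal{L}}{\partial\dot\psi_r} = i(\bar\psi\gamma^0)_r = i\psi_r^\dagger \tag{5.123} $$

Since the Lagrangian contains no derivatives of $\psi^\dagger$, there is no conjugate
momentum to $\psi^\dagger$. 8) The above equation shows that $i\psi^\dagger$ itself is the
conjugate momentum to $\psi$. Therefore the anticommutators are

$$ \{\psi_r(x), \psi_s^\dagger(y)\}_{t_x = t_y} = \delta_{rs}\,\delta^3(\mathbf{x} - \mathbf{y}) \tag{5.124a} $$

$$ \{\psi_r(x), \psi_s(y)\}_{t_x = t_y} = \{\psi_r^\dagger(x), \psi_s^\dagger(y)\}_{t_x = t_y} = 0 \tag{5.124b} $$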

5.3.4 
Electromagnetic Field 

The electromagnetic field can be expressed by the potential $A^\mu = (\phi, \mathbf{A})$,
where $\phi$ is the scalar or Coulomb potential and $\mathbf{A}$ the vector potential. It is a
real vector potential and, when quantized, represents a spin 1 neutral particle, called
the photon. It obeys the Maxwell equation in vacuum:

$$ \partial_\mu F^{\mu\nu} = \partial_\mu\partial^\mu A^\nu - \partial^\nu(\partial_\mu A^\mu) = 0 \tag{5.125a} $$

$$ F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu \tag{5.125b} $$

It has gauge symmetry, which means the electromagnetic field $F^{\mu\nu}$ does not
change under a gauge transformation

$$ A^\mu \to A'^\mu(x) = A^\mu(x) + \partial^\mu\chi(x) \tag{5.126} $$

The gauge therefore has to be fixed for calculating observable quantities. The usual
convention is to adopt the Lorentz gauge

$$ \partial_\mu A^\mu = 0 \quad : \text{Lorentz gauge} \tag{5.127} $$

Then the Maxwell equation becomes

$$ \partial_\mu\partial^\mu A^\nu = 0 \tag{5.128} $$

which is the Klein-Gordon equation with $m = 0$. Even after adopting the Lorentz
gauge, we still have an extra degree of freedom to change the potential using a
function $\xi$ that satisfies

$$ A^\mu \to A'^\mu = A^\mu + \partial^\mu\xi, \qquad \partial_\mu\partial^\mu\xi = 0 \tag{5.129} $$

8) If one uses the symmetric Lagrangian of Eq. (5.21), one sees that the conjugate momentum to
$\psi^\dagger$ is $-i\psi/2$, which yields the same commutation relation.

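We shall show that although the potential has four components, it has only two degrees
of freedom because of the gauge symmetry. Writing the polarization vector of
the photon having energy-momentum $k$ as $\varepsilon^\mu(k, \lambda)$, the potential can be
expanded as

$$ A^\mu(x) = \sum_{k,\lambda}\frac{1}{\sqrt{2\omega V}}\left[a_{k,\lambda}\,\varepsilon^\mu(k,\lambda)\,e^{-ik\cdot x} + a_{k,\lambda}^\dagger\,\varepsilon^\mu(k,\lambda)^*\,e^{ik\cdot x}\right] \tag{5.130} $$

For the free electromagnetic potential, the following Einstein relation holds:

$$ k^2 = k_\mu k^\mu = 0 \quad \text{or} \quad \omega = k^0 = |\mathbf{k}| \tag{5.131} $$

The Lorentz condition Eq. (5.127) means

$$ k_\mu\varepsilon^\mu(k, \lambda) = 0 \tag{5.132} $$

Equation (5.129) means $\varepsilon$ has an extra degree of freedom to change

$$ \varepsilon^\mu \to \varepsilon'^\mu = \varepsilon^\mu + ik^\mu\xi(k), \qquad k^2 = 0 \tag{5.133} $$

We can choose $\xi(k)$ to make $\varepsilon^0 = 0$. Then the Lorentz condition becomes

$$ \mathbf{k}\cdot\boldsymbol{\varepsilon} = 0 \tag{5.134} $$

Therefore the free electromagnetic field (i.e. the electromagnetic wave) has only
transverse polarization, with two degrees of freedom.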
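The counting argument of Eqs. (5.131)-(5.134) can be followed with explicit numbers. In the sketch below the photon momentum, the starting polarization vector and the function names are illustrative assumptions; the real coefficient `c` plays the role of $i\xi(k)$ in Eq. (5.133).

```python
def minkowski_dot(a, b):
    """Four-vector product a.b with signature (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def remove_time_component(eps, k):
    """Residual gauge shift eps -> eps + c*k with c chosen so that eps'^0 = 0."""
    c = -eps[0] / k[0]          # stands in for i*xi(k) of Eq. (5.133)
    return tuple(e + c * kc for e, kc in zip(eps, k))

k = (2.0, 0.0, 0.0, 2.0)        # massless photon along z, so k^2 = 0
eps = (0.7, 1.0, 0.0, 0.7)      # obeys k.eps = 0 but has time and z components
eps_t = remove_time_component(eps, k)

if __name__ == "__main__":
    print("k^2          =", minkowski_dot(k, k))
    print("k.eps before =", minkowski_dot(k, eps))
    print("eps after    =", eps_t)
    print("k.eps after  =", minkowski_dot(k, eps_t))
    print("3-vector k.eps after =",
          sum(kc * e for kc, e in zip(k[1:], eps_t[1:])))
```

Because $k^2 = 0$, the shift does not spoil the Lorentz condition, and it removes the time and longitudinal components together, leaving only the two transverse ones.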

Coulomb Gauge and the Covariant Gauge 

Because of the gauge freedom, the quantization of the electromagnetic field is
nontrivial. The easiest way to understand it intuitively is to adopt the Coulomb gauge,
which minimizes the degrees of freedom. In this gauge, the potential satisfies the
condition

$$ \nabla\cdot\mathbf{A} = 0 \quad : \text{Coulomb gauge} \tag{5.135} $$

Then the field can be expressed in terms of the transverse potential $\mathbf{A}_\perp$ and the
instantaneous Coulomb potential (see Appendix B). In this gauge, only the transversely
polarized photons are quantized; they are treated as two independent massless
Klein-Gordon fields. The instantaneous Coulomb field has no time dependence, hence is
not a dynamical object. It is left as a classical field. In this gauge, manifest Lorentz
covariance is lost, although a detailed examination shows that the theory remains
covariant. Due to the absence of manifest covariance, calculations in general become
complicated, and the usual convention is to adopt a method we refer to as the covariant
gauge. It treats all four components of the field potential $A^\mu$ on an equal footing and
independently. Then we expect to have commutators like Eq. (5.141a), which is in conflict
with the Lorentz condition Eq. (5.127), at least as an operator condition. We also encounter
another problem. The Lagrangian density does not contain $\dot A^0$, which means that
the conjugate momentum $\pi^0$ does not exist. Therefore $A^0$ cannot be a dynamical
variable. A remedy to the problem is to add an additional term, called the gauge
fixing term, to the Lagrangian

$$ \mathcal{L} = -\frac{1}{4}F^{\mu\nu}F_{\mu\nu} - \frac{1}{2\lambda}(\partial_\mu A^\mu)^2 \tag{5.136} $$

Then the Euler-Lagrange equation becomes

$$ \partial_\mu\partial^\mu A^\nu - \left(1 - \frac{1}{\lambda}\right)\partial^\nu(\partial_\mu A^\mu) = 0 \tag{5.137} $$

which reproduces the desired Maxwell equation without imposing the Lorentz
gauge condition Eq. (5.127), if we set $\lambda = 1$. The price to pay is that the added
term is not gauge invariant and we no longer have the freedom to make another
gauge transformation. Namely, the gauge has been fixed and the choice of $\lambda = 1$ is
called the Feynman gauge.

In the covariant gauge, the four potential fields are independent and we can assign
four independent polarization vectors to each component. In the coordinate
system where the $z$ axis is taken along the photon momentum $\mathbf{k}$, they can be
expressed as

$$ \varepsilon^\mu(0) = (1, 0, 0, 0) \tag{5.138a} $$

$$ \varepsilon^\mu(1) = (0, 1, 0, 0) \tag{5.138b} $$

$$ \varepsilon^\mu(2) = (0, 0, 1, 0) \tag{5.138c} $$

$$ \varepsilon^\mu(3) = (0, 0, 0, 1) \tag{5.138d} $$

Circular polarization $\varepsilon(\pm)$ is also used instead of linear polarizations
$\varepsilon(1)$, $\varepsilon(2)$:

$$ \varepsilon^\mu(+) = (0, -1/\sqrt{2}, -i/\sqrt{2}, 0) \tag{5.138e} $$

$$ \varepsilon^\mu(-) = (0, 1/\sqrt{2}, -i/\sqrt{2}, 0) \tag{5.138f} $$

A quantum that has polarization $\varepsilon(3)$ is called a longitudinal photon, and that with
$\varepsilon(0)$, a scalar photon. The four polarization vectors satisfy the orthogonal condition

$$ \varepsilon_\mu(\lambda)\,\varepsilon^\mu(\lambda')^* = g_{\lambda\lambda'} \tag{5.139} $$

Notice the polarization of the scalar photon, unlike the others, is time-like, i.e.
$\varepsilon_\mu(0)\varepsilon^\mu(0) = 1 > 0$. Then the completeness condition or sum of the
polarization states is written as

$$ \sum_{\lambda,\lambda'} g_{\lambda\lambda'}\,\varepsilon^\mu(\lambda)\,\varepsilon^\nu(\lambda')^* = g^{\mu\nu} \tag{5.140} $$
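The completeness relation Eq. (5.140) can be verified directly for the vectors of Eq. (5.138). The sketch below assumes the metric signature $(+,-,-,-)$ and uses only the diagonal part of $g_{\lambda\lambda'}$; for the circular basis it assumes $g_{++} = g_{--} = -1$, which matches the normalization $\varepsilon_\mu(\pm)\varepsilon^\mu(\pm)^* = -1$. Names such as `polarization_sum` are chosen here for illustration.

```python
import math

METRIC = [1.0, -1.0, -1.0, -1.0]     # diagonal of g, signature (+, -, -, -)
S = 1.0 / math.sqrt(2.0)

# Polarization vectors of Eq. (5.138); list position is the polarization label.
LINEAR = [(1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1)]
CIRCULAR = [(1, 0, 0, 0), (0, -S, -1j * S, 0), (0, S, -1j * S, 0), (0, 0, 0, 1)]

def polarization_sum(vectors):
    """4x4 matrix sum_lambda g_{lambda lambda} eps^mu(lambda) eps^nu(lambda)^*."""
    return [[sum(METRIC[lam] * complex(vectors[lam][mu])
                 * complex(vectors[lam][nu]).conjugate()
                 for lam in range(4))
             for nu in range(4)]
            for mu in range(4)]

def equals_metric(matrix, tol=1e-12):
    """True if the matrix equals g^{mu nu} = diag(1, -1, -1, -1) within tol."""
    return all(abs(matrix[mu][nu] - (METRIC[mu] if mu == nu else 0.0)) < tol
               for mu in range(4) for nu in range(4))

if __name__ == "__main__":
    print("linear basis reproduces g:  ", equals_metric(polarization_sum(LINEAR)))
    print("circular basis reproduces g:", equals_metric(polarization_sum(CIRCULAR)))
```

The weight $g_{00} = +1$ for the scalar photon and $-1$ for the three space-like polarizations is what turns the sum of outer products into the metric tensor.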



Unphysical Photons 

The usual quantization method we have developed so far leads to a commutation
relation for the electromagnetic fields

$$ [A^\mu(x), \pi^\nu(y)]_{t_x = t_y} = -ig^{\mu\nu}\,\delta^3(\mathbf{x} - \mathbf{y}) \tag{5.141a} $$

$$ \pi^\mu = \frac{\partial\mathcal{L}}{\partial\dot A_\mu} = (-\partial_\nu A^\nu, \mathbf{E}) \tag{5.141b} $$

As we can see, the Lorentz gauge condition ($\partial_\mu A^\mu = 0$) contradicts the above
commutation relation and forces $\pi^0$ to vanish, thereby invalidating construction
of canonical commutation relations for the fields. So we adopt a weaker Lorentz
condition valid only for the observable states $|\Phi\rangle$, $|\varphi\rangle$

$$ \langle\varphi|\partial_\mu A^\mu|\Phi\rangle = 0 \tag{5.142} $$

and eliminate unwanted components from the final expression for the observables.
As a rigorous mathematical treatment of the method is beyond the scope
of this textbook, we only describe the essential ingredients that are necessary for
later discussions. The commutation relation consistent with the field commutator
Eq. (5.141a) is

$$ [a_{k,\lambda}, a_{k',\lambda'}^\dagger] = -g_{\lambda\lambda'}\,\delta_{kk'} \tag{5.143} $$

which gives a negative norm ($\langle\lambda = 0|\lambda = 0\rangle = -1$) for the scalar photon.
Obviously a photon having negative probability is unphysical. The fact is that the
longitudinal and the scalar photon appear only in intermediate states and never appear
as physical photons, which satisfy the Einstein relation
$k_\mu k^\mu = \omega^2 - |\mathbf{k}|^2 = 0$. The unusual property of the scalar photon works in
such a way as to compensate unphysical contributions of the longitudinal polarization states.

Intermediate photons created by the electric current $j^\mu$ satisfy the Maxwell equation

$$ \partial_\mu\partial^\mu A^\nu = q j^\nu \tag{5.144} $$

Comparing both sides in momentum space, it is clear that the energy-momentum
of the intermediate photon does not satisfy the Einstein relation for the particle:

$$ k_\mu k^\mu \neq 0 \tag{5.145} $$

The photon is said to be off-shell, or in a virtual state. Equation (5.144) is a result
of interaction terms in the Lagrangian

$$ \mathcal{L} \to \mathcal{L}' = \mathcal{L} - q j_\mu A^\mu \tag{5.146} $$

Later we shall learn that the interaction term in the above equation appears naturally
as a consequence of the "local" gauge invariance for a system including both
the matter (Dirac) field and the electromagnetic field, but here we assume it as given.
Virtual photons are produced and absorbed by the currents (i.e. by interacting
charged particles) and appear as a propagator in the form $\sim 1/k^2$ (to be described
soon) in the equation. While in the Coulomb gauge only the transverse polarizations
need to be considered, in the covariant gauge all four polarization states have
to be included:

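$$ \sum_{\lambda = \pm}\varepsilon^i(\lambda)\,\varepsilon^j(\lambda)^* = \delta_{ij} - \frac{k^i k^j}{|\mathbf{k}|^2} \quad : \text{Coulomb gauge} \tag{5.147a} $$

$$ \to \sum_{\lambda = 0}^{3}\varepsilon^\mu(\lambda)\,\varepsilon^\nu(\lambda)^* \to -g^{\mu\nu} \quad : \text{covariant gauge} \tag{5.147b} $$

The arrow in Eq. (5.147b) means it is not an actual equality. Negative norm for the
scalar polarization is implicitly assumed as shown by Eq. (5.140). To see the difference
between the two methods, we use the fact that in the transition amplitude
they appear sandwiched by their sources, the conserved currents. As a typical example,
take the matrix element for electron-muon scattering, which has the form
[see Eq. (7.12)]

$$ \mathcal{M} \sim j_\mu\,\frac{-g^{\mu\nu}}{k^2}\,j'_\nu \tag{5.148} $$

where $j$ and $j'$ denote the two currents. Since the conserved current satisfies

$$ \partial_\mu j^\mu = 0 \;\to\; k_\mu j^\mu = k^0 j^0 - |\mathbf{k}|\,j^3 = 0 \tag{5.149} $$

in the coordinate system where the $z$-axis is taken along the momentum direction,
Eq. (5.148) after summation of polarization states can be rewritten as

$$ j_\mu\,\frac{-g^{\mu\nu}}{k^2}\,j'_\nu = -\frac{1}{k^2}\left(j^0 j'^0 - j^3 j'^3 - \mathbf{j}_\perp\cdot\mathbf{j}'_\perp\right) = -\frac{1}{k^2}\left(1 - \frac{(k^0)^2}{|\mathbf{k}|^2}\right)j^0 j'^0 + \frac{\mathbf{j}_\perp\cdot\mathbf{j}'_\perp}{k^2} = \frac{j^0 j'^0}{|\mathbf{k}|^2} + \frac{\mathbf{j}_\perp\cdot\mathbf{j}'_\perp}{k^2} \tag{5.150a} $$

$$ \mathbf{j}_\perp = \mathbf{j} - \frac{\mathbf{k}}{|\mathbf{k}|^2}(\mathbf{k}\cdot\mathbf{j}), \qquad \mathbf{k}\cdot\mathbf{j}_\perp = 0 \tag{5.150b} $$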




The first term represents the contribution of an instantaneous Coulomb potential 9)
and the second is the contribution of the current coupled to the transverse photon.
In the Coulomb gauge, the two terms are treated separately and only the transverse
photons are quantized. In the covariant gauge the Coulomb term is included in the
contribution of the longitudinal and the scalar photon. Note that the proof was given in
a special coordinate system, but Eq. (5.148) and the current conservation law Eq. (5.149)
are written in Lorentz-invariant form. Therefore, a proof valid in one coordinate
system is equally valid in any coordinate system connected by the proper Lorentz
transformation.

When the photon appears as a real (external) particle, we may still take the total
sum instead of only transverse polarizations, as will be shown later in Eq. (7.54).

In summary, longitudinal and scalar photons, which are artifacts of the covariant
gauge, have unphysical properties, but they appear only in intermediate states,
manifesting themselves as the instantaneous Coulomb potential. We can treat all
four components of the electromagnetic potential in a Lorentz-covariant way and
may safely take the polarization sum from 0 to 3 to obtain correct results.
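The decomposition of the covariant expression into a Coulomb term plus a transverse term can be checked numerically for an off-shell photon. The sketch below takes the $z$ axis along $\mathbf{k}$, fixes $j^3$ from current conservation $k^0 j^0 - |\mathbf{k}| j^3 = 0$, and compares $-j_\mu j'^\mu / k^2$ with $j^0 j'^0/|\mathbf{k}|^2 + \mathbf{j}_\perp\cdot\mathbf{j}'_\perp/k^2$. Function names and the sample numbers are illustrative assumptions.

```python
def conserved_current(j0, j1, j2, k0, kz):
    """Build j^mu = (j0, j1, j2, j3) with j3 fixed by k0*j0 - kz*j3 = 0 (z axis along k)."""
    return (j0, j1, j2, k0 * j0 / kz)

def covariant_form(j, jp, k0, kz):
    """-(j . j') / k^2 with the Minkowski product, i.e. j_mu (-g^{mu nu}/k^2) j'_nu."""
    k_sq = k0 ** 2 - kz ** 2
    dot = j[0] * jp[0] - j[1] * jp[1] - j[2] * jp[2] - j[3] * jp[3]
    return -dot / k_sq

def coulomb_plus_transverse(j, jp, k0, kz):
    """Instantaneous Coulomb term j0 j0'/|k|^2 plus transverse term j_perp.j_perp'/k^2."""
    k_sq = k0 ** 2 - kz ** 2
    coulomb = j[0] * jp[0] / kz ** 2
    transverse = (j[1] * jp[1] + j[2] * jp[2]) / k_sq
    return coulomb + transverse

if __name__ == "__main__":
    k0, kz = 1.0, 2.0                      # off-shell: k^2 = -3
    j = conserved_current(3.0, 0.4, -1.1, k0, kz)
    jp = conserved_current(-2.0, 0.9, 0.2, k0, kz)
    print("covariant form:        ", covariant_form(j, jp, k0, kz))
    print("Coulomb + transverse:  ", coulomb_plus_transverse(j, jp, k0, kz))
```

The agreement relies on current conservation for both currents; with a non-conserved current the two expressions differ, which is why the scalar and longitudinal contributions only cancel between conserved sources.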



Spin and Statistics 

We have seen that the creation and annihilation operators have to satisfy the com- 
mutation relation for the scalar and vector fields and anticommutation relation for 
the Dirac field. This was a requirement to make the energy positive definite. The 
relation can be generalized and the following well-known spin-statistics relation 
holds. 



Particles having integer spin obey Bose-Einstein statistics and those with half- 
integer spin obey Fermi-Dirac statistics. 



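We shall re-derive the spin-statistics relation from a somewhat different point of view.
We allow both commutation and anticommutation relations for the creation-annihilation
operators and see what happens. Writing $[A, B]_\mp = AB \mp BA$, we set

$$ [a_k, a_{k'}^\dagger]_\mp = \delta_{kk'}, \qquad [a_k, a_{k'}]_\mp = [a_k^\dagger, a_{k'}^\dagger]_\mp = 0 \tag{5.151} $$

We first calculate the commutators of the scalar fields. Using the above equations

$$ [\phi(x), \phi(y)]_\mp = \sum_k\frac{1}{2\omega V}\left[e^{-ik\cdot(x-y)} \mp e^{ik\cdot(x-y)}\right] = \begin{cases} i\Delta(x-y) \\ i\Delta_1(x-y) \end{cases} \tag{5.152} $$

In the continuous limit the two functions are

$$ i\Delta(x-y) = \int\frac{d^3k}{(2\pi)^3 2\omega}\left[e^{-ik\cdot(x-y)} - e^{ik\cdot(x-y)}\right] = \int\frac{d^4k}{(2\pi)^3}\,\varepsilon(k^0)\,\delta(k^2 - m^2)\,e^{-ik\cdot(x-y)} \tag{5.153a} $$

$$ i\Delta_1(x-y) = \int\frac{d^3k}{(2\pi)^3 2\omega}\left[e^{-ik\cdot(x-y)} + e^{ik\cdot(x-y)}\right] \sim \exp\left[-m\sqrt{|\mathbf{x}-\mathbf{y}|^2 - (x^0-y^0)^2}\right] \quad \text{for } -(x-y)^2 \gg \frac{1}{m^2} \tag{5.153b} $$

where $\varepsilon(k^0) = \pm 1$ for $k^0 \gtrless 0$ and we used

$$ \delta(k^2 - m^2) = \frac{1}{2E}\left[\delta(k^0 - E) + \delta(k^0 + E)\right], \qquad E = \sqrt{\mathbf{k}^2 + m^2} \tag{5.154} $$

The function $\Delta(x)$ has the properties

$$ \Delta(x)\big|_{x^0 = 0} = 0, \qquad \frac{\partial\Delta(x)}{\partial t}\bigg|_{x^0 = 0} = -\delta^3(\mathbf{x}) \tag{5.155a} $$

$$ [\partial_\mu\partial^\mu + m^2]\,\Delta(x) = 0 \tag{5.155b} $$

9) $(j^0)^2/\mathbf{k}^2$ is the Fourier transform of a static Coulomb potential
$V = \rho(\mathbf{x})\rho(\mathbf{y})/|\mathbf{x} - \mathbf{y}|$.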



It is Lorentz invariant, as can be seen from Eq. (5.153a), because the factors in
the integral are all Lorentz scalars. This applies also to $\varepsilon(k^0)$, since $k$ lies on the
mass shell ($k^2 - m^2 = 0$) and the sign of $k^0$ does not change under any proper
(orthochronous) Lorentz transformation: time-like ($k^2 > 0$) momentum vectors with
$k^0 > 0$ always lie in the forward light cone and those with $k^0 < 0$ always in the backward
light cone. Since $\Delta(x - y) = 0$ for $x^0 = y^0$, Lorentz invariance dictates that it
also vanishes at space-like distance [$(x - y)^2 < 0$]. This is not true for the function
$\Delta_1$. It is also Lorentz invariant but it does not vanish at equal time. In fact, according
to Eq. (5.153b) it does not vanish outside the light cone, at least within a distance of
the Compton wavelength of the particle. This means that if we adopt the anticommutator
for the scalar field it is described by the function $\Delta_1(x - y)$, and causality is
violated. 11) Remember that in quantum mechanics, noncommuting operators obey
the uncertainty principle and their values cannot be determined independently. On
the other hand, if we adopt the commutator then it becomes $\Delta(x - y)$, the two
measurements at space-like distance are not related and causality is respected.

10) For the relation between discrete sum and continuous ir 

11) Strictly speaking we should talk about commutators of o
related by simple algebra.

We can do similar calculations for the Dirac field:

$$ [\psi(x), \bar\psi(y)]_\pm = \sum_{p,r}\frac{1}{2EV}\left[u_r(p)\bar u_r(p)\,e^{-ip\cdot(x-y)} \pm v_r(p)\bar v_r(p)\,e^{ip\cdot(x-y)}\right] $$

$$ = \sum_p\frac{1}{2EV}\left[(\not{p} + m)\,e^{-ip\cdot(x-y)} \pm (\not{p} - m)\,e^{ip\cdot(x-y)}\right] $$

$$ = (i\not{\partial} + m)\begin{cases} i\Delta(x-y) \\ i\Delta_1(x-y) \end{cases} \tag{5.156} $$

where $\not{p} = \gamma^\mu p_\mu$ is the Feynman slash. This shows that unless we choose the
anticommutation relation for the Dirac field, the causality condition is not satisfied.
Since fields of spin higher than 1/2 can be considered as being built up of products
of spin-1/2 fields, any (half) integer spin field has to satisfy (anti)commutation
relations if causality is respected.



Vacuum Fluctuation 

We have seen that the Hamiltonian of the fields [see Eq. (5.105a) and Eq. (5.121a)]
has a constant term, i.e. the zero-point energy. In the case of the harmonic oscillator,
it reflects the uncertainty of the position under confinement by the potential;
the resultant uncertainty of the momentum, in turn, produces the minimum
energy even in the ground state. Namely, it is a quantum fluctuation associated
with the uncertainty principle. In field theory it becomes a diverging vacuum energy.
Usually, observed energy is measured from a certain reference point and only
the difference matters in ordinary situations. As long as the infinite energy stays
constant, it does no harm, unless we are treating gravity. Experimentally, however,
there is evidence that the zero-point energy is real. It is the energy of the vacuum,
where "nothing" exists. By nothing we mean the absence of every form of matter that is
known to us and of energy fields such as the electromagnetic field. It permeates space,
so how do we know of its existence? The fact is that lacking a gravitational tool we cannot
know its absolute value. But we can change the boundary condition surrounding
the vacuum we want to measure and by doing so its variation can be measured.
Other known examples of vacuum fluctuation are the Lamb shift, which is described
in Sect. 8.2.1, and van der Waals forces, which act on molecules.

We emphasize the importance of understanding the vacuum energy for two reasons.
First, the Standard Model of elementary particles has its foundation in the
Higgs mechanism, which considers the vacuum to be a dynamical object that can
metamorphose and is indeed in a sort of superconducting phase as a way of generating
mass for the particles. Second, the recently established concordance model
of cosmology acknowledges that 70% of the cosmic energy is the vacuum energy,
alias dark energy, which causes accelerated expansion of the universe. Pursuit of
the vacuum energy could well be the theme of the 21st century.

5.5.1
The Casimir Effect

Let us investigate the existence of the zero-point energy. As an example, consider a
limited space between two large parallel metal plates.

We idealize the plates by assuming them to be perfectly conducting and to have a
square shape of area $L^2$ at a gap distance $d \ll L$, as depicted in Fig. 5.1; we ignore
edge effects. The vacuum energy of the electromagnetic field is given by

$$ E_0 = \frac{1}{2}\sum_k\omega_k, \qquad \omega_k = \sqrt{k_x^2 + k_y^2 + k_z^2} \tag{5.157} $$

When the gap distance is sufficiently small, the wave modes that can exist inside
the gap volume are limited. Only standing waves are allowed. We expect the volume
to be filled with less energy inside the boundary than outside the boundary,
and there will be an attractive force between the plates, which is called the Casimir
force [92]. 12) Although the zero-point energy in each case is divergent, the difference
turns out to be finite. The calculation goes as follows. The wave number
$k_z$ that is perpendicular to the plates can have only discrete values $k_z = n\pi/d$
($n = 0, 1, 2, \cdots$), whereas it can be continuous if the plates do not exist. Only waves
with transverse polarization contribute to the energy density:

$$ E_{\rm free} = \frac{1}{2}\frac{L^2}{(2\pi)^2}\int dk_x dk_y\int_0^\infty dn\; 2\sqrt{k_x^2 + k_y^2 + \left(\frac{n\pi}{d}\right)^2} \tag{5.158a} $$

$$ E_{\rm gap} = \frac{1}{2}\frac{L^2}{(2\pi)^2}\int dk_x dk_y\left[\sqrt{k_x^2 + k_y^2} + 2\sum_{n=1}^{\infty}\sqrt{k_x^2 + k_y^2 + \left(\frac{n\pi}{d}\right)^2}\right] \tag{5.158b} $$

In writing Eq. (5.158b), we have taken into account that when $k_z = 0$, the propagating
waves in the gap can have only one polarization mode. Then the difference
in the energy density is

$$ \mathcal{E} = \frac{E_{\rm gap} - E_{\rm free}}{L^2 d} = \frac{\pi^2}{4d^4}\left[\frac{1}{2}f(0) + \sum_{n=1}^{\infty}f(n) - \int_0^\infty dn\,f(n)\right] = -\frac{\pi^2}{720\,d^4}, \qquad f(n) = \int_0^\infty d\tau\,\sqrt{\tau + n^2} \tag{5.159} $$

where $\tau = (d/\pi)^2(k_x^2 + k_y^2)$.

12) The sign of the Casimir force depends on the geometry. For instance, the geometry of two
hemispheres causes a repulsive force [74].

Proof: Equation (5.159) is apparently not well defined because it diverges as
$\tau \to \infty$. Therefore we regularize the equation by putting 13)

$$ f_\alpha(\tau, z) = \sqrt{\tau + z^2}\,e^{-\alpha\sqrt{\tau + z^2}} \tag{5.160} $$

Then we can use the Abel-Plana formula

$$ \sum_{n=0}^{\infty}f(n) - \int_0^\infty f(z)\,dz = \frac{1}{2}f(0) + i\int_0^\infty\frac{f(it) - f(-it)}{e^{2\pi t} - 1}\,dt \tag{5.161} $$

13) This is not quite an ad hoc patch up. For wavelengths shorter than the atomic size, the
perfect-conductor approximation is unrealistic, so it is natural to introduce
a smooth cutoff, which attenuates the wave when $k \gg 1/a$, where $a \sim$ atomic distance. In fact, it
is known that above the plasma frequency $\omega = \sqrt{ne^2/m\varepsilon_0}$ a metal becomes transparent to light.

Conditions for the formula to be valid are:

(1) f(z) is analytic (differentiable) in the region Re(z) > 0. 

(2) lim y ^ ±00 e~ 2a ^f(x + iy) = uniformly for all x > 0. 

(3) linwoo/!^ e- 2 ^f(x + iy)dy = 0^ 

Obviously the regularized function f(z) = f dr f a (t, z) Eq. (5.160) satisfies the 
conditions (2) and (3) for a > 0. We note we can safely put a = in the r.h.s. of 
the Abel-Plana formula. When z + 1 2 < 0, we choose the branch of the square root 
function as follows to ensure condition (1): 

f(it) = f(-it) = Vr- t 2 for t 2 <z (5.162a) 

f(it) = -f(-it) = iVt 2 -r for t 2 > r (5.162b) 

Then Eq. (5.159) becomes 

=4r^r^ /(it) r /Ht) 

8d 4 io Jo e 2jrt - 1 

=-^f dT C dt ^ (5 - i63) 

^2 ,-oo ,-t 2 / t 2 _ T jj.2 ,-oo j3 

im - 



'~-<C d %> 



8jt 2SV ' 720d 4 

where the Zeta (£) function is defined in the region Re(s) > 1 as 

£(s) = f; i (5.164) 

which is known to be holomorphic (an analytic function) in all the complex s plane 
except at s = 1. □ 
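As a numerical cross-check of the last steps of the proof, the following minimal sketch integrates $t^3/(e^{2\pi t}-1)$ and compares it with $\Gamma(4)\zeta(4)/(2\pi)^4 = 1/240$. The helper names (`integrand`, `simpson`), the cutoff at $t = 10$ and the number of steps are illustrative assumptions, not part of the text; natural units ($\hbar = c = 1$) are assumed.

```python
import math

def integrand(t):
    """t^3 / (exp(2*pi*t) - 1); the limit at t = 0 is 0."""
    if t == 0.0:
        return 0.0
    return t**3 / math.expm1(2.0 * math.pi * t)

def simpson(f, a, b, n):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

# The integrand is negligible beyond t = 10 (suppressed by exp(-2*pi*t)).
integral = simpson(integrand, 0.0, 10.0, 20000)

# Closed form Gamma(4) * zeta(4) / (2*pi)^4, with zeta(4) from a partial sum.
zeta4 = sum(1.0 / n**4 for n in range(1, 100001))
closed_form = math.gamma(4) * zeta4 / (2.0 * math.pi) ** 4

# Energy density in units of 1/d^4: -(pi^2/3) * integral.
coefficient = -(math.pi**2 / 3.0) * integral

print(integral, closed_form, 1.0 / 240.0)
print(coefficient, -math.pi**2 / 720.0)
```

Both printed pairs should agree to many digits, which is consistent with the coefficient $-\pi^2/720$ quoted for the energy density.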

Now that we have obtained the extra energy density in the gap, we can calculate 
the Casimir force per unit area that acts on the parallel plates: 

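$$\frac{F_C}{L^2} = -\frac{\partial(\epsilon d)}{\partial d} = -\frac{\pi^2\hbar c}{240\,d^4} = -\frac{1.3\times 10^{-3}}{(d\,[\mu{\rm m}])^4}\ {\rm N/m^2} \tag{5.165}$$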

Practically, the above value has to be corrected for nonperfect conductivity and for the finite temperature environment. When an electric potential of 17 mV is applied between the plates at the distance of $d = 1\ \mu$m, the Coulomb force becomes stronger, and therefore it is a very difficult experiment to prove the formula. Experimentally, it is difficult to configure two parallel plates uniformly separated by a distance of less than a micrometer, a minimum requirement for the Casimir force to become dominant over gravity between the plates. Therefore, one plate is replaced by a sphere of radius $R$. For $R \gg d$, where $d$ is the minimum distance between the sphere and the plate, the Casimir force acting on the sphere is modified to

$$F_C = -\frac{\pi^3\hbar c}{360}\frac{R}{d^3}$$


The first successful experiment [249] proved the existence of the Casimir force to an 
accuracy of better than 5%. Recent experiments (Fig. 5.2) have reached an accuracy 
on the 1% level. 
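The numbers quoted above can be reproduced with a short calculation. The sketch below evaluates the ideal parallel-plate pressure of Eq. (5.165) at $d = 1\ \mu$m and the voltage at which the electrostatic pressure between the plates equals it. The capacitor pressure formula $\epsilon_0 V^2/(2d^2)$ is a standard result assumed here, not taken from the text, and the function names are illustrative.

```python
import math

HBAR = 1.054571817e-34   # reduced Planck constant, J s
C = 299792458.0          # speed of light, m/s
EPS0 = 8.8541878128e-12  # vacuum permittivity, F/m

def casimir_pressure(d):
    """Magnitude of the ideal Casimir pressure (N/m^2) for a gap d in metres."""
    return math.pi**2 * HBAR * C / (240.0 * d**4)

def matching_voltage(d):
    """Voltage (V) at which eps0 * V^2 / (2 d^2) equals the Casimir pressure."""
    return d * math.sqrt(2.0 * casimir_pressure(d) / EPS0)

d = 1.0e-6  # 1 micrometre
p_casimir = casimir_pressure(d)
v_equal = matching_voltage(d)
print(f"Casimir pressure at 1 um: {p_casimir:.3e} N/m^2")
print(f"Voltage giving equal electrostatic pressure: {v_equal * 1e3:.1f} mV")
```

Because the Casimir pressure scales as $1/d^4$ and the electrostatic pressure as $1/d^2$, the matching voltage grows as $1/d$ when the gap is reduced.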




Figure 5.2 (a) The experimental apparatus of Harris et al. [197]. A metal sphere is supported by a cantilever. When the Casimir force acts on the sphere, the cantilever bends and its motion is measured by the displacement of the laser reflection at the photodiodes. (b) Experimental data on the Casimir force as a function of the plate-sphere distance. The data exactly fit the theoretical curve.






Strange Properties of the Vacuum 
Negative Pressure A vacuum filled with zero-point energy has the same energy density $\rho$ everywhere. This means that the vacuum has negative pressure. Compressed air confined in a balloon has outward pressure. Does the negative pressure of the vacuum produce inward forces? If we confine a small volume of vacuum in a cylinder and consider the work the vacuum does on the outside, as depicted in Fig. 5.3, $P = -\rho$ can be shown. Another way is to use the first law of thermodynamics. For an isolated system, the internal energy decreases if work is done on the outside world, namely $dU = -P\,dV$. Since $U = \rho V$ with constant $\rho$, we have $dU = \rho\,dV$ and obtain

$$P = -\rho$$

Figure 5.3 Negative pressure of the vacuum. When confined vacuum in a box of volume $V$ expands, exerting force on the moving wall, the pressure is given by $P = F/S = -dE/dV = -(1/S)(\partial E/\partial x)$. Since the total vacuum energy in a volume is $\rho_v V$ ($\rho_v$ = constant), $P = -\rho_v$.



Antigravity We know our universe is expanding, and until only a few years ago its 
rate was thought to be decreasing because the gravitational force of matter in the 
universe counteracts it. Intuitively one might think the space filled with vacuum en- 
ergy would shrink even more. On the contrary, it accelerates the expansion according 
to general relativity. Namely, the vacuum energy has repulsive gravitational force. 
Whether the universe keeps expanding or eventually collapses depends critically on 
the balance between matter and vacuum energy in the universe. It is known that, 
in cosmic history, there were two epochs in which the vacuum energy dominated: 
the inflation epoch and now. The vacuum energy of the former could be of dynam- 
ic origin, where the field (dubbed inflaton) breaks a symmetry spontaneously and 
liberates the vacuum energy in the form of latent heat that has disappeared as time 
went by. But the vacuum energy (also known as dark energy), which occupies ~ 70% 
of the present cosmic energy budget, seems static, i.e. has constant energy densi- 
ty (see Section 19.2.7). This means it could also have existed in the inflation epoch. 
However, the observed value differs from that of the inflation by some 120 orders of magnitude! Remember, for instance, that a tiny quark ($< 10^{-18}$ cm) could be larger than the whole universe (> 130 billion light years) if magnified by a mere 50 orders of magnitude. How could nature provide two kinds of energy of such magnitude? This is a big number. This is the notorious cosmological constant problem. The constant vacuum energy density means also that the total cosmic energy increases as the universe expands. Is the law of energy conservation violated?



6 

Scattering Matrix 



Clues to clarify the inner structure of particles can be obtained, in most cases, by investigating their scattering or decay patterns. In nonrelativistic quantum mechanics, the transition rates of these processes are usually calculated using perturbation expansions. The underlying assumption is that the interaction energy, i.e. the expectation value of the interaction Hamiltonian, is small compared to the system energy, which is represented by the total Hamiltonian $H$. The transition rate in quantum mechanics is given by the so-called Fermi's golden rule:

$$W_{fi} = 2\pi\,|\langle f|T|i\rangle|^2\,\rho \tag{6.1a}$$

$$\langle f|T|i\rangle = \langle f|H_I|i\rangle + \sum_n \frac{\langle f|H_I|n\rangle\langle n|H_I|i\rangle}{E_i - E_n} + \cdots \tag{6.1b}$$



Here, $\langle f|H_I|i\rangle$ is the matrix element of the interaction Hamiltonian between the initial state and the final state, $\rho$ is the final state density, and in the case of a one-body problem with a scattering center it is expressed as

$$\rho = \delta(E_i - E_f)\,\frac{V\,d^3p}{(2\pi)^3} \tag{6.1c}$$

where $V$ is the normalization volume. The purpose of this chapter is to derive the relativistic version of Eqs. (6.1a)-(6.1c) and prepare materials. Namely, we calculate the relativistic version of the vertex functions ($\langle n|H_I|i\rangle$) of the electromagnetic interaction and the propagators [$1/(E_i - E_n)$] in Eq. (6.1b). QED calculation of the various processes, which amounts to assembling these materials in the form of scattering matrices, will be dealt with in the next chapter.



6.1

Interaction Picture

In quantized field theory, the field operators in the Heisenberg picture obey the Heisenberg equation of motion:

$$\frac{dO_H}{dt} = i[H, O_H] \tag{6.2}$$

where "O" means any operator composed of the field $\phi$ and its conjugate $\pi$. States on which the operators act are the Fock vectors $|\alpha_H\rangle$, which are constant in time. So far we have expressed every physical object in terms of the free fields. From now on we want to handle interacting fields. The Heisenberg picture naturally takes into account interactions when the Hamiltonian contains the interaction. However, generally we do not know how to express the exact solutions in analytical form, even if they can be solved formally. The standard procedure is that we start from the free field, for which the exact solutions are known, and treat the interaction as a small perturbation. This is the most practical way to approach realistic problems. It is convenient to introduce the interaction picture to handle the perturbative expansion. To formulate it we start from the Schrödinger picture. In the Schrödinger picture, operators are constant in time and all the time dependence is contained in the state vectors. In the following, we attach the subscripts S, H and I for operators and vectors in the Schrödinger, Heisenberg and interaction pictures, respectively. The Schrödinger equation is expressed as

$$i\frac{\partial}{\partial t}|\alpha_S(t)\rangle = H_S|\alpha_S(t)\rangle \tag{6.3}$$

A formal solution can be written as 

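$$|\alpha_S(t)\rangle = e^{-iH_S t}|\alpha_S(0)\rangle \tag{6.4}$$

where $|\alpha_S(0)\rangle$ is to be determined by initial conditions. The Heisenberg picture can be obtained by a unitary transformation

$$|\alpha_H\rangle = e^{iH_S t}|\alpha_S(t)\rangle = |\alpha_S(0)\rangle \tag{6.5a}$$

$$\phi_H = e^{iH_S t}\phi_S e^{-iH_S t}, \qquad \pi_H = e^{iH_S t}\pi_S e^{-iH_S t} \tag{6.5b}$$

$$O_H = e^{iH_S t}\,O_S(\phi_S, \pi_S)\,e^{-iH_S t} = O(\phi_H, \pi_H) \tag{6.5c}$$

$$H_H = e^{iH_S t} H_S e^{-iH_S t} = H_S = H(\phi_H, \pi_H) \tag{6.5d}$$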

Note, the last equation tells us that the Hamiltonian in the Heisenberg picture ex- 
pressed in terms of the Heisenberg operators is the same as that in the Schrodinger 
picture. So we shall simply use H to denote the total Hamiltonian in both pictures. 
Despite the time independence of the Heisenberg state vector, the time variable 
is retained. It does not mean the state changes with time, but rather it should be 
interpreted as reflecting the dynamical development of the operator at time t. The 
Heisenberg equation (6.2) can easily be derived from Eq. (6.5c). 




The interaction picture is obtained by first separating the total Hamiltonian into 
the free and interacting parts: 

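$$H = H_0 + H_{IS} \tag{6.6a}$$

$$|\alpha_I(t)\rangle = e^{iH_0 t}|\alpha_S(t)\rangle \tag{6.6b}$$

$$\phi_I(t) = e^{iH_0 t}\phi_S e^{-iH_0 t}, \qquad \pi_I(t) = e^{iH_0 t}\pi_S e^{-iH_0 t} \tag{6.6c}$$

The equation of motion that $|\alpha_I(t)\rangle$ satisfies is given by

$$i\frac{\partial}{\partial t}|\alpha_I(t)\rangle = e^{iH_0 t}\left(-H_0 + i\frac{\partial}{\partial t}\right)|\alpha_S(t)\rangle = H_I(t)|\alpha_I(t)\rangle \tag{6.6d}$$

$$H_I(t) = e^{iH_0 t} H_{IS}\, e^{-iH_0 t} = H_I[\phi_I(t), \pi_I(t)] \tag{6.6e}$$

The operators in the interaction picture satisfy the free field equation

$$\frac{d\phi_I(t)}{dt} = i[H_0, \phi_I(t)]$$

where $H_0 = H_0\{\phi_I(t), \pi_I(t)\}$. State vectors and operators in the Heisenberg and the interaction picture are related by a unitary transformation:

$$|\alpha_H(t)\rangle = e^{iHt}|\alpha_S(t)\rangle = e^{iHt}e^{-iH_0 t}|\alpha_I(t)\rangle \equiv U^\dagger(t)|\alpha_I(t)\rangle \tag{6.7a}$$

$$O_H = U^\dagger(t)\,O_I\,U(t) \tag{6.7b}$$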



The time dependence of the operators in the interaction picture is dictated by $H_0$, and that of the state vector by $H_I$. Therefore, if $H_I$ vanishes, $|\alpha_I(t)\rangle$ does not depend on time and the interaction picture coincides with the Heisenberg picture. The equation of motion for the state vector $|\alpha_I(t)\rangle$ resembles the Schrödinger equation but the Hamiltonian $H_I$ includes the free field operators that are time dependent. A convenient feature of the interaction picture is that the field operators always satisfy the equation of free fields and that the field expansion developed in the previous section can be directly applied. That is no longer true for fields in the Heisenberg picture. The interacting Klein-Gordon field, for instance, can be expanded just like the free field

$$\phi_H(x) = \sum_k \frac{1}{\sqrt{2\omega V}}\left[a_H(k)\,e^{-ik\cdot x} + b_H^\dagger(k)\,e^{ik\cdot x}\right] \tag{6.8}$$

However, the coefficients of the plane waves are related with those of free fields by

$$a_H(k) = U^\dagger(t)\,a_{\rm free}(k)\,U(t), \qquad b_H^\dagger(k) = U^\dagger(t)\,b_{\rm free}^\dagger(k)\,U(t) \tag{6.9}$$

and no longer retain their simple physical interpretation as annihilation and creation operators for a single quantum of the field. Note, the interacting fields satisfy the same commutation relation as the free fields, because the commutation relation is invariant under unitary transformation.

What we shall need in the following is the transition amplitude from, say, a state $|\alpha(t')\rangle$ at time $t'$, which develops to $|\alpha(t)\rangle$ at time $t$, then transforms to another state $|\beta(t)\rangle$:

$$\langle f|i\rangle = \langle\beta_H(t)|\alpha_H(t)\rangle = \langle\beta_S(t)|e^{-iH(t-t')}|\alpha_S(t')\rangle \tag{6.10a}$$

where we have used Eqs. (6.5a) and (6.4). The transition amplitude does not depend on which picture is used because they are related by unitary transformations. Now we express it in the interaction picture. Using Eq. (6.7a), it can be expressed as

$$\langle f|i\rangle = \langle\beta_I(t)|\alpha_I(t)\rangle = \langle\beta_I(t)|e^{iH_0 t}e^{-iH(t-t')}e^{-iH_0 t'}|\alpha_I(t')\rangle \tag{6.10b}$$

$$|\alpha_I(t)\rangle = U(t,t')|\alpha_I(t')\rangle \tag{6.10c}$$

$$U(t,t') = e^{iH_0 t}e^{-iH(t-t')}e^{-iH_0 t'} = U(t)U^\dagger(t') \tag{6.10d}$$

$U(t,t')$ is referred to as the time evolution operator. $U(t,t')$ is manifestly unitary and satisfies the following identities (for $t_1 > t_2 > t_3$):

$$U(t_1,t_2)\,U(t_2,t_3) = U(t_1,t_3)$$
$$U(t_1,t_3)\,U^\dagger(t_2,t_3) = U(t_1,t_2) \tag{6.11}$$
$$U(t,t) = 1$$
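The properties of the time evolution operator can be checked numerically on a toy model. The sketch below is an illustration only: it assumes a three-level system with an arbitrarily chosen free Hamiltonian `H0` and a small Hermitian interaction `HI` (both hypothetical, not from the text), builds $U(t,t')$ from Eq. (6.10d), and tests the identities of Eq. (6.11) together with the equation of motion $i\,\partial U/\partial t = H_I(t)U$ by a finite difference.

```python
import numpy as np

def expm_herm(h, t):
    """exp(-i h t) for a Hermitian matrix h, via its eigendecomposition."""
    w, v = np.linalg.eigh(h)
    return (v * np.exp(-1j * w * t)) @ v.conj().T

# Toy Hamiltonian (illustrative choice): H = H0 + HI
H0 = np.diag([0.0, 1.0, 2.5]).astype(complex)
HI = 0.3 * np.array([[0, 1, 0], [1, 0, 1j], [0, -1j, 0]], dtype=complex)
H = H0 + HI

def U(t, tp):
    """U(t,t') = e^{i H0 t} e^{-i H (t-t')} e^{-i H0 t'}, Eq. (6.10d)."""
    return expm_herm(H0, -t) @ expm_herm(H, t - tp) @ expm_herm(H0, tp)

def H_int(t):
    """Interaction-picture Hamiltonian e^{i H0 t} HI e^{-i H0 t}."""
    return expm_herm(H0, -t) @ HI @ expm_herm(H0, t)

t1, t2, t3 = 1.7, 0.4, -0.9
identity = np.eye(3)

composition_ok = np.allclose(U(t1, t2) @ U(t2, t3), U(t1, t3))
inverse_ok = np.allclose(U(t1, t3) @ U(t2, t3).conj().T, U(t1, t2))
unitary_ok = np.allclose(U(t1, t3).conj().T @ U(t1, t3), identity)

# Central finite difference for i dU/dt = H_I(t) U
eps = 1e-5
dU = (U(t1 + eps, t3) - U(t1 - eps, t3)) / (2 * eps)
residual = np.linalg.norm(1j * dU - H_int(t1) @ U(t1, t3))

print(composition_ok, inverse_ok, unitary_ok, residual)
```

The matrix exponential is taken through the eigendecomposition because both `H0` and `H` are Hermitian; this avoids any dependency beyond NumPy.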

If the interaction works only in a limited volume, an assumption which is generally justified, it vanishes for $t \to \infty$, $t' \to -\infty$ in scattering processes. 1) Then, $\lim_{t\to-\infty}|\alpha_I(t)\rangle$ and $\lim_{t\to\infty}|\beta_I(t)\rangle$ can be taken as free Hamiltonian eigenstates. We may call the states prepared before scattering "in" states and those after scattering "out" states. As they are both eigenstates of the free Hamiltonian and are considered to make a complete orthonormal basis, they are connected by a unitary transformation, which we call the scattering matrix and denote as $S$. Then Eq. (6.10b) and Eq. (6.10c) give

$$S = U(\infty, -\infty) \tag{6.12a}$$

$$\langle f|i\rangle = \langle\beta_{\rm out}|\alpha_{\rm in}\rangle = \langle\beta_{\rm in}|S|\alpha_{\rm in}\rangle = \langle\beta_{\rm out}|S|\alpha_{\rm out}\rangle \tag{6.12b}$$

$$\langle\beta_{\rm out}| = \langle\beta_{\rm in}|S, \quad {\rm or} \quad |\alpha_{\rm in}\rangle = S|\alpha_{\rm out}\rangle \tag{6.12c}$$

The mean transition probability per unit time is given by

$$W_{fi} = \lim_{\substack{t\to\infty \\ t'\to-\infty}} \frac{|\langle\beta_I(t)|U(t,t')|\alpha_I(t')\rangle - \delta_{fi}|^2}{t - t'} \tag{6.13}$$

1) The interaction takes place typically within the scale of hadron size ~ $10^{-15}$ m. If a particle traverses the region at the velocity of light, the typical time scale of scattering is ~ $10^{-24}$ s. Comparatively speaking, the human time scale, for instance "a second", is infinite. For a long-range Coulomb-like force, the interaction range is, in principle, infinite but a screening effect due to the dielectricity of matter effectively damps the force beyond a few atomic distances.




where by subtracting $\delta_{fi}$, we have eliminated the process where the incoming particles pass without interaction. If the asymptotic states are represented by plane waves of definite momentum, the wave function extends over all space and can never be free. For correct treatment we need to use a wave packet formalism, but here we simply assume an adiabatic condition is implicit, in which the interaction Hamiltonian is slowly turned on and off at $|t| \to \infty$ so that there is no energy transfer to the system. A mathematical technique to warrant the assumption is to modify

$$H_I(t) \to H_I^\epsilon(t), \qquad H_I^\epsilon = H_I\, e^{-\epsilon|t|} \tag{6.14}$$

and let $\epsilon$ go to zero after all the calculations. This procedure is called the adiabatic approximation.



6.2

Asymptotic Field Condition

We consider an explicit representation for the interacting fields. In the relativistic action approach, the interaction Lagrangian is just minus the interaction Hamiltonian in the absence of derivative couplings. The fundamental interactions in the real world are governed by gauge theories. Other types of interactions appear generally as approximations in phenomenological approaches. In gauge theories, replacement of derivatives by covariant derivatives produces the interaction term and gives the source of the electromagnetic (or more generally gauge) field:

$$\bar\psi\gamma^\mu i\partial_\mu\psi(x) \to \bar\psi\gamma^\mu iD_\mu\psi(x) = \bar\psi\gamma^\mu i\partial_\mu\psi(x) - q\bar\psi\gamma^\mu\psi A_\mu \tag{6.15a}$$

$$\partial_\mu F^{\mu\nu} = q j^\nu = q\bar\psi\gamma^\nu\psi \tag{6.15b}$$

The interaction of quarks is governed by QCD, which is also a gauge theory with the color degree of freedom obeying $SU(3)$ internal symmetry. However, the pion, a composite of quarks, can phenomenologically be treated as a (pseudo) scalar field, which obeys the Klein-Gordon equation. Its interaction with the nucleon field $\psi_N$ is described by the Yukawa interaction, whose interaction Lagrangian is expressed as

$$\mathcal{L}_{\rm int} = -g\bar\psi(x)\gamma^5\psi(x)\pi(x) \tag{6.16}$$

The equation of motion can be expressed in a similar form as for the electromagnetic field. Setting $g\bar\psi(x)\gamma^5\psi(x) = J$

$$K_x\phi(x) \equiv (\partial_\mu\partial^\mu + m^2)_x\,\phi(x) = J(x) \tag{6.17}$$

For the following discussions, we generically write the source term as $J$ or $J^\mu$, depending on whether it is a scalar or a vector, and do not concern ourselves with its detailed structure.



Let us solve Eq. (6.17) formally. Denoting the Green's function by $G(x - y)$, we have, by definition

$$(\partial_\mu\partial^\mu + m^2)_x\, G(x - y) = -\delta^4(x - y) \tag{6.18}$$

The general solution to Eq. (6.17) is the sum of the homogeneous solutions (free field) to the equation and the integral of the Green's function times the inhomogeneous term:

$$\phi(x) = \phi_{\rm free}(x) - \int d^4y\, G(x - y)J(y) \tag{6.19}$$

We have to specify the boundary conditions to determine $\phi_{\rm free}$. The Green's function itself is not unique, either. Here we choose to use the advanced and retarded Green's functions

$$\Delta_{\rm ret}(x - y) = 0 \quad {\rm for}\ x^0 - y^0 < 0 \tag{6.20a}$$

$$\Delta_{\rm adv}(x - y) = 0 \quad {\rm for}\ x^0 - y^0 > 0 \tag{6.20b}$$

Then the second term in Eq. (6.19) vanishes as $t \to \mp\infty$, depending on whether we use the retarded or advanced functions. Therefore, we may identify the free field as $\phi_{\rm in}$ or $\phi_{\rm out}$, which are generators of the "in" and "out" states:

$$\phi(x) = \phi_{\rm in}(x) + \int d^4y\, \Delta_{\rm ret}(x - y)J(y) \tag{6.21a}$$

$$\phi(x) = \phi_{\rm out}(x) + \int d^4y\, \Delta_{\rm adv}(x - y)J(y) \tag{6.21b}$$

The "in" and "out" fields are related by

$$\phi_{\rm in} = S\,\phi_{\rm out}\,S^{-1} \tag{6.22}$$

This can be easily seen as

$$a_{\rm in}^\dagger(p) = S\,a_{\rm out}^\dagger(p)\,S^{-1}, \qquad a_{\rm out}(p') = S^{-1}a_{\rm in}(p')\,S \tag{6.23}$$

$$|p_{\rm in}\rangle \sim a_{\rm in}^\dagger(p)|0\rangle = S\,a_{\rm out}^\dagger(p)\,S^{-1}|0\rangle = S\,a_{\rm out}^\dagger(p)|0\rangle \sim S|p_{\rm out}\rangle \tag{6.24}$$

where ~ means "equal" except for the normalization. We also assumed the stability of the vacuum, namely $S|0\rangle = |0\rangle$ apart from the phase, which can be set as 1.

Since $\phi(x)$ approaches $\phi_{\rm in}(x)$ or $\phi_{\rm out}(x)$ asymptotically in the limit $t \to \mp\infty$, we are tempted to think

$$\phi(x) \xrightarrow{\ t\to\mp\infty\ } \sqrt{Z}\,\phi_{\substack{\rm in \\ \rm out}}(x) \tag{6.25}$$




which is called the "strong" asymptotic condition. But it has been shown that if it is imposed as an operator condition, no scattering occurs. Therefore, we impose a "weak" asymptotic condition

$$\lim_{t\to\mp\infty}\langle f|\phi(x)|i\rangle = \sqrt{Z}\,\langle f|\phi_{\substack{\rm in \\ \rm out}}(x)|i\rangle \tag{6.26}$$

The renormalization constant $Z$ appears because $\phi$ has the ability to produce extra virtual states, which reduces the probability amplitude of remaining in a state that $\phi_{\rm free}$ can create. However, for most of the following discussions we shall assume $Z = 1$. We need to include $Z$ explicitly when considering the higher order corrections, which we will treat in Chap. 8.



6.3

Explicit Form of the S-Matrix

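We now want to calculate the time evolution operator. Inserting Eq. (6.10c) into Eq. (6.6d), we obtain an equation for $U$:

$$i\frac{\partial}{\partial t}U(t,t_0) = H_I(t)\,U(t,t_0) \tag{6.27}$$

The initial condition is

$$U(t_0,t_0) = 1 \tag{6.28}$$

Using Eq. (6.28), Eq. (6.27) can be converted to an integral equation

$$U(t,t_0) = 1 - i\int_{t_0}^{t}dt'\,H_I(t')\,U(t',t_0) \tag{6.29}$$

Expanding successively,

$$U(t,t_0) = 1 - i\int_{t_0}^{t}dt_1\,H_I(t_1) + (-i)^2\int_{t_0}^{t}dt_1\int_{t_0}^{t_1}dt_2\,H_I(t_1)H_I(t_2) + \cdots \tag{6.30a}$$

$$\therefore\ S = 1 - i\int_{-\infty}^{\infty}dt\,H_I(t) + (-i)^2\int_{-\infty}^{\infty}dt_1\int_{-\infty}^{t_1}dt_2\,H_I(t_1)H_I(t_2) + \cdots \tag{6.30b}$$

Interchanging the variables $t_1 \leftrightarrow t_2$ in the third term and changing the integration order gives

$$\int_{-\infty}^{\infty}dt_1\int_{-\infty}^{t_1}dt_2\,H_I(t_1)H_I(t_2) = \int_{-\infty}^{\infty}dt_1\int_{t_1}^{\infty}dt_2\,H_I(t_2)H_I(t_1)$$

$$= \frac{1}{2}\int_{-\infty}^{\infty}dt_1\left[\int_{-\infty}^{t_1}dt_2\,H_I(t_1)H_I(t_2) + \int_{t_1}^{\infty}dt_2\,H_I(t_2)H_I(t_1)\right] = \frac{1}{2}\int_{-\infty}^{\infty}dt_1\int_{-\infty}^{\infty}dt_2\,T[H_I(t_1)H_I(t_2)] \tag{6.31}$$

where $T[\cdots]$ is a time-ordered product defined as

$$T[H_I(t_1)H_I(t_2)] = \begin{cases} H_I(t_1)H_I(t_2) & t_1 > t_2 \\ H_I(t_2)H_I(t_1) & t_1 < t_2 \end{cases} \tag{6.32}$$

Expressing $H_I$ in terms of the Hamiltonian density

$$H_I(t) = \int d^3x\,\mathcal{H}_I(t,\mathbf{x}) \equiv \int d^3x\,\mathcal{H}_I(x) \tag{6.33}$$

the S matrix can be expressed as

$$S = 1 - i\int d^4x\,\mathcal{H}_I(x) + \frac{(-i)^2}{2!}\iint d^4x_1\,d^4x_2\,T[\mathcal{H}_I(x_1)\mathcal{H}_I(x_2)] + \cdots$$

$$+ \frac{(-i)^n}{n!}\int\!\!\cdots\!\!\int d^4x_1\,d^4x_2\cdots d^4x_n\,T[\mathcal{H}_I(x_1)\mathcal{H}_I(x_2)\cdots\mathcal{H}_I(x_n)] + \cdots$$

$$\equiv T\exp\left[-i\int d^4x\,\mathcal{H}_I(x)\right] \tag{6.34}$$

Note that the last equation is a formal expression. Its real meaning is given by the preceding equation. As we can see from Eq. (6.34), the integration is carried out over all space-time and $\mathcal{H}_I = -\mathcal{L}_I$ if the interaction does not contain time derivatives, and hence the expression is relativistically invariant. The effect of multiplying by $T$ is a bit uncertain, but it does not destroy Lorentz invariance either, if the operator commutator vanishes in the space-like distance.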
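To see the expansion at work, the sketch below compares the Dyson series through second order with the exact evolution operator for a two-level toy model, and also checks numerically that the nested integral of Eq. (6.30a) equals one half of the time-ordered double integral of Eq. (6.31). The model (`H0`, `V`, coupling 0.05), the time interval and the grid are illustrative assumptions, and simple trapezoidal quadrature is used.

```python
import numpy as np

def expm_herm(h, t):
    """exp(-i h t) for a Hermitian matrix h, via its eigendecomposition."""
    w, v = np.linalg.eigh(h)
    return (v * np.exp(-1j * w * t)) @ v.conj().T

# Toy two-level model (illustrative): weak coupling so low orders suffice
H0 = np.diag([0.0, 1.0]).astype(complex)
V = 0.05 * np.array([[0, 1], [1, 0]], dtype=complex)

def H_I(t):
    """Interaction-picture Hamiltonian e^{i H0 t} V e^{-i H0 t}."""
    return expm_herm(H0, -t) @ V @ expm_herm(H0, t)

t0, t1, n = 0.0, 2.0, 201
ts = np.linspace(t0, t1, n)
dt = ts[1] - ts[0]
hs = np.array([H_I(t) for t in ts])          # shape (n, 2, 2)

w = np.full(n, dt)                           # trapezoid weights
w[0] = w[-1] = dt / 2

# First order: -i * integral of H_I
first = -1j * np.einsum('i,iab->ab', w, hs)

# Second order, nested form: inner[i] = integral of H_I from t0 to ts[i]
inner = np.zeros_like(hs)
inner[1:] = np.cumsum(0.5 * dt * (hs[1:] + hs[:-1]), axis=0)
second_nested = (-1j) ** 2 * np.einsum('i,iab,ibc->ac', w, hs, inner)

# Second order, time-ordered form: (1/2) * double integral of T[H_I H_I]
prod = np.einsum('iab,jbc->ijac', hs, hs)    # prod[i, j] = H_I(t_i) H_I(t_j)
later = ts[:, None] >= ts[None, :]
tprod = np.where(later[:, :, None, None], prod, prod.transpose(1, 0, 2, 3))
second_T = (-1j) ** 2 * 0.5 * np.einsum('i,j,ijac->ac', w, w, tprod)

# Exact U(t1, t0) = e^{i H0 t1} e^{-i H (t1 - t0)} e^{-i H0 t0}
U_exact = expm_herm(H0, -t1) @ expm_herm(H0 + V, t1 - t0) @ expm_herm(H0, t0)

U1 = np.eye(2) + first
U2 = U1 + second_nested
err1 = np.linalg.norm(U_exact - U1)
err2 = np.linalg.norm(U_exact - U2)
print(err1, err2)
```

With this coupling the second-order truncation should be more than an order of magnitude closer to the exact operator than the first-order one; the remaining difference is of third order in the coupling.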

Next we define the transition operator $T$ and the Lorentz-invariant matrix element $M_{fi}$ by

$$\langle f|S|i\rangle = \delta_{fi} - i2\pi\delta(E_i - E_f)\langle f|T|i\rangle$$

$$= \delta_{fi} - i(2\pi)^4\delta^4(P_i - P_f)\langle f|M|i\rangle \tag{6.35}$$

where $P_i$ and $P_f$ represent the sum of the initial- and final-state energy-momentum, respectively. Although we obtained a formal solution Eq. (6.34) for the scattering matrix, expansion up to second order is enough for most of our later treatments. In the nonrelativistic treatment, we may extract and separate time dependence explicitly.

$$\langle f|H_I(t)|i\rangle = \langle f|e^{iH_0 t}H_{IS}\,e^{-iH_0 t}|i\rangle = \langle f|H_{IS}|i\rangle\,e^{-i(E_i - E_f)t} \tag{6.36}$$

Inserting the above equation into Eq. (6.30), we obtain an equivalent formula to Eq. (6.35) as

$$\langle f|S|i\rangle = \delta_{fi} - 2\pi i\,\delta(E_i - E_f)\langle f|T(E = E_i)|i\rangle \tag{6.37a}$$

$$\langle f|T(E_i)|i\rangle = \langle f|H_{IS}|i\rangle + \sum_n \frac{\langle f|H_{IS}|n\rangle\langle n|H_{IS}|i\rangle}{E_i - E_n + i\epsilon} + \cdots \tag{6.37b}$$

where the adiabatic approximation $H_I^\epsilon = H_I e^{-\epsilon|t|}$ was used in the calculation. Thus we have re-derived Eq. (6.1b). We see the adiabatic approximation gives a prescription for handling the singularity at $E_i = E_n$. $T(E)$ can be expressed in operator form as

$$T(E) = H_I + H_I\frac{1}{E - H_0 + i\epsilon}H_I + \cdots \tag{6.38}$$

where we have used $H_I$ for $H_I(0) = H_{IS}$. This equation is the basis for the nonrelativistic treatment of perturbation theory, but sometimes it is more useful than completely relativistic equations.

Problem 6.1

Replacing $H_I$ with $H_I^\epsilon$ of Eq. (6.14), derive Eq. (6.37) from Eq. (6.30).

6.3.1 

Rutherford Scattering 

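Before using the relativistic scattering amplitude, let us reproduce the Rutherford scattering formula for later comparison. This is scattering of a charged particle having spin 0 by the Coulomb potential created by a scattering center. The matrix element in Eq. (6.37) is expressed as

$$\langle f|H_{IS}|i\rangle = \int d^3x\,\psi_f^*(x)\,H_{IS}\,\psi_i(x) \tag{6.39a}$$

$$H_{IS} = \frac{Ze^2}{4\pi r} \tag{6.39b}$$

If the 3-momenta of the incoming and outgoing particles are assumed to be $\mathbf{p}_i$ and $\mathbf{p}_f$, respectively, the wave functions are given by

$$\psi_i = N e^{i\mathbf{p}_i\cdot\mathbf{x}}, \qquad \psi_f = N e^{i\mathbf{p}_f\cdot\mathbf{x}} \tag{6.40}$$

where $N = 1/\sqrt{V}$ is a normalization factor, normalized as one particle in the volume $V$. Inserting Eqs. (6.39b) and (6.40) into Eq. (6.39a)

$$\langle f|H_{IS}|i\rangle = \frac{Ze^2}{4\pi V}\int d^3x\,\frac{e^{i(\mathbf{p}_i - \mathbf{p}_f)\cdot\mathbf{x}}}{r} \tag{6.41}$$

Putting $\mathbf{q} = \mathbf{p}_i - \mathbf{p}_f$ and carrying out the integration, we find

$$I = \int d^3x\,\frac{e^{i\mathbf{q}\cdot\mathbf{x}}}{r} = 2\pi\int_0^\infty dr\int_{-1}^{1}dz\; r^2\,\frac{e^{iqrz}}{r} = \frac{2\pi}{iq}\int_0^\infty dr\,\left(e^{iqr} - e^{-iqr}\right) \tag{6.42a}$$

The integral diverges formally, but we may consider that the potential has a damping factor $e^{-\mu r}$ due to the screening effect of the surrounding matter. Then

$$I = \frac{2\pi}{iq}\left(\frac{1}{\mu - iq} - \frac{1}{\mu + iq}\right) = \frac{4\pi}{q^2 + \mu^2} \to \frac{4\pi}{q^2} \tag{6.42b}$$

Inserting Eq. (6.42b) into Eq. (6.41)

$$\langle f|H_{IS}|i\rangle = \left(\frac{Ze^2}{4\pi V}\right)\frac{4\pi}{q^2} = \frac{4\pi Z\alpha}{V}\frac{1}{q^2}, \qquad \alpha = \frac{e^2}{4\pi} \simeq \frac{1}{137} \tag{6.43a}$$

$$q^2 = |\mathbf{p}_i - \mathbf{p}_f|^2 = 4p^2\sin^2\frac{\theta}{2} \tag{6.43b}$$

where $\theta$ is the angle between $\mathbf{p}_i$ and $\mathbf{p}_f$, and $|\mathbf{p}_i| = |\mathbf{p}_f| = p$ by energy conservation. The final state density is calculated to be

$$\rho = \delta(E_i - E_f)\,\frac{V p^2\,dp\,d\Omega}{(2\pi)^3} \to \frac{V p^2\,d\Omega}{(2\pi)^3}\frac{dp}{dE} = \frac{V m p\,d\Omega}{(2\pi)^3} \tag{6.44}$$

where we used $E = p^2/2m$ and integrated over $E$. Inserting Eqs. (6.43) and (6.44) into Eq. (6.1a)

$$W_{fi} = 2\pi\left(\frac{4\pi Z\alpha}{V}\right)^2\left(\frac{1}{q^2}\right)^2\frac{V m p\,d\Omega}{(2\pi)^3} = \frac{4Z^2\alpha^2}{q^4}\frac{m p}{V}\,d\Omega \tag{6.45}$$

To convert the transition probability into a cross section, it has to be divided by incoming flux. The flux is the number of particles passing through unit area in unit time. When the wave function is normalized to one particle in the volume $V$, the incoming flux can be given as $v/V$, where $v = p/m$ is the velocity of the particle,

$$\therefore\ \frac{d\sigma}{d\Omega} = \frac{4Z^2\alpha^2 m^2}{q^4} = \frac{Z^2\alpha^2 m^2}{4p^4\sin^4(\theta/2)} \tag{6.46}$$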

The normalization volume disappears from the final expression as it should. This 
is the famous Rutherford scattering formula and coincides with that derived clas- 
sically. 
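Two steps of this derivation lend themselves to a quick numerical check: the Fourier transform of the screened Coulomb potential, which after the angular integration reduces to $(4\pi/q)\int_0^\infty \sin(qr)\,e^{-\mu r}dr = 4\pi/(q^2+\mu^2)$, and the equality of the two forms of Eq. (6.46). The sketch below is only an illustration in natural units; the function names, the truncation radius $40/\mu$ and the sample values of $q$, $\mu$, $Z$, $p$, $m$ are assumptions chosen for the example.

```python
import math

ALPHA = 1.0 / 137.036  # fine-structure constant (approximate value)

def simpson(f, a, b, n):
    """Composite Simpson rule on [a, b] with n (even) subintervals."""
    if n % 2:
        n += 1
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def screened_coulomb_transform(q, mu, n=100000):
    """(4*pi/q) * integral of sin(q r) exp(-mu r) dr, truncated at r = 40/mu."""
    r_max = 40.0 / mu
    radial = simpson(lambda r: math.sin(q * r) * math.exp(-mu * r), 0.0, r_max, n)
    return 4.0 * math.pi * radial / q

def dsigma_domega(Z, p, m, theta):
    """Rutherford cross section in the angular form of Eq. (6.46)."""
    return (Z * ALPHA * m) ** 2 / (4.0 * p**4 * math.sin(theta / 2.0) ** 4)

def dsigma_domega_from_q(Z, p, m, theta):
    """Same cross section written as 4 Z^2 alpha^2 m^2 / q^4."""
    q2 = 4.0 * p**2 * math.sin(theta / 2.0) ** 2
    return 4.0 * (Z * ALPHA * m) ** 2 / q2**2

q, mu = 2.0, 0.5
numeric = screened_coulomb_transform(q, mu)
analytic = 4.0 * math.pi / (q**2 + mu**2)
print(numeric, analytic)

Z, p, m = 79, 0.1, 0.938  # illustrative values, energies in GeV
print(dsigma_domega(Z, p, m, math.pi / 3), dsigma_domega_from_q(Z, p, m, math.pi / 3))
```

The ratio of the cross section at $\theta = \pi$ to that at $\theta = \pi/2$ is $1/4$, which reflects the characteristic $1/\sin^4(\theta/2)$ angular dependence.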



6.4

Relativistic Kinematics

6.4.1 

Center of Mass Frame and Laboratory Frame 

In dealing with reactions of elementary particles, there are two commonly used reference frames. For theoretical treatments and for collider accelerator experiments, the center of mass (CM) frame, in which total momentum is zero, is used. Before the advent of colliders, most experiments were of the type in which a beam extracted from an accelerator hits a target at rest. This is referred to as the laboratory (LAB for short) frame, and most nuclear and some particle experiments today are of this type. The two frames are connected by a Lorentz transformation, and relations that will become useful later are derived here. Let us consider two-body reactions $1 + 2 \to 3 + 4$. We assign numbers to particles as follows: incoming (1), target (2), scattered (3) and recoiled (4), and denote 4-momenta and masses as $p_1, p_2, p_3, p_4, m_1, m_2, m_3, m_4$. We indicate variables in the CM frame by an asterisk. In the laboratory frame, they are expressed as

$$p_1 = (E_1, \mathbf{p}_1), \quad p_2 = (m_2, \mathbf{0}), \quad p_3 = (E_3, \mathbf{p}_3), \quad p_4 = (E_4, \mathbf{p}_4) \tag{6.47}$$

and in the CM frame

$$p_1^* = (E_1^*, \mathbf{p}^*), \quad p_2^* = (E_2^*, -\mathbf{p}^*), \quad p_3^* = (E_3^*, \mathbf{p}'^*), \quad p_4^* = (E_4^*, -\mathbf{p}'^*) \tag{6.48}$$

Graphically they are expressed as in Fig. 6.1. By energy-momentum conservation

$$p_1 + p_2 = p_3 + p_4 \tag{6.49}$$

Figure 6.1 Two-body scattering in (a) the laboratory frame and (b) the center of mass frame.

Consider the 1 + 2 particle system as composing a single particle having $E_{\rm tot} = E_1 + E_2$ and $\mathbf{p}_{\rm tot} = \mathbf{p}_1 + \mathbf{p}_2$; then in the CM frame the energy $W = E^*_{\rm tot}$ can be considered as the mass of the two-body system because $\mathbf{p}^*_{\rm tot} = 0$ by definition. The Lorentz factors $\beta$ and $\gamma$ to transform the LAB frame to the CM frame can be obtained by applying one-body formulae ($\beta = p/E$, $\gamma = E/m$) to the system, i.e.

$$\beta = \frac{p_{\rm tot}}{E_{\rm tot}} = \frac{p_1}{E_1 + m_2}, \qquad \gamma = \frac{1}{\sqrt{1 - \beta^2}} = \frac{E_1 + m_2}{W} \tag{6.50a}$$

$$W = (m_1^2 + 2E_1 m_2 + m_2^2)^{1/2} \tag{6.50b}$$

Now that we know the Lorentz factors to connect variables in the two frames, we can apply the standard Lorentz transformation formula to each particle:

$$E_{\rm LAB} = \gamma\,(E^* + \beta\,p^*_\parallel) \tag{6.51a}$$

$$p_{\parallel\,{\rm LAB}} = \gamma\,(\beta E^* + p^*_\parallel), \qquad p_{\perp\,{\rm LAB}} = p^*_\perp \tag{6.51b}$$



where p\\, p± are components of the momentum along and perpendicular to the 
z-axis (direction of CM frame motion): 



An important point here is that the transverse components are not affected by the Lorentz transformation. This fact is extensively utilized in analyzing particles, for example sorting b-quarks out of many-particle jets. 2)

Problem 6.2

Prove that particles that are distributed isotropically in the CM frame ($dN/d\Omega^* = {\rm constant}$) are distributed uniformly in energy in the LAB frame ($dN/dE = {\rm constant}$).

Although Eqs. (6.52) are the bases for connecting variables in the two frames, it is more convenient in actual calculations to use the Lorentz-invariant Mandelstam variables, defined as follows:

$$s \equiv (p_1 + p_2)^2 = (E_1 + E_2)^2 - (\mathbf{p}_1 + \mathbf{p}_2)^2 \tag{6.53a}$$

$$t \equiv (p_1 - p_3)^2 = (E_1 - E_3)^2 - (\mathbf{p}_1 - \mathbf{p}_3)^2 \tag{6.53b}$$

$$u \equiv (p_1 - p_4)^2 = (E_1 - E_4)^2 - (\mathbf{p}_1 - \mathbf{p}_4)^2 \tag{6.53c}$$

As they are Lorentz scalars, their values do not change between frames. For instance,

$$s = \begin{cases} (E_1^* + E_2^*)^2 & : {\rm CM} \\ (E_1 + m_2)^2 - \mathbf{p}_1^2 = m_1^2 + 2E_1 m_2 + m_2^2 & : {\rm LAB} \end{cases} \tag{6.54}$$

As $s = W^2$, $s$ can be considered as the invariant mass (squared) of the two-body system. Writing $s$ in terms of CM variables

$$\sqrt{s} = W = E_1^* + E_2^* = \sqrt{p^{*2} + m_1^2} + \sqrt{p^{*2} + m_2^2} \tag{6.55}$$

2) B hadrons, which contain a $b$ (or $\bar b$) quark, have mass around $m_B \sim 5$ GeV. When they are produced in high-energy collisions, they typically appear as jets in which many hadrons fly more or less in the same direction having high longitudinal momentum but limited transverse momentum ($\lesssim m_B/2$) relative to the jet axis.

we can solve the CM energy in terms of $s$

$$E_1^* = \frac{s + m_1^2 - m_2^2}{2\sqrt{s}}, \qquad E_2^* = \frac{s - m_1^2 + m_2^2}{2\sqrt{s}} \tag{6.56a}$$

$$p^* = \sqrt{E_1^{*2} - m_1^2} = \frac{\lambda(s, m_1^2, m_2^2)}{2\sqrt{s}} = \frac{\sqrt{s}}{2}\,\lambda(1, x_1, x_2) \tag{6.56b}$$

$$\lambda(x, y, z) = (x^2 + y^2 + z^2 - 2xy - 2yz - 2zx)^{1/2} \tag{6.56c}$$

$$x_1 = m_1^2/s, \qquad x_2 = m_2^2/s \tag{6.56d}$$
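The relations (6.50) and (6.56) can be cross-checked against an explicit Lorentz boost. The sketch below is a hedged illustration for a fixed-target configuration: the function names and the sample values (a 10 GeV proton beam on a proton at rest) are assumptions for the example, and `kallen_sqrt` follows the convention of Eq. (6.56c), which includes the square root.

```python
import math

def kallen_sqrt(x, y, z):
    """lambda(x, y, z) as defined in Eq. (6.56c), including the square root."""
    return math.sqrt(x * x + y * y + z * z - 2 * x * y - 2 * y * z - 2 * z * x)

def fixed_target_to_cm(E1, m1, m2):
    """Beam of energy E1 and mass m1 on a target of mass m2 at rest."""
    p1 = math.sqrt(E1**2 - m1**2)
    s = m1**2 + 2.0 * E1 * m2 + m2**2          # Eq. (6.54), LAB form
    W = math.sqrt(s)
    beta = p1 / (E1 + m2)                      # Eq. (6.50a)
    gamma = (E1 + m2) / W
    return {
        "s": s, "W": W, "beta": beta, "gamma": gamma,
        "E1_star": (s + m1**2 - m2**2) / (2.0 * W),         # Eq. (6.56a)
        "E2_star": (s - m1**2 + m2**2) / (2.0 * W),
        "p_star": kallen_sqrt(s, m1**2, m2**2) / (2.0 * W),  # Eq. (6.56b)
        # Direct boost of the beam particle from LAB to CM:
        "E1_boost": gamma * (E1 - beta * p1),
        "p1_boost": gamma * (p1 - beta * E1),
    }

M_PROTON = 0.938272  # GeV
kin = fixed_target_to_cm(10.0, M_PROTON, M_PROTON)
for key, value in kin.items():
    print(f"{key:9s} = {value:.6f}")
```

The boosted beam energy and momentum should coincide with $E_1^*$ and $p^*$ from the invariant formulas, and $E_1^* + E_2^*$ should equal $W$.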

$|t|$ is referred to as the momentum transfer (squared) and can be written down similarly

$$t = (E_1 - E_3)^2 - (\mathbf{p}_1 - \mathbf{p}_3)^2 = m_1^2 + m_3^2 - 2(E_1 E_3 - |\mathbf{p}_1||\mathbf{p}_3|\cos\theta) \quad : {\rm LAB} \tag{6.57}$$

$\theta$ is the scattering angle and is defined by $\cos\theta = (\mathbf{p}_1\cdot\mathbf{p}_3)/|\mathbf{p}_1||\mathbf{p}_3|$. In most high-energy experiments we are interested in $E \gg m$, $E \simeq p = |\mathbf{p}|$, then

$$t \simeq -2p_1 p_3(1 - \cos\theta) \simeq -4E_1 E_3\sin^2\frac{\theta}{2} \tag{6.58}$$

$t$ has the same form either in LAB or in CM. The role of $u$ is similar to $t$, except particle 3 is replaced by particle 4. In CM, $t$ and $u$ are expressed in the relativistic approximation ($E \gg m$),

$$t \simeq -2p^*p'^*(1 - \cos\theta^*) \tag{6.59a}$$

$$u \simeq -2p^*p'^*(1 + \cos\theta^*) \tag{6.59b}$$

From the above formula, we see that $s$ gives the total energy, while $t$, $u$ give the measure of the scattering angle. When the masses can be neglected, i.e. $s \gg m^2$, the ranges of the variables are $s \ge 0$, $t \le 0$, $u \le 0$.

Problem 6.3

Show that $s$, $t$, $u$ are not independent but satisfy the relation

$$s + t + u = \sum_{i=1}^{4} m_i^2 \tag{6.60}$$
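A numerical spot check (not a proof) of the relation in Problem 6.3 can be made by constructing explicit 4-momenta in the CM frame. The sketch below is illustrative: the helper names, the masses and the values of $s$ and $\theta^*$ are assumptions chosen for the example, with the beam along the $z$-axis and scattering in the $x$-$z$ plane.

```python
import math

def minkowski_sq(p):
    """Invariant square E^2 - |p|^2 of a 4-vector (E, px, py, pz)."""
    E, px, py, pz = p
    return E * E - px * px - py * py - pz * pz

def two_body_cm(s, ma, mb):
    """CM energies and common momentum of two particles with total energy sqrt(s)."""
    W = math.sqrt(s)
    Ea = (s + ma**2 - mb**2) / (2.0 * W)
    Eb = (s - ma**2 + mb**2) / (2.0 * W)
    p = math.sqrt(max(Ea**2 - ma**2, 0.0))
    return Ea, Eb, p

def mandelstam_tu(s, masses, theta):
    """Return (t, u, p*, p'*) for 1 + 2 -> 3 + 4 at CM scattering angle theta."""
    m1, m2, m3, m4 = masses
    E1, E2, p = two_body_cm(s, m1, m2)
    E3, E4, pp = two_body_cm(s, m3, m4)
    p1 = (E1, 0.0, 0.0, p)
    p3 = (E3, pp * math.sin(theta), 0.0, pp * math.cos(theta))
    p4 = (E4, -pp * math.sin(theta), 0.0, -pp * math.cos(theta))
    t = minkowski_sq(tuple(a - b for a, b in zip(p1, p3)))
    u = minkowski_sq(tuple(a - b for a, b in zip(p1, p4)))
    return t, u, p, pp

# Inelastic example with four different (illustrative) masses, in GeV
masses = (0.140, 0.938, 0.494, 1.189)
s, theta = 6.0, 1.0
t, u, p, pp = mandelstam_tu(s, masses, theta)
mass_sum = sum(m * m for m in masses)
print(s + t + u, mass_sum)

# High-energy elastic example: t ~ -2 p* p'* (1 - cos theta*)
t_he, u_he, p_he, pp_he = mandelstam_tu(1.0e4, (0.140, 0.938, 0.140, 0.938), theta)
print(t_he, -2.0 * p_he * pp_he * (1.0 - math.cos(theta)))
```

For elastic scattering the CM expression for $t$ in Eq. (6.59a) is exact, not only a high-energy approximation, because $E_1^* = E_3^*$ and $p^* = p'^*$.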



6.4.2 

Crossing Symmetry 

The physical meanings of the three variables are apparently different, but considering the fact that mathematically antiparticles appear as having negative energy-momentum and as backward-traveling particles, we can show that the roles of $s$, $t$ and $u$ are equal and symmetric. Take the example of $\pi$-proton elastic scattering (Fig. 6.2a)

$$\pi^+(p_1) + p(p_2) \to \pi^+(p_3) + p(p_4) \tag{6.61}$$

By changing the variables

$$p_1 \to -p_1, \qquad p_4 \to -p_4 \tag{6.62}$$

which we call "crossing", the physical meaning of the scattering is modified to

$$\bar p(p_4) + p(p_2) \to \pi^-(p_1) + \pi^+(p_3) \tag{6.63}$$

By the change in Eq. (6.62) the Mandelstam variables are changed to

$$s = (p_1 + p_2)^2 = (-p_4 - p_3)^2 \to u_t \equiv (p_4 - p_3)^2 \simeq -2p_t^2(1 + \cos\theta_t) \tag{6.64a}$$

$$t = (p_1 - p_3)^2 = (-p_4 + p_2)^2 \to s_t \equiv (p_4 + p_2)^2 \simeq 4p_t^2 \tag{6.64b}$$

$$u = (p_1 - p_4)^2 \to t_t \equiv (p_4 - p_1)^2 \simeq -2p_t^2(1 - \cos\theta_t) \tag{6.64c}$$

$p_t$ represents the momentum in the $\bar p p$ CM frame, and $\theta_t$ is the angle that the outgoing $\pi^-$ makes with the incoming antiproton $\bar p$. Here, $t \ge 0$, $s, u \le 0$ and $t \equiv s_t$, $u = t_t$, $s = u_t$ play the role of $s$, $t$, $u$ in reaction (6.61). The reactions (6.61) and (6.63) are referred to as s-channel and t-channel reactions, respectively. Similarly, the u-channel reaction (Fig. 6.2c) can be defined. Thus, the three variables $s$, $t$, $u$ can be negative as well as positive and, depending on their value, represent three different reactions. This means that if the scattering amplitudes are written in terms of $s$, $t$, $u$, other crossed-channel reactions are obtained immediately by crossing. Figure 6.3 shows the three physical regions in an oblique coordinate frame, for the case where the pion mass is $\mu$ and the proton mass $M$.



Figure 6.2 Crossing symmetry. Mutual relations of three different scatterings.

Figure 6.3 Three physical regions … represents $(s, t, u)$ by its signed perpendicular …



6.5

Relativistic Cross Section



6.5.1 
Transition Rate 



We first derive the relativistic version of Eq. (6.1a) ($W_{fi} = 2\pi|T_{fi}|^2\rho$). The mean transition probability per unit time is given by Eq. (6.13), so referring to Eq. (6.35)

$$|S_{fi} - \delta_{fi}|^2 = |-i(2\pi)^4\delta^4(p_i - p_f)M_{fi}|^2$$

$$= (2\pi)^4\delta^4(p_i - p_f)\left[\int d^4x\,e^{i(p_i - p_f)\cdot x}\right]|M_{fi}|^2$$

$$= (2\pi)^4\delta^4(p_i - p_f)\left[\int d^4x\right]|M_{fi}|^2 \tag{6.65}$$

$$= (2\pi)^4\delta^4(p_i - p_f)\,VT\,|M_{fi}|^2$$

Here, one of the two $\delta$-functions was first expressed as an integral and we used the fact that the total integrated volume $VT$ is large on a microscopic scale but finite on a macroscopic scale. The transition rate per unit time per unit volume is obtained by dividing Eq. (6.65) by $VT$ and multiplying by the final state density $\rho_f$.

$$W_{fi} = (2\pi)^4\delta^4(p_i - p_f)\,|\langle f|M|i\rangle|^2\,\rho_f \tag{6.66}$$

Then the cross section is obtained by dividing $W_{fi}$ by the incoming flux and target particle density.



6.5.2 

Relativistic Normalization 

In the relativistic treatment, the usual normalization condition 

$$\int \rho(x)\, dV = 1 \tag{6.67}$$

is unsatisfactory. For flying particles the volume element $dV$ is Lorentz contracted, and the space integral in the rest frame overestimates the volume by $\gamma = E/m$. It is better to adopt a normalization that is proportional to $\gamma$. Therefore, starting from this section, we adopt relativistic normalization, which we choose to set

$$\int \rho\, dV = 2E \tag{6.68}$$

Writing plane-wave solutions obeying the Klein-Gordon equation as $\varphi = N e^{-ip\cdot x}$ and those obeying the Dirac equation as $\psi = N u_r(p) e^{-ip\cdot x}$, the normalization conditions for both of them are

$$\int \rho\, d^3x = \int d^3x\, i\left[\varphi^* \dot\varphi - \dot\varphi^* \varphi\right] = 2E N^2 V \quad \text{boson} \tag{6.69a}$$

$$\int \rho\, d^3x = \int d^3x\, \psi^\dagger \psi = N^2 V u_r^\dagger(p) u_r(p) = 2E N^2 V \quad \text{fermion} \tag{6.69b}$$

In both cases, they give $N = 1/\sqrt{V}$. The condition Eq. (6.68) means the wave functions of the Dirac and Klein-Gordon fields are normalized as

$$f_p(x) = \frac{1}{\sqrt{V}}\, e^{-ip\cdot x} \quad \text{boson} \tag{6.70a}$$

$$U_{pr}(x) = u_r(p) f_p(x) \quad \text{fermion} \tag{6.70b}$$

In the above treatment we integrated the density over a large but finite volume V = 
L 3 , which in turn means that the momentum is discrete. If we impose periodic 
boundary conditions for the plane wave, 

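$$e^{ip_x(x+L)} = e^{ip_x x} \;\to\; p_x L = 2\pi n_x, \quad n_x = 0, \pm1, \pm2, \cdots$$

$$\Delta n_x = \frac{L}{2\pi}\Delta p_x, \quad \Delta n_y = \frac{L}{2\pi}\Delta p_y, \quad \Delta n_z = \frac{L}{2\pi}\Delta p_z \tag{6.71}$$

then the number of wave states in the volume is given by

$$\mathbf{p} = (p_x, p_y, p_z), \quad p_i L = 2\pi n_i, \quad n_i = 0, \pm1, \pm2, \cdots \tag{6.72a}$$

$$\Delta n = \left(\frac{L}{2\pi}\right)^3 \Delta^3 p \tag{6.72b}$$

The discrete sum becomes an integral in the large volume limit:

$$\sum_{\mathbf{p}} \to \left(\frac{L}{2\pi}\right)^3 \int d^3p = V \int \frac{d^3p}{(2\pi)^3} \tag{6.73}$$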
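The replacement of the discrete sum by an integral can be checked numerically. The following is a minimal sketch, assuming a cubic box of side `L` with periodic boundary conditions; the function names `count_box_states` and `continuum_states` are illustrative and not from the text. It counts the allowed momenta inside a sphere $|\mathbf{p}| \le p_{\max}$ and compares the count with $V \int d^3p/(2\pi)^3$.

```python
import math


def count_box_states(p_max, L):
    """Count states p = 2*pi*n/L (n an integer triple) with |p| <= p_max."""
    radius = p_max * L / (2 * math.pi)  # sphere radius in units of n
    n_max = math.ceil(radius)
    count = 0
    for nx in range(-n_max, n_max + 1):
        for ny in range(-n_max, n_max + 1):
            for nz in range(-n_max, n_max + 1):
                if nx * nx + ny * ny + nz * nz <= radius * radius:
                    count += 1
    return count


def continuum_states(p_max, L):
    """V * integral of d^3p / (2*pi)^3 over the sphere |p| <= p_max."""
    volume = L ** 3
    return volume * (4.0 / 3.0) * math.pi * p_max ** 3 / (2 * math.pi) ** 3


if __name__ == "__main__":
    L = 1.0
    for radius in (5.5, 10.5, 20.5):
        p_max = 2 * math.pi * radius / L
        exact = count_box_states(p_max, L)
        approx = continuum_states(p_max, L)
        print(radius, exact, round(approx, 1), round(exact / approx, 4))
```

The ratio approaches 1 as the sphere contains more lattice points, which is the content of the large-volume limit.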
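The integral of the plane waves in discrete or continuous format is

$$\frac{1}{V}\int d^3x\, e^{i(\mathbf{p}-\mathbf{p}')\cdot \mathbf{x}} = \delta_{\mathbf{p}\mathbf{p}'} \;\to\; \frac{(2\pi)^3}{V}\,\delta^3(\mathbf{p}-\mathbf{p}')$$

$$\frac{1}{L}\int_{-L/2}^{L/2} dx\, e^{i(p_x - p'_x)x} = \begin{cases} 1 & (p_x = p'_x) \\ 0 & (p_x \neq p'_x) \end{cases}, \qquad p_x L = 2\pi n \quad (n = 0, \pm1, \pm2, \cdots)$$

The one-particle state orthogonality and completeness relations are

$$\langle p'|p\rangle = (2\pi)^3\, 2E_p\, \delta^3(\mathbf{p}'-\mathbf{p})$$

$$\int \frac{d^3p}{(2\pi)^3\, 2E_p}\, |p\rangle\langle p| = 1$$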



The commutator of the discrete creation/annihilation operators becomes in the 



Then the one-particle state is expressed in terms of 

$$|p\rangle = \sqrt{2E}\, a^\dagger(p)|0\rangle \tag{6.77}$$

Having defined normalizations with finite $V$, we set the volume to 1 from now on unless otherwise noted, because it always disappears from the final expression for measurable quantities.³⁾
With the above definitions, the expansion formulae for the fields now become 

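$$
\begin{aligned}
\varphi(x) &= \sum_k \frac{1}{\sqrt{2\omega}} \left[ a_k e^{-ik\cdot x} + b_k^\dagger e^{ik\cdot x} \right] \\
&= \int \frac{d^3k}{(2\pi)^3 \sqrt{2\omega}} \left[ a_k e^{-ik\cdot x} + b_k^\dagger e^{ik\cdot x} \right] \quad : \text{scalar}
\end{aligned}
\tag{6.78a}
$$

$$\psi(x) = \int \frac{d^3p}{(2\pi)^3 \sqrt{2E_p}} \sum_r \left[ a_r(p)\, u_r(p)\, e^{-ip\cdot x} + b_r^\dagger(p)\, v_r(p)\, e^{ip\cdot x} \right] \quad : \text{Dirac} \tag{6.78b}$$

$$A^\mu(x) = \int \frac{d^3k}{(2\pi)^3 \sqrt{2\omega}} \sum_\lambda \left[ a_{k,\lambda}\, \varepsilon^\mu(k,\lambda)\, e^{-ik\cdot x} + a_{k,\lambda}^\dagger\, \varepsilon^\mu(k,\lambda)^*\, e^{ik\cdot x} \right] \quad : \text{photon} \tag{6.78c}$$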



3) We shall prove this for the case of the cr 



The matrix elements of the field between the vacuum and one-particle state appear 
frequently and we give them here: 

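$$\langle 0|\varphi(x)|k\rangle = e^{-ik\cdot x}, \qquad \langle k|\varphi^\dagger(x)|0\rangle = e^{ik\cdot x} \tag{6.79a}$$

$$\langle 0|\psi(x)|p,r\rangle = u_r(p)\, e^{-ip\cdot x}, \qquad \langle p,r|\bar\psi(x)|0\rangle = \bar u_r(p)\, e^{ip\cdot x} \tag{6.79b}$$

$$\langle 0|A^\mu(x)|k,\lambda\rangle = \varepsilon^\mu(k,\lambda)\, e^{-ik\cdot x}, \qquad \langle k,\lambda|A^\mu(x)|0\rangle = \varepsilon^\mu(k,\lambda)^*\, e^{ik\cdot x} \tag{6.79c}$$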

Note: As we said, taking the continuous limit does not necessarily mean $V$ is infinite. Therefore $\delta^3(0)$ should be interpreted as

$$\delta^3(0) = \frac{1}{(2\pi)^3}\int d^3x = \frac{V}{(2\pi)^3}$$



6.5.3 

Incoming Flux and Final State Density 



In the relativistic normalization, the final state density of one particle is given by

$$\rho_f = \frac{V\, d^3p}{(2\pi)^3\, 2E} \tag{6.82}$$

for each particle in the final state. Let us name the incoming particle 1, the target particle 2 and assume $N_f$ particles are produced in the final state. The incoming flux is given by $2E_1 v_{12}/V$, where $v_{12}$ is the incoming velocity relative to the target, and the target particle density is given by $2E_2/V$. Then the cross section for the reaction becomes

$$d\sigma = \frac{V^2}{2E_1\, 2E_2\, v_{12}}\, |M_{fi}|^2\, (2\pi)^4 \delta^4\Big(p_1 + p_2 - \sum_{f=1}^{N_f} p_f\Big) \prod_{f=1}^{N_f} \frac{V\, d^3p_f}{(2\pi)^3\, 2E_f} \tag{6.83}$$



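We define Lorentz-invariant flux $F$ by

$$
\begin{aligned}
F &= 4E_1E_2 v_{12} = 4E_1E_2 \left| \frac{\mathbf{p}_1}{E_1} - \frac{\mathbf{p}_2}{E_2} \right| = \begin{cases} 4 p_1 W & \text{CM} \\ 4 p_1 m_2 & \text{LAB} \end{cases} \\
&= 4\left[ (p_1\cdot p_2)^2 - m_1^2 m_2^2 \right]^{1/2} = 2s\,\lambda(1, x_1, x_2)
\end{aligned}
\tag{6.84}
$$

$$\lambda(x, y, z) = (x^2 + y^2 + z^2 - 2xy - 2yz - 2zx)^{1/2}, \qquad x_1 = m_1^2/s, \quad x_2 = m_2^2/s$$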
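The frame independence of $F$ can be verified numerically. The sketch below assumes natural units and a target at rest in LAB; the function names and the sample masses (roughly a pion on a proton, in GeV) are illustrative choices, not values from the text. It evaluates $F = 2s\,\lambda(1, x_1, x_2)$ and compares it with the LAB expression $4 p_1 m_2$.

```python
import math


def kallen_sqrt(x, y, z):
    """lambda(x, y, z) = (x^2 + y^2 + z^2 - 2xy - 2yz - 2zx)^(1/2)."""
    return math.sqrt(x * x + y * y + z * z - 2 * x * y - 2 * y * z - 2 * z * x)


def s_from_lab(p1, m1, m2):
    """Mandelstam s for a beam of momentum p1 on a target at rest."""
    e1 = math.sqrt(p1 * p1 + m1 * m1)
    return m1 * m1 + m2 * m2 + 2 * e1 * m2


def flux_invariant(s, m1, m2):
    """F = 2 s lambda(1, m1^2/s, m2^2/s)."""
    return 2.0 * s * kallen_sqrt(1.0, m1 * m1 / s, m2 * m2 / s)


def flux_lab(p1, m2):
    """F = 4 p1 m2 in the frame where particle 2 is at rest."""
    return 4.0 * p1 * m2


if __name__ == "__main__":
    p1, m1, m2 = 1.0, 0.1396, 0.9383  # illustrative values in GeV
    s = s_from_lab(p1, m1, m2)
    print(flux_invariant(s, m1, m2), flux_lab(p1, m2))
```

Both expressions agree to rounding error, as they must for a Lorentz-invariant quantity.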




As the state vector contains a factor $1/\sqrt{V}$ for each particle, $M_{fi}$ produces a total of $(N_f + 2)$ factors of $1/\sqrt{V}$. Then naturally, all the volume factors $V$ cancel and disappear from the expression for $d\sigma$. As $V$ always disappears from the final expression for measurable quantities, there is no harm in setting $V = 1$.

6.5.4 

Lorentz-Invariant Phase Space



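We introduce Lorentz-invariant phase space by

$$d\mathrm{LIPS} = \frac{1}{m!}\,(2\pi)^4 \delta^4\Big(\sum_i p_i - \sum_f p_f\Big) \prod_f \frac{d^3p_f}{(2\pi)^3\, 2E_f} \tag{6.85}$$

where $m!$ is a correction factor if identical particles are included in the state. Inserting Eqs. (6.84) and (6.85) into Eq. (6.83), we obtain the cross-section formula

$$d\sigma = \frac{|M_{fi}|^2}{F}\, d\mathrm{LIPS} \tag{6.86}$$

The decay rate for particle 2 into many particles, $d\Gamma(2 \to 3 + 4 + \cdots)$, is obtained by dividing the transition rate $W_{fi} = W(2 \to 3 + 4 + \cdots)$ by the flux of particle 2 in its rest frame, $2m_2$:

$$d\Gamma = \frac{|M_{fi}|^2}{2m_2}\, d\mathrm{LIPS} \tag{6.87}$$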

Equations (6.86) and (6.87) are the relativistic version of Fermi's golden rule 
Eq. (6.1a). 

6.5.5 

Cross Section in the Center of Mass Frame 

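Let us calculate the cross section for two-body scattering

$$1 + 2 \to 3 + 4$$

in the CM frame. Using Eq. (6.83), the cross section for two-body scattering is given by

$$d\sigma = \frac{1}{4E_1E_2 v_{12}}\, |M_{fi}|^2\, (2\pi)^4 \delta^4(p_1 + p_2 - p_3 - p_4)\, \frac{d^3p_3}{(2\pi)^3\, 2E_3}\, \frac{d^3p_4}{(2\pi)^3\, 2E_4} \tag{6.88}$$

In CM, $\mathbf{p}_1 + \mathbf{p}_2 = 0$, $E_1 + E_2 = W$. Writing $p_i = |\mathbf{p}_1| = |-\mathbf{p}_2|$, $p_f = |\mathbf{p}_3| = |-\mathbf{p}_4|$,

$$v_{12} = \left| \frac{\mathbf{p}_1}{E_1} - \frac{\mathbf{p}_2}{E_2} \right| = \frac{p_i}{E_1} + \frac{p_i}{E_2} = \frac{p_i W}{E_1E_2} \tag{6.89a}$$

$$\therefore\; F = 4E_1E_2 v_{12} = 4 p_i W \tag{6.89b}$$

$$d\mathrm{LIPS}_{\rm CM} = \frac{1}{(4\pi)^2}\, \delta(W - E_3 - E_4)\, \delta^3(\mathbf{p}_3 + \mathbf{p}_4)\, \frac{d^3p_3\, d^3p_4}{E_3E_4} \tag{6.89c}$$

$\int d^3p_4$ gives 1. Then to carry out the $p_3$ integration, we use the relation $\delta(G)\, dp_f = (\partial G/\partial p_f)^{-1}\, \delta(G)\, dG$:

$$G = E_3 + E_4 - W = \sqrt{p_f^2 + m_3^2} + \sqrt{p_f^2 + m_4^2} - W \tag{6.89d}$$

$$\frac{\partial G}{\partial p_f} = \frac{p_f}{E_3} + \frac{p_f}{E_4} = \frac{p_f W}{E_3E_4} \tag{6.89e}$$

$$\frac{d\mathrm{LIPS}}{F} = \frac{1}{4 p_i W} \left( \frac{\partial G}{\partial p_f} \right)^{-1} \frac{p_f^2\, d\Omega_{\rm CM}}{16\pi^2 E_3E_4} = \frac{1}{64\pi^2 s}\, \frac{p_f}{p_i}\, d\Omega_{\rm CM} \tag{6.89f}$$

We obtain

$$\left. \frac{d\sigma}{d\Omega} \right|_{\rm CM} = \frac{p_f}{p_i}\, \frac{|M_{fi}|^2}{64\pi^2 s} \tag{6.90}$$

Problem 6.4

Prove that the cross section in LAB for the case $m_1 = m_3 = 0$, $m_2 = m_4 = M$ is given by

$$\left. \frac{d\sigma}{d\Omega} \right|_{\rm LAB} = \left( \frac{p_3}{p_1} \right)^2 \frac{|M_{fi}|^2}{64\pi^2 M^2} \tag{6.91}$$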


Problem 6.5 

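Sometimes it is useful to express the cross section in terms of the Lorentz-invariant variable $t$ rather than the scattering angle $\cos\theta$. Assuming $m_1 = m_3 = m$, $m_2 = m_4 = M$, show that

$$dt = 2 p_1 p_3\, d\cos\theta = \frac{p_1 p_3}{\pi}\, d\Omega \simeq \frac{(s - M^2)^2}{4\pi s}\, d\Omega_{\rm CM} \tag{6.92a}$$

$$\frac{d\sigma}{dt} = \frac{\pi}{p_1 p_3} \left. \frac{d\sigma}{d\Omega} \right|_{\rm CM} = \frac{|M_{fi}|^2}{16\pi\, \lambda(s, m^2, M^2)^2} \simeq \frac{|M_{fi}|^2}{16\pi (s - M^2)^2} \tag{6.92b}$$

where the last equation in both lines holds when $s \gg m^2$.
Hint: Use

$$t = q^2 = (p_1 - p_3)^2 = 2m^2 - 2(E_1E_3 - p_1 p_3 \cos\theta) \quad \text{in any frame} \tag{6.93}$$

and in CM

$$p_1 = p_3 = \frac{\sqrt{s}}{2}\, \lambda\!\left( 1, \frac{m^2}{s}, \frac{M^2}{s} \right) \simeq \frac{s - M^2}{2\sqrt{s}} \tag{6.94}$$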
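Equation (6.93) can be checked against a direct four-vector computation. The sketch below assumes elastic scattering in the CM frame with the beam along the z axis and the scattering plane chosen as x-z; the function names are illustrative only.

```python
import math


def cm_momentum(s, m, M):
    """CM momentum for masses m and M at total energy squared s."""
    return math.sqrt((s - (m + M) ** 2) * (s - (m - M) ** 2)) / (2 * math.sqrt(s))


def t_from_formula(s, m, M, cos_theta):
    """Eq. (6.93) with E1 = E3 and p1 = p3 in the CM frame."""
    p = cm_momentum(s, m, M)
    e = math.sqrt(p * p + m * m)
    return 2 * m * m - 2 * (e * e - p * p * cos_theta)


def t_from_four_vectors(s, m, M, cos_theta):
    """t = (p1 - p3)^2 built from explicit four-vectors (E, px, py, pz)."""
    p = cm_momentum(s, m, M)
    e = math.sqrt(p * p + m * m)
    sin_theta = math.sqrt(1.0 - cos_theta * cos_theta)
    p1 = (e, 0.0, 0.0, p)
    p3 = (e, p * sin_theta, 0.0, p * cos_theta)
    d = [a - b for a, b in zip(p1, p3)]
    return d[0] ** 2 - d[1] ** 2 - d[2] ** 2 - d[3] ** 2


if __name__ == "__main__":
    print(t_from_formula(4.0, 0.14, 0.94, 0.3), t_from_four_vectors(4.0, 0.14, 0.94, 0.3))
```

In this elastic CM case both reduce to $t = -2p^2(1 - \cos\theta)$, which is non-positive for all physical angles.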






Vertex Functions and the Feynman Propagator 

In Sect. 6.3 we derived an explicit formula for the scattering matrix that is the 
relativistic version of Eq. (6.1b): 

$$\langle f|T|i\rangle = \langle f|H_I|i\rangle + \sum_n \frac{\langle f|H_I|n\rangle \langle n|H_I|i\rangle}{E_i - E_n} \tag{6.95}$$

$\langle f|H_I|i\rangle$ is the first-order matrix element of the interaction Hamiltonian, but in Feynman diagrams, which will appear in abundance in the following, it is referred to as the vertex function and $1/(E_i - E_n)$ as the propagator, the relativistic version of which is referred to as the Feynman propagator. It is necessary as well as instructive to learn beforehand how the elements $\langle f|H_I|i\rangle$ and $1/(E_i - E_n)$ that appear in the above formula are expressed in quantized field theory. To calculate the vertex function, we need an explicit form of the interaction Hamiltonian. In the following, we mainly discuss the electromagnetic interaction.

6.6.1 

$ee\gamma$ Vertex Function

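As was described in Chap. 4, the interaction of electrons ($q = -e$) with the electromagnetic field is given by the gauge principle, which is to replace derivatives by covariant derivatives, $\partial_\mu \to D_\mu = \partial_\mu - ieA_\mu$. This replacement produces the interaction Hamiltonian density $\mathcal{H}_I$ for a Dirac particle

$$\mathcal{H}_I = -\mathcal{L}_{\rm int} = q\bar\psi\gamma^\mu\psi A_\mu = q j^\mu A_\mu \tag{6.96}$$

Similarly, the interaction Hamiltonian for the complex scalar field is given by

$$\mathcal{L} = \partial_\mu\varphi^\dagger \partial^\mu\varphi - m^2\varphi^\dagger\varphi \;\Rightarrow\; \mathcal{L} + \mathcal{L}_I$$

$$\mathcal{H}_I = -\mathcal{L}_I = qA_\mu \left[ i\left( \varphi^\dagger \partial^\mu\varphi - \partial^\mu\varphi^\dagger\, \varphi \right) \right] - q^2 A_\mu A^\mu \varphi^\dagger\varphi \tag{6.97}$$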

Consider a transition from an initial state with an electron having the energy-momentum $p$ and spin state $r$ to a final state with an electron having $p', s$ and a photon with $k$ and polarization $\lambda$. The expectation value or the matrix element of the Hamiltonian is given by

$$\langle p's; k\lambda|\mathcal{H}_I(x)|pr\rangle = -e\, \langle p's|\bar\psi(x)\gamma^\mu\psi(x)|pr\rangle\, \langle k\lambda|A_\mu(x)|0\rangle \tag{6.98}$$

Here, we have separated the electron and photon, because their Fock spaces are independent of each other. We have also expressed the sign of the charge explicitly ($q = -e$). The expectation value of the electromagnetic field is readily obtained using Eq. (6.78c)

$$\langle k\lambda|A_\mu(x)|0\rangle = \varepsilon_\mu(k,\lambda)^*\, e^{ik\cdot x} \tag{6.99}$$



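We use Eq. (6.78b) for the expression of $\psi$, $\bar\psi$. Considering $|pr\rangle \sim a^\dagger_{p,r}|0\rangle$, $\langle p's| \sim \langle 0|a_{p's}$, the expectation value of the current $\sim \bar\psi\psi$ contains a term

$$\langle 0|a_{p'}\, a_s^\dagger\, a_t\, a_p^\dagger|0\rangle \tag{6.100}$$

where we have used the abbreviated notation $p', p, t, s$ to denote both momentum and spin states:

$$
\begin{aligned}
a_t a_p^\dagger|0\rangle &= \left[ \{a_t, a_p^\dagger\} - a_p^\dagger a_t \right]|0\rangle = (2\pi)^3 \delta^3(t - p)|0\rangle \\
\langle 0|a_{p'} a_s^\dagger &= \langle 0|\left[ \{a_{p'}, a_s^\dagger\} - a_s^\dagger a_{p'} \right] = \langle 0|(2\pi)^3 \delta^3(s - p')
\end{aligned}
\tag{6.101}
$$

In deriving the last equation of both lines, we used the fact that the annihilation (creation) operator acting on the vacuum state from the left (right) annihilates the states. Therefore

$$\langle 0|a_{p'}\, a_s^\dagger\, a_t\, a_p^\dagger|0\rangle = (2\pi)^6\, \delta^3(t - p)\, \delta^3(s - p') \tag{6.102}$$

Taking into account the normalization factor ($\sqrt{2E_p}\cdots$) for the states and integrating over $t, s$ (note we are also taking the sum over spin)

$$\langle p's|\bar\psi(x)\gamma^\mu\psi(x)|pr\rangle = \bar u_s(p')\gamma^\mu u_r(p)\, e^{-i(p - p')\cdot x} \tag{6.103}$$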

Therefore 

(a) $e^- \to e^- + \gamma$

$$\langle p's; k\lambda|\mathcal{H}_I|pr\rangle = -e\, \bar u_s(p')\gamma^\mu u_r(p)\, \varepsilon_\mu(k,\lambda)^*\, e^{-i(p - p' - k)\cdot x} \tag{6.104a}$$

This matrix element corresponds to diagram (a) of Fig. 6.4. Notice the sign of the energy-momentum that appears in the exponent. The sign of the energy-momentum $p$ of an incoming and annihilated particle in the exponent is negative and that of outgoing or created particles is positive. As the interaction can happen anywhere in space-time, the matrix element will be integrated over space-time, producing $\delta$-functions. Note, the action is an integral of the Lagrangian over all space-time. Namely, the energy and the momentum are conserved at each interaction point or at the "vertex" of the diagram:

$$E = E' + \omega, \qquad \mathbf{p} = \mathbf{p}' + \mathbf{k} \tag{6.104b}$$

In a similar way, the diagram in Fig. 6.4b corresponds to

(b) $\gamma + e^- \to e^-$

$$\langle p's|\mathcal{H}_I|pr, k\lambda\rangle = -e\, \bar u_s(p')\gamma^\mu u_r(p)\, \varepsilon_\mu(k,\lambda)\, e^{-i(p - p' + k)\cdot x} \tag{6.104c}$$

Next, we consider the interaction of an antiparticle. Corresponding to the diagram in Fig. 6.4c,

(c) $e^+ \to e^+ + \gamma$

$$\langle p's; k\lambda|\mathcal{H}_I|pr\rangle = +e\, \bar v_r(p)\gamma^\mu v_s(p')\, \varepsilon_\mu(k,\lambda)^*\, e^{-i(p - p' - k)\cdot x} \tag{6.105a}$$
Figure 6.4 Various processes of the electron-photon interaction in the … given by $\mathcal{L}_{\rm int} = -qA_\mu j^\mu$. Time goes upward; a line with an upward-going arrow designates a particle and one with a downward arrow, an antiparticle.

Here again, the sign of the energy-momentum in the exponent is ($-$) for incoming and ($+$) for outgoing particles and there is no distinction between the particle and antiparticle. The remnant of the statement that the antiparticle goes backward in time is in the order of $\bar v_r(p)\gamma^\mu v_s(p')$. Notice also that the sign of the electric charge is correctly given, i.e. opposite to the electron, as it must be. The contribution to the matrix element is a product of $b_s \bar v(s)$, which is in the positive-frequency part of $\bar\psi$, and $b_t^\dagger v(t)$, which is in the negative-frequency part of $\psi$:

$$\langle p'|b_s b_t^\dagger|p\rangle = \langle p'|\{b_s, b_t^\dagger\} - b_t^\dagger b_s|p\rangle = \langle p'|(2\pi)^3 \delta^3(s - t) - b_t^\dagger b_s|p\rangle$$

The second term gives the above matrix element with the correct negative sign. The negative sign results from the anticommutation relation, but it also produces the first term, an unwanted constant contribution, which gives a diverging integral. This first term can be treated in a similar manner as the constant term in the Hamiltonian, namely such an unphysical term is to be subtracted at the very beginning of the calculation. The technique is to assume that terms like $b_s b_t^\dagger$ in the bilinear expression $\bar\psi\psi$, in which the annihilation operator comes to the left of the creation operator, are already ordered like $-b_t^\dagger b_s$ from the beginning. Operators arranged in such a manner are called "normal ordered" or "N product" in quantized field theory. We shall assume that the interaction Hamiltonian is normal ordered.⁴⁾ The diagram in Fig. 6.4d is quite similar to Fig. 6.4b with appropriate changes ($u_r(p) \to v_s(p')$, $\bar u_s(p') \to \bar v_r(p)$, $-e \to +e$):

(d) $\gamma + e^+ \to e^+$

$$\langle p's|\mathcal{H}_I|pr, k\lambda\rangle = +e\, \bar v_r(p)\gamma^\mu v_s(p')\, \varepsilon_\mu(k,\lambda)\, e^{-i(p - p' + k)\cdot x} \tag{6.105b}$$



4) The normal ordering is a formal manipulation to avoid unwanted constants, which are often 
infinite. Whether this is physically allowed is a matter to be treated carefully. For instance, the 
vacuum energy that was inherent in quantized harmonic oscillators can be eliminated, but we 
have seen that its existence is experimentally established by the Casimir effect [92]. 



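Figures 6.4e,f, which describe pair creation or annihilation by the photon, are expressed as

(e) $\gamma \to e^- + e^+$

$$\langle pr, p's|\mathcal{H}_I|k\lambda\rangle = -e\, \bar u_r(p)\gamma^\mu v_s(p')\, \varepsilon_\mu(k,\lambda)\, e^{i(p + p' - k)\cdot x} \tag{6.106a}$$

(f) $e^- + e^+ \to \gamma$

$$\langle k\lambda|\mathcal{H}_I|pr, p's\rangle = -e\, \bar v_s(p')\gamma^\mu u_r(p)\, \varepsilon_\mu(k,\lambda)^*\, e^{-i(p + p' - k)\cdot x} \tag{6.106b}$$

Finally Figs. 6.4g,h, which describe total transition to vacuum and vice versa, are expressed as

(g) $\gamma + e^- + e^+ \to$ vacuum

$$\langle 0|\mathcal{H}_I|pr, p's; k\lambda\rangle = -e\, \bar v_s(p')\gamma^\mu u_r(p)\, \varepsilon_\mu(k,\lambda)\, e^{-i(p + p' + k)\cdot x} \tag{6.107a}$$

(h) vacuum $\to e^- + e^+ + \gamma$

$$\langle pr, p's; k\lambda|\mathcal{H}_I|0\rangle = -e\, \bar u_r(p)\gamma^\mu v_s(p')\, \varepsilon_\mu(k,\lambda)^*\, e^{i(p + p' + k)\cdot x} \tag{6.107b}$$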



Virtual Particles 

In quantum field theory, we describe most physical phenomena in terms of parti- 
cles. However, in the case of interacting electromagnetic fields, for instance a static 
Coulomb field between two charged particles, no particles are emitted, yet we talk 
of the force as the result of particle exchange. What is the difference between a real 
photon, which appears as a quantum of freely propagating waves, and the quantum 
of the particles exchanged between two interacting objects? The freely propagating 
wave satisfies the Klein-Gordon equation in vacuum. Then Einstein's relation

$$E^2 = \mathbf{p}^2 + m^2 \quad \text{particle} \tag{6.108a}$$

should hold for the field quanta. A particle satisfying Eq. (6.108) is often said to be "on the mass shell" or simply "on shell". The electric field that is produced by a point charged particle satisfies

$$\partial_\mu\partial^\mu A^0(x) = e\,\delta^3(\mathbf{r}) \;\to\; k^2 \neq 0 \tag{6.109}$$

The quantum of the interacting field, therefore, does not satisfy Einstein's rela- 
tion. It is said to be "off shell" or a virtual particle. It is not a physical particle and 
cannot be observed as such. The reader is reminded that for the virtual particle, 
the ordinary physical picture does not apply. For instance, if the force is created 
by exchange of a real particle, it can only give a repulsive force; no attractive force 
can result because emission or absorption of a particle having momentum always 
causes the party involved to recoil. Another example: the real photon has only two 
degrees of freedom in its polarization. Since the polarization is a vector in space- 
time, it needs four components in a relativistic treatment. Artificially (mathemati- 
cally) introduced extra quanta (longitudinal and scalar photons) appear only in the 
intermediate state of a process. They show peculiar properties such as negative probability, superluminal velocity⁵⁾ and nonzero mass.

The vertices described above generally connect initial states to intermediate and 
final states. Initial and final states that are prepared by experimental set ups are 
composed of real (on mass shell) particles that satisfy the Einstein relation $E = \sqrt{\mathbf{p}^2 + m^2}$, $\omega$ (photon) $= |\mathbf{k}|$, or in relativistic notation $p^2 = m^2$, $k^2 = 0$. This means that once the momentum $\mathbf{p}$ of a particle is specified, its energy is fixed, too. At the vertex, both the energy and the momentum are conserved in a relativistic treatment, which in many cases does not allow all the particles to be "on shell"; one or two or all of the states are off mass shell. An exception is an initial massive particle (say a $\pi$) converting to two less-massive particles ($\mu + \nu$), where all the particles could be "on shell" if energy-momentum conservation is satisfied.
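The statement that not all particles at the $ee\gamma$ vertex can be on shell is easy to see numerically. The sketch below is an illustration under stated assumptions: the initial electron is taken at rest, the final electron is on shell with momentum `p_out` along z, and the photon four-momentum is fixed by conservation, $k = p - p'$. The function name is hypothetical.

```python
import math


def photon_virtuality(m, p_out):
    """k^2 of the photon in e -> e + gamma when both electrons are on shell.

    The initial electron is at rest, p = (m, 0, 0, 0); the final electron has
    p' = (E', 0, 0, p_out) with E' = sqrt(p_out^2 + m^2); k = p - p'.
    """
    e_out = math.sqrt(p_out * p_out + m * m)
    k0 = m - e_out
    kz = -p_out
    return k0 * k0 - kz * kz


if __name__ == "__main__":
    m_e = 0.511  # electron mass in MeV
    for p_out in (0.0, 0.1, 1.0, 10.0):
        print(p_out, photon_virtuality(m_e, p_out))
```

Analytically $k^2 = 2m^2 - 2mE' \le 0$, and it vanishes only for $p' = 0$, where no photon is emitted at all. A photon with nonzero momentum emitted at this vertex is therefore necessarily space-like, i.e. off shell.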

6.6.2 

Feynman Propagator 

We have so far studied only free fields, which satisfy a certain equation of motion, 
a homogeneous differential equation. An interacting field, which is created by a 
source, obeys an inhomogeneous differential equation. A formal solution to the 
interacting field is given using a Green's function. In quantized field theories this 
is the role played by the Feynman propagator. It appears everywhere and is an 
indispensable tool for the calculation of scattering matrices. The definition and the 
properties of the Feynman propagator are given below. 

Feynman Propagator of the Photon 

The Feynman propagator of the photon is defined as⁶⁾

$$iD_{F\mu\nu}(x - y) \equiv \langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle = \begin{cases} \langle 0|A_\mu(x)A_\nu(y)|0\rangle & t_x > t_y \\ \langle 0|A_\nu(y)A_\mu(x)|0\rangle & t_x < t_y \end{cases} \tag{6.110}$$

$T(\cdots)$ is the "T product", defined in Eq. (6.32). Inserting a unit operator $1 = \sum |n\rangle\langle n|$ and considering that $A_\mu$ has matrix elements only between states differing in particle number by 1, only one-particle intermediate states differing in momentum or spin contribute. Inserting Eq. (6.75) and considering

$$\langle 0|A_\mu(x)|q\lambda\rangle = \varepsilon_\mu(\lambda)\, e^{-iq\cdot x} \tag{6.111a}$$

$$\langle q\lambda|A_\nu(y)|0\rangle = \varepsilon_\nu(\lambda)^*\, e^{iq\cdot y} \tag{6.111b}$$

$$\sum_\lambda \varepsilon_\mu(\lambda)\, \varepsilon_\nu(\lambda)^* \to -g_{\mu\nu} \quad \text{see Eq. (5.147b)} \tag{6.111c}$$

5) The space-like momentum $k^2 < 0$ means its (formal) velocity is larger than the velocity of light.

6) It is customary to write $D_F$ for particles with $m = 0$ and $\Delta_F$ when $m \neq 0$. This applies to other functions, e.g. $\Delta$, $\Delta_{\rm ret}$, too.


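we can express the Feynman propagator as

$$\langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle = -g_{\mu\nu} \int \frac{d^3q}{(2\pi)^3\, 2\omega} \left\{ \theta(x - y)\, e^{-iq\cdot(x - y)} + \theta(y - x)\, e^{iq\cdot(x - y)} \right\} \tag{6.112a}$$

$$\theta(t) = \begin{cases} 1 & t > 0 \\ 0 & t < 0 \end{cases} \tag{6.112b}$$

where $\theta(x - y)$ stands for $\theta(x^0 - y^0)$. The step function can be expressed by a contour integral in the complex plane:

$$\theta(t) = \frac{1}{2\pi i} \int_{-\infty}^{\infty} dz\, \frac{e^{izt}}{z - i\varepsilon} \tag{6.113}$$

where $i\varepsilon$ is a small imaginary part prescribed to handle the singularity. Then, writing $t = x^0 - y^0$ and $\mathbf{x}$ for $\mathbf{x} - \mathbf{y}$,

$$\langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle = \frac{-g_{\mu\nu}}{i(2\pi)^4} \int \frac{d^3q}{2\omega} \int dz \left[ \frac{e^{-i(\omega - z)t + i\mathbf{q}\cdot\mathbf{x}}}{z - i\varepsilon} + \frac{e^{i(\omega - z)t - i\mathbf{q}\cdot\mathbf{x}}}{z - i\varepsilon} \right] \tag{6.114a}$$

$$= \frac{-i g_{\mu\nu}}{(2\pi)^4} \int d^4q\, \frac{e^{-iq\cdot(x - y)}}{q^2 + i\varepsilon} \tag{6.114b}$$

In summary, the Feynman photon propagator is expressed as an integral

$$\langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle = iD_{F\mu\nu}(x - y) = \frac{-i g_{\mu\nu}}{(2\pi)^4} \int d^4q\, \frac{e^{-iq\cdot(x - y)}}{q^2 + i\varepsilon} \tag{6.115}$$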

It is a vacuum expectation value of two field operators that create a photon at $y$ and annihilate it at $x$, and is referred to as a two-point function of the field. In the language of classical field theory, it is the effect of the electromagnetic field created by a source (a charged particle) at a point $y$, acting on a charged particle at point $x$. That the time ordering is considered suggests the role of a Green's function. We shall see that the Feynman propagator $D_F$ is indeed a Green's function of the electromagnetic field that satisfies the equation

$$\partial_\mu\partial^\mu A^\nu(x) = q j^\nu(x) \tag{6.116}$$

Green's Function 

Let us start from the simplest relativistic field $\phi(x)$ that satisfies a spin-less Klein-Gordon equation. The field produced by a source obeys the equation given by

$$\left[ \partial_\mu\partial^\mu + m^2 \right] \phi = \rho(x) \tag{6.117}$$

where $\rho$ is a source to produce the field. A formal solution to the equation is expressed using a Green's function $G(x, y)$, which satisfies

$$\phi(x) = \phi_0(x) - \int d^4y\, G(x, y)\, \rho(y) \tag{6.118a}$$

$$(\partial_\mu\partial^\mu + m^2)\, G(x, y) = -\delta^4(x - y) \tag{6.118b}$$

$$(\partial_\mu\partial^\mu + m^2)\, \phi_0(x) = 0$$

The physical meaning of the Green's function is that the field at space-time point $x$ is given by the action of the field created by a source at space-time point $y$ that then propagates to point $x$.

The boundary conditions imposed on $G$ are most frequently translationally invariant, so $G$ can be regarded as a function of $x - y$. Then it can be solved using Fourier transformation:

$$G(x - y) = \frac{1}{(2\pi)^4} \int d^4q\, e^{-iq\cdot(x - y)}\, \tilde G(q) \tag{6.119a}$$

$$\delta^4(x - y) = \frac{1}{(2\pi)^4} \int d^4q\, e^{-iq\cdot(x - y)} \tag{6.119b}$$

Inserting Eq. (6.119) into Eq. (6.118b) we obtain

$$\tilde G(q) = \frac{1}{q^2 - m^2}, \qquad q^2 = (q^0)^2 - \mathbf{q}^2, \quad \omega = \sqrt{\mathbf{q}^2 + m^2} \tag{6.120}$$

The Green's function is expressed as

$$G(x, y) = \frac{1}{(2\pi)^4} \int d^4q\, \frac{e^{-iq\cdot(x - y)}}{q^2 - m^2} \tag{6.121}$$

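The integration on $q^0$ can be interpreted as a contour integral in the complex plane, but a rule has to be established to handle the singularity. The retarded and advanced Green's functions can be defined by requiring $G(x - y) = 0$ for $t_x < t_y$ and $t_x > t_y$, respectively. For $t_x > t_y$ ($t_x < t_y$), the contour can be closed in the lower (upper) semi-circle; therefore, adding a small imaginary part, $q^0 \to q^0 \pm i\varepsilon$, gives the desired effect:

$$G_{\rm ret/adv}(x - y) = \frac{1}{(2\pi)^4} \int d^4q\, \frac{e^{-iq\cdot(x - y)}}{(q^0 \pm i\varepsilon)^2 - \omega^2} = \mp i\, \theta\big(\pm(x^0 - y^0)\big) \int \frac{d^3q}{(2\pi)^3\, 2\omega} \left[ e^{-iq\cdot(x - y)} - e^{iq\cdot(x - y)} \right] \tag{6.122a}$$

$$= \begin{cases} \theta(x^0 - y^0)\, \Delta(x - y) \\ -\theta(y^0 - x^0)\, \Delta(x - y) \end{cases} \tag{6.122b}$$

$$\Delta(x) = -i \int \frac{d^4k}{(2\pi)^3}\, \varepsilon(k^0)\, \delta(k^2 - m^2)\, e^{-ik\cdot x}$$

where $\varepsilon(k^0) = \pm1$ for $k^0 \gtrless 0$. In relativistic quantum theory, we have learned that the negative energy contribution should be interpreted as a particle proceeding backward in time. Then the contribution of the negative energy should vanish for $t_x > t_y$.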

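To make $e^{-iq\cdot x}$ an ordinary propagating wave when $q^0 = \omega > 0$ and a backward-traveling wave when $q^0 = -\omega < 0$, we have to close the integral path in such a way as to include the pole at $q^0 = \omega$ when $t > 0$ and $q^0 = -\omega$ when $t < 0$. This can be achieved by adding or subtracting a small imaginary number $i\varepsilon$ to the denominator. Using

$$\frac{1}{q^2 - m^2 + i\varepsilon} = \frac{1}{2\omega} \left[ \frac{1}{q^0 - \omega + i\varepsilon} - \frac{1}{q^0 + \omega - i\varepsilon} \right] \tag{6.123a}$$

the Green's function for the quantized Klein-Gordon field can be expressed as

$$G_F(x - y) = -i \int \frac{d^3q}{(2\pi)^3\, 2\omega} \left[ \theta(x^0 - y^0)\, e^{-iq\cdot(x - y)} + \theta(y^0 - x^0)\, e^{iq\cdot(x - y)} \right] \tag{6.123b}$$

which is nothing but the Feynman propagator Eq. (6.112) except for the polarization sum factor $-g_{\mu\nu}$ and $i$. The Feynman propagator for a spinless scalar field is

$$\langle 0|T[\varphi(x)\varphi^\dagger(y)]|0\rangle = i\Delta_F(x - y) = \frac{i}{(2\pi)^4} \int d^4q\, \frac{e^{-iq\cdot(x - y)}}{q^2 - m^2 + i\varepsilon} \tag{6.124}$$

$$(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\, \Delta_F(x - y) = -\delta^4(x - y)$$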

Electron Propagator 

The propagator for the Dirac particle can be defined similarly: 

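$$iS_F(x - y) \equiv \langle 0|T[\psi(x)\bar\psi(y)]|0\rangle = \begin{cases} \langle 0|\psi(x)\bar\psi(y)|0\rangle & t_x > t_y \\ -\langle 0|\bar\psi(y)\psi(x)|0\rangle & t_x < t_y \end{cases} \tag{6.125}$$

The minus sign in the second line comes from the anticommutativity of the fermion field. Taking the sum over intermediate states, we obtain a similar expression as for the photon propagator except for the spin sum, which has to be done separately for particle and antiparticle. Using the spin sum expression Eq. (4.69) in Sect. 4.2,

$$
\begin{aligned}
\langle 0|T[\psi(x)\bar\psi(y)]|0\rangle &= \int \frac{d^3p}{(2\pi)^3\, 2E} \Big[ \theta(x - y) \sum_r u_r(p)\bar u_r(p)\, e^{-ip\cdot(x - y)} - \theta(y - x) \sum_r v_r(p)\bar v_r(p)\, e^{ip\cdot(x - y)} \Big] \\
&= (i\gamma^\mu\partial_\mu + m) \int \frac{d^3p}{(2\pi)^3\, 2E} \left[ \theta(x - y)\, e^{-ip\cdot(x - y)} + \theta(y - x)\, e^{ip\cdot(x - y)} \right] \\
&= \frac{i}{(2\pi)^4} \int d^4p\, \frac{(\not p + m)\, e^{-ip\cdot(x - y)}}{p^2 - m^2 + i\varepsilon}
\end{aligned}
\tag{6.126}
$$

The spin sum $(\not p \pm m)$ was replaced with derivatives in the penultimate line. This is permissible because the two time derivatives of the step functions compensate each other and vanish. The electron propagator Eq. (6.126) has exactly the same form as the photon propagator Eq. (6.115) except for the numerator, where the photon polarization sum is replaced by the electron spin sum when $t_x > t_y$ and minus the positron spin sum when $t_x < t_y$:

$$\tilde S_F(p) = \frac{\not p + m}{p^2 - m^2 + i\varepsilon} = \frac{1}{\not p - m + i\varepsilon} \tag{6.127}$$

The last equality is often used when one wants a simple expression, but its real meaning is defined by the preceding equation. Since it is easily shown that the electron propagator satisfies the equation

$$(\gamma^\mu i\partial_\mu - m)\, S_F(x, y) = \delta^4(x - y) \tag{6.128}$$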

it, too, is the Green's function for the Dirac equation. 

Relation with the Nonrelativistic Propagator 

Apart from the spin sum in the numerator, the Feynman propagator consists of two energy denominators:

$$\tilde D_F(q) \sim \frac{1}{q^2 + i\varepsilon} = \frac{1}{2\omega} \left[ \frac{1}{q^0 - \omega + i\varepsilon} + \frac{1}{-q^0 - \omega + i\varepsilon} \right] \tag{6.129}$$

It is instructive to consider how they are related to the nonrelativistic propagator $1/(E_i - E_n)$.




(b) t x < t y , and Compton scatter- 
ns both of them together. 

Let us consider the example of $e$-$\mu$ scattering via hypothetical spinless photon exchange. In the nonrelativistic treatment, the time sequence of the process has to be considered. There are two intermediate states, depending on which particle emits a photon first (see Fig. 6.5a,b). As the energy of the two intermediate states after emission of the photon is given by

$$E_n(a) = E_1 + E_4 + \omega, \qquad E_n(b) = E_2 + E_3 + \omega \tag{6.131a}$$

the virtual photon energy that the electron at point $x$ receives from the muon at point $y$ is given by

$$q^0 = \begin{cases} E_3 - E_1 = E_2 - E_4 > 0 & \text{case (a)} \\ E_3 - E_1 = E_2 - E_4 < 0 & \text{case (b)} \end{cases} \tag{6.131b}$$

The two denominators of the Feynman propagator can be rewritten as

$$q^0 - \omega = (E_2 - E_4) - \{E_n(a) - E_1 - E_4\} = E_1 + E_2 - E_n(a) = E_i - E_n(a) \tag{6.131c}$$

$$-(q^0 + \omega) = -(E_3 - E_1) - \{E_n(b) - E_2 - E_3\} = E_1 + E_2 - E_n(b) = E_i - E_n(b) \tag{6.131d}$$

which exactly reproduces the two energy denominators of the nonrelativistic treatment. The factor $1/2\omega$ arises from the relativistic normalization. Namely, the Feynman propagator contains both intermediate states by allowing the particle to take negative energy and interpreting it as propagating backward in time. Actually, it means the reaction of the electron back to the muon, and really there is no particle traveling backward in time.
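The bookkeeping in Eqs. (6.131a) to (6.131d) can be confirmed with a few lines of arithmetic. The sketch below uses arbitrary illustrative energies that satisfy $E_1 + E_2 = E_3 + E_4$; the function names are hypothetical, and the $i\varepsilon$ terms are dropped because the sample point is away from the poles.

```python
def nonrelativistic_denominators(e1, e2, e3, e4, omega):
    """E_i - E_n for the two time orderings, with E_n from Eq. (6.131a)."""
    e_i = e1 + e2
    en_a = e1 + e4 + omega
    en_b = e2 + e3 + omega
    return e_i - en_a, e_i - en_b


def feynman_denominators(e1, e3, omega):
    """The two denominators q0 - omega and -(q0 + omega), with q0 = E3 - E1."""
    q0 = e3 - e1
    return q0 - omega, -(q0 + omega)


def propagator_from_denominators(q0, omega):
    """(1 / 2 omega) [1/(q0 - omega) + 1/(-q0 - omega)], Eq. (6.129) without i*eps."""
    return (1.0 / (2.0 * omega)) * (1.0 / (q0 - omega) + 1.0 / (-q0 - omega))


if __name__ == "__main__":
    e1, e2, e3, e4, omega = 3.0, 5.0, 4.5, 3.5, 2.0  # E1 + E2 = E3 + E4
    print(nonrelativistic_denominators(e1, e2, e3, e4, omega))
    print(feynman_denominators(e1, e3, omega))
    q0 = e3 - e1
    print(propagator_from_denominators(q0, omega), 1.0 / (q0 ** 2 - omega ** 2))
```

The two pairs of denominators coincide, and their weighted sum reproduces the single covariant denominator $1/((q^0)^2 - \omega^2)$.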

Negative energy is meant to represent an antiparticle, too. In the case of the photon, there is no distinction between particle and antiparticle and the spin sum gives $(-g_{\mu\nu})$ in both cases. For the case of Compton scattering (see Fig. 6.5c,d), the Feynman propagator in (c) represents an electron created at $y$ proceeding to $x$ with positive energy, while that in (d) represents a particle proceeding with negative energy going backward in time. The latter is an antiparticle, i.e. positron, emitted at $x$ that proceeds normally to $y$. The expression for the energy denominator is the same as that of the photon propagator in $e$-$\mu$ scattering, but now the spin sum has to be done separately for the particle and antiparticle:

$$\tilde S_F(p) = \frac{1}{2E} \left[ \frac{\sum_{r=1,2} u_r(p)\bar u_r(p)}{p^0 - E + i\varepsilon} - \frac{\sum_{r=1,2} v_r(-p)\bar v_r(-p)}{-p^0 - E + i\varepsilon} \right] \tag{6.132}$$

where the minus sign in front of the $v(-p)\bar v(-p)$ comes from the Feynman prescription for the change of time order and the negative momentum comes from the change of the integration variable $\mathbf{p} \to -\mathbf{p}$. Recall that $u(p)\bar u(p)$ and $v(p)\bar v(p)$ appeared as the coefficient of $e^{-ip\cdot(x - y)}$ and $e^{ip\cdot(x - y)}$ originally.

Note: The intermediate photon in the above example has the energy-momentum $q^\mu = (q^0, \mathbf{q})$. Since $q = p_3 - p_1 = p_2 - p_4$, $q^2 \neq 0$, the photon is not on shell. On the other hand, intermediate states inserted to calculate the Feynman propagator are particles created by the creation operator of the free field and therefore are on shell in the sense that their energy $\omega$ is defined by the relation $\omega = |\mathbf{q}|$. Where did the difference come from? The fact is that $q^0$ that appears in the Feynman propagator is an integration variable obtained by setting $q^0 = \pm(\omega - z)$ in Eq. (6.114a). It is this $q^0$ that satisfies energy conservation and has nothing to do with $\omega$, the energy of the photon. Some people say that in the nonrelativistic treatment particles in the intermediate states are on shell but the energy is not conserved in the transition, and in the relativistic treatment the energy and momenta are conserved but the particles are not on shell. The reader should realize that the latter half of this statement is not correct, as is clear from the above argument.



6.7.1 

Cross Section 

A process in which an electron is scattered by an unmovable scattering center, typ- 
ically a heavy nucleus, is called Mott scattering. The target should be heavy enough 
for its recoil to be neglected. However, the electron should be energetic enough to 
be treated relativistically. The interaction Hamiltonian in this case is 

$$\mathcal{H}_I = q j^\mu A_\mu, \qquad j^\mu = \bar\psi\gamma^\mu\psi \tag{6.133}$$

As long as the recoil of the target can be neglected, there is no effect that the scattering electron can give to the target. Then the field created by the target is not disturbed and can be treated as an external source, namely as a classical c-number potential. Such external fields are, for example, a static Coulomb force [$A^\mu = (Zq'/4\pi r, 0, 0, 0)$] or an oscillating magnetic field [$A^\mu = (0, -yB\sin\omega t, xB\sin\omega t, 0)$]. The scattering matrix element is given by

$$S_{fi} - \delta_{fi} = -i \int d^4x\, \langle f|\mathcal{H}_I|i\rangle = -i \int d^4x\, \langle p_f s|q j^\mu|p_i r\rangle A_\mu \tag{6.134a}$$

$$A_\mu(x) = (Zq'/4\pi r, 0, 0, 0) \tag{6.134b}$$

where $p_i r$ and $p_f s$ denote the momentum and the spin state of the incoming and outgoing electrons, respectively. The matrix element of the Hamiltonian was calculated in Sect. 6.6.1:

$$\langle p_f s|q j^\mu(x)|p_i r\rangle = q \langle p_f s|\bar\psi(x)\gamma^\mu\psi(x)|p_i r\rangle = q\, \bar u_s(p_f)\gamma^\mu u_r(p_i)\, e^{-i(p_i - p_f)\cdot x} \tag{6.135}$$

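Inserting Eq. (6.135) and Eq. (6.134b) into Eq. (6.134a), we obtain

$$
\begin{aligned}
S_{fi} - \delta_{fi} &= -2\pi i\, \delta(E_i - E_f)\, q\, \bar u_s(p_f)\gamma^0 u_r(p_i) \int d^3x\, e^{i\mathbf{q}\cdot\mathbf{x}}\, V(x) \\
&= -2\pi i\, \delta(E_i - E_f)\, q\, \bar u_s(p_f)\gamma^0 u_r(p_i)\, \tilde V(\mathbf{q})
\end{aligned}
\tag{6.136}
$$

$$\tilde V(\mathbf{q}) = \int d^3x\, e^{i\mathbf{q}\cdot\mathbf{x}}\, V(x) = \frac{Zq'}{|\mathbf{q}|^2}, \qquad V(x) = \frac{Zq'}{4\pi r}, \quad \mathbf{q} = \mathbf{p}_i - \mathbf{p}_f \tag{6.137}$$

This expression, like that for Rutherford scattering, does not contain the $\delta$-function $\delta^3(\mathbf{p}_i - \mathbf{p}_f)$, i.e. the momentum is not conserved. This originates from the external field approximation that the target nucleus does not recoil.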

In order to obtain the cross section, we need to calculate $|\bar u_s(p_f)\gamma^0 u_r(p_i)|^2$. In this special case, where only the 0th component of the current contributes, it may be easier and faster to use the explicit expression for $u_r(p)$ given in Sect. 4.2, Eq. (4.63). But for later applications, we adopt a more general approach here. Under usual experimental conditions, the incoming particles are not polarized, and the final polarization is not measured either. In that case, we take the average (take the sum and divide by the degrees of freedom) of the initial-state polarization and the sum of the final-state polarization. Write the quantity $L^{\mu\nu}$ as

$$L^{\mu\nu} = \frac{1}{2} \sum_{r,s} \left[ \bar u_s(p_f)\gamma^\mu u_r(p_i) \right] \left[ \bar u_s(p_f)\gamma^\nu u_r(p_i) \right]^* \tag{6.138a}$$

$$\left[ \bar u_s(p_f)\Gamma u_r(p_i) \right]^* = \bar u_r(p_i)\bar\Gamma u_s(p_f) \tag{6.138b}$$

$$\bar\Gamma = \gamma^0\Gamma^\dagger\gamma^0 \tag{6.138c}$$

When $\Gamma = \gamma^\mu$, $\bar\Gamma = \gamma^\mu$,

$$L^{\mu\nu} = \frac{1}{2} \sum_{r,s} \left[ \bar u_s(p_f)\gamma^\mu u_r(p_i) \right] \left[ \bar u_r(p_i)\gamma^\nu u_s(p_f) \right] \tag{6.139}$$

Now we use the properties of the Dirac spinors

$$\sum_r u_r(p)\bar u_r(p) = \not p + m, \qquad \sum_r v_r(p)\bar v_r(p) = \not p - m \tag{6.140}$$

and insert them in Eq. (6.139) to obtain

$$L^{\mu\nu} = \frac{1}{2} \mathrm{Tr}\left[ \gamma^\mu(\not p_i + m)\gamma^\nu(\not p_f + m) \right] \tag{6.141}$$

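The above trace of the $\gamma$ matrix can be calculated by making use of the properties of $\gamma$ matrices (see Appendix C),

$$\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] = 0 \tag{6.142a}$$

$$\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_n}] = (-1)^n\, \mathrm{Tr}[\gamma^{\mu_n}\cdots\gamma^{\mu_2}\gamma^{\mu_1}] \tag{6.142b}$$

$$\mathrm{Tr}[\gamma^\mu\gamma^\nu] = 4g^{\mu\nu} \tag{6.142c}$$

$$\mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = 4(g^{\mu\nu}g^{\rho\sigma} + g^{\mu\sigma}g^{\nu\rho} - g^{\mu\rho}g^{\nu\sigma}) \tag{6.142d}$$

Then the final expression for $L^{\mu\nu}$ is

$$
\begin{aligned}
L^{\mu\nu} &= 2\left[ \{p_f^\mu p_i^\nu + p_f^\nu p_i^\mu - (p_f\cdot p_i)\, g^{\mu\nu}\} + m^2 g^{\mu\nu} \right] \\
&= 2\left[ p_f^\mu p_i^\nu + p_f^\nu p_i^\mu + \frac{q^2}{2}\, g^{\mu\nu} \right]
\end{aligned}
\tag{6.143a}
$$

$$q^2 = (p_i - p_f)^2 = 2m^2 - 2(p_f\cdot p_i) \simeq -2 p_f p_i (1 - \cos\theta) \qquad (p_i, p_f \gg m) \tag{6.143b}$$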
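The trace in Eq. (6.141) and its closed form Eq. (6.143a) can be compared numerically. The sketch below is only an illustration: it assumes the Dirac representation of the $\gamma$ matrices, uses plain nested lists instead of a matrix library, and all helper names (`slash`, `lepton_tensor_trace`, `lepton_tensor_closed`) are invented for this example.

```python
import math

I2 = [[1, 0], [0, 1]]
ZERO2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices
METRIC = [1, -1, -1, -1]  # diagonal of g^{mu nu}


def scale(m, k):
    return [[k * x for x in row] for row in m]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]


# Dirac representation: gamma^0 = diag(1, 1, -1, -1), gamma^i = [[0, s_i], [-s_i, 0]].
GAMMA = [block(I2, ZERO2, ZERO2, scale(I2, -1))]
GAMMA += [block(ZERO2, s, scale(s, -1), ZERO2) for s in SIGMA]
IDENTITY4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]


def slash(p):
    """p-slash = gamma^0 p^0 - gamma^i p^i for contravariant p = (E, px, py, pz)."""
    out = scale(GAMMA[0], p[0])
    for i in (1, 2, 3):
        out = add(out, scale(GAMMA[i], -p[i]))
    return out


def lepton_tensor_trace(mu, nu, p_i, p_f, m):
    """Eq. (6.141): (1/2) Tr[gamma^mu (p_i-slash + m) gamma^nu (p_f-slash + m)]."""
    a = add(slash(p_i), scale(IDENTITY4, m))
    b = add(slash(p_f), scale(IDENTITY4, m))
    prod = matmul(matmul(matmul(GAMMA[mu], a), GAMMA[nu]), b)
    return 0.5 * sum(prod[i][i] for i in range(4))


def lepton_tensor_closed(mu, nu, p_i, p_f, m):
    """Eq. (6.143a): 2 [p_f^mu p_i^nu + p_f^nu p_i^mu + (q^2 / 2) g^{mu nu}]."""
    q = [x - y for x, y in zip(p_i, p_f)]
    q_sq = q[0] ** 2 - q[1] ** 2 - q[2] ** 2 - q[3] ** 2
    g = METRIC[mu] if mu == nu else 0
    return 2 * (p_f[mu] * p_i[nu] + p_f[nu] * p_i[mu] + 0.5 * q_sq * g)


if __name__ == "__main__":
    m, e, theta = 1.0, 2.0, math.radians(60)
    p = math.sqrt(e * e - m * m)
    p_i = (e, 0.0, 0.0, p)
    p_f = (e, p * math.sin(theta), 0.0, p * math.cos(theta))
    print(lepton_tensor_trace(0, 0, p_i, p_f, m), lepton_tensor_closed(0, 0, p_i, p_f, m))
```

For this elastic configuration the $\mu = \nu = 0$ component equals $4E^2(1 - v^2\sin^2(\theta/2))$, which is the factor that enters the Mott cross section.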



Problem 6.6

Using the properties of $\gamma$ matrices Eq. (4.36), derive Eq. (6.142).
Hint: To derive the first equation, use $(\gamma^5)^2 = 1$, $\gamma^5\gamma^\mu = -\gamma^\mu\gamma^5$.

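Now, going back to the Mott scattering cross section, only the $\mu = \nu = 0$ components contribute. Since the energy is conserved by virtue of the $\delta$-function, $E_i = E_f = E$, $|\mathbf{p}_i| = |\mathbf{p}_f| = p$, $v = p/E$. Then, using $q^2 = -4p^2\sin^2(\theta/2)$,

$$L^{00} = 2\left[ 2E^2 + \frac{q^2}{2} \right] = 4E^2\left( 1 - v^2\sin^2\frac{\theta}{2} \right) \tag{6.144}$$

The incoming flux and the final state density are given by

$$2E_i v = 2p_i = 2p \tag{6.145a}$$

$$\frac{d^3p_f}{(2\pi)^3\, 2E_f} = \left( \frac{dE}{dp} \right)^{-1} \frac{p^2\, dE\, d\Omega}{16\pi^3 E} = \frac{p\, dE\, d\Omega}{16\pi^3} \tag{6.145b}$$

Inserting Eqs. (6.144) and (6.145) into Eq. (6.137), we obtain

$$\frac{d\sigma}{d\Omega} = \frac{4Z^2\alpha^2 E^2}{q^4} \left( 1 - v^2\sin^2\frac{\theta}{2} \right) \tag{6.146}$$

which is called the Mott scattering formula. Comparing this with the Rutherford scattering formula (6.46), we see that the relativistic effect makes $m \to E$ or ($v = p/m \to p/E$), and the spin effect of the electron gives

$$1 - v^2\sin^2\frac{\theta}{2} \tag{6.147}$$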


Problem 6.7 

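Using the explicit solution Eq. (4.63), show that spin nonflip and spin flip cross sections are proportional to

$$d\sigma(\text{spin nonflip}) \propto \left[ 1 + \frac{p^2}{(E + m)^2} \right]^2 \cos^2\frac{\theta}{2} \tag{6.149a}$$

$$d\sigma(\text{spin flip}) \propto \left[ 1 - \frac{p^2}{(E + m)^2} \right]^2 \sin^2\frac{\theta}{2} \tag{6.149b}$$

and that their sum is equal to Eq. (6.147) apart from the normalization factor.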



Problem 6.8 

Using the current of the charged scalar particle 

$$q j^\mu = -ie\left( \varphi^\dagger \partial^\mu\varphi - \partial^\mu\varphi^\dagger\, \varphi \right) \tag{6.150}$$

derive the scattering formula for a given external Coulomb potential and show that 
it agrees with the first term of Eq. (6.146). 
6.7.2

Coulomb Scattering and Magnetic Scattering 

Scattering by the electromagnetic field can be divided into that by the electric field and that by the magnetic field. The Hamiltonian is written as

$$ H_I = q\phi - \boldsymbol\mu\cdot\boldsymbol B, \qquad \boldsymbol\mu = \frac{q}{2m}\,g\,\boldsymbol s $$ (6.151)

where $\boldsymbol s$ is the electron spin and $g = 2$ is the g-factor. Therefore, scattering by the electric field is a spin-nonflip process and that by the magnetic field is a spin-flip process. Mott scattering by a static Coulomb potential nevertheless contains the spin-flip term. This is because, viewed from the electron's rest frame, the target is moving and hence a magnetic field is induced. As is clear from Eq. (6.146), the magnetic contribution is proportional to $v^2$; it is small for nonrelativistic particles but becomes important and comparable to the electric contribution for relativistic particles.
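
The relative size of the two contributions can be tabulated with a few lines of Python. This is only a sketch: it assumes the spin factor of the Mott formula is the bracket $1 - v^2\sin^2(\theta/2)$ (the form that reduces to $\cos^2(\theta/2)$ at $v = 1$), and the function names `mott_spin_factor` and `magnetic_term` are illustrative, not taken from the text.

```python
import math

def mott_spin_factor(v, theta):
    """Bracket 1 - v^2 sin^2(theta/2) multiplying the spinless cross section.

    v is the electron speed in units of c, theta the scattering angle in radians.
    (Assumed form of the spin factor; see the lead-in.)
    """
    return 1.0 - (v * math.sin(theta / 2.0)) ** 2

def magnetic_term(v, theta):
    """Magnitude of the v^2 (magnetic) term inside the bracket."""
    return (v * math.sin(theta / 2.0)) ** 2

if __name__ == "__main__":
    # The magnetic term is negligible at small v and of order one as v -> 1.
    for v in (0.01, 0.5, 0.99):
        print(v, mott_spin_factor(v, math.pi / 2), magnetic_term(v, math.pi / 2))
```

At $v = 0.01$ the magnetic term is of order $10^{-4}$, while at $v \to 1$ it cancels the electric term completely for backward scattering.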

6.7.3 

Helicity Conservation 

In the high-energy limit, where the electron mass can be neglected, $v = 1$ and the Mott scattering cross section becomes

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm Mott} = \frac{4Z^2\alpha^2}{q^4}\,E^2\cos^2\frac{\theta}{2} $$ (6.152)

which differs from that of spinless particles by the factor $\cos^2(\theta/2)$. This is a helicity rotation effect. The reason is

1. the massless particle is a Weyl particle, and only polarizations along the direc- 
tion of its momentum are allowed (i.e. helicity eigenstates) and 

2. a vector-type interaction, like that of the electromagnetic force, conserves the 
chirality (see arguments in Sect. 4.3.5), i.e. helicity is conserved in the high- 
energy limit. 

The initial helicity is along the z-axis (parallel to the incoming momentum), and the final helicity is along the direction of the scattered particle at angle $\theta$. The matrix element of the rotation operator is given by $d^j_{j'_z j_z} = \langle j, j'_z|\exp(-i\theta J_y)|j, j_z\rangle$ and for the Mott case $j = 1/2$, $j_z = \pm 1/2$, $d^{1/2}_{1/2,1/2} = \cos(\theta/2)$. (See Appendix E for the rotation matrix.)
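
The rotation matrix element quoted above can be checked numerically. The sketch below exponentiates $-i\theta J_y$ with $J_y = \sigma_y/2$ by a plain Taylor series; the helper names (`matmul`, `expm`, `d_half`) and the choice of a truncated series are assumptions made for illustration, not the construction used in Appendix E.

```python
import math

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def expm(m, terms=40):
    """Matrix exponential by Taylor series (adequate for small matrices, |theta| of order pi)."""
    n = len(m)
    result = [[complex(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in matmul(term, m)]  # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

# J_y = sigma_y / 2 in the basis (j_z = +1/2, j_z = -1/2)
J_Y = [[0, -0.5j], [0.5j, 0]]

def d_half(theta):
    """2x2 matrix <j'_z| exp(-i theta J_y) |j_z> for j = 1/2."""
    return expm([[-1j * theta * x for x in row] for row in J_Y])

if __name__ == "__main__":
    theta = 1.1
    d = d_half(theta)
    print(d[0][0].real, math.cos(theta / 2))   # helicity-conserving element
    print(d[1][0].real, math.sin(theta / 2))   # helicity-flip element
```

The diagonal element reproduces $\cos(\theta/2)$, so its square gives the $\cos^2(\theta/2)$ factor of Eq. (6.152), which vanishes for backward scattering.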

6.7.4 

A Method to Rotate Spin 

The above argument inspires a method to rotate spin using low-energy Coulomb scattering. Place heavy nuclei around a circle and have a low-energy charged particle with longitudinal polarization start from one end of a diameter. Let it be scattered by a nucleus and detected at the other end of the diameter (see Fig. 6.6). As the scattering angle is 90° and the spin does not flip, the particle's polarization is rotated by 90°, and now the particle has transverse polarization, which is suitable for measurement. The method was used to decide the helicity of the decay electron from $\mu$ decay.

Figure 6.6 Converting longitudinal to transverse polarization using Mott scattering.



Yukawa Interaction 



The Yukawa interaction was first proposed to explain the nuclear force, which was claimed to be due to the exchange of a massive spin 0 meson. The interaction of a spin 1/2 particle with a scalar field is important both historically and for future applications. Historically, it offered the first viable explanation of low-energy nuclear interactions, i.e. strong interactions. For future applications, it is important for the interaction with the Higgs field, which generates the mass of all otherwise massless quarks and leptons. Here, we investigate low-energy nuclear scattering mediated by a scalar or a pseudoscalar meson. The relevant Lorentz-invariant interaction is of the type

$$ \mathcal L_{\rm INT} = -g\,\bar\psi\,\Gamma\,\psi\,\phi $$ (6.153)

where $\Gamma = 1$ if $\phi$ is a scalar and $\Gamma = i\gamma^5$ if $\phi$ is a pseudoscalar. We consider a process

$$ N_1(p_1) + N_2(p_2) \to N_1(p_3) + N_2(p_4) $$ (6.154)

and assume $N_1$ and $N_2$ are distinguishable. The masses of the nucleon and the meson are taken to be $m$ and $\mu$, respectively. The second-order scattering amplitude corresponding to the exchange of the $\phi$ meson is (Fig. 6.7)

Figure 6.7 Nucleon-nucleon interaction via a scalar meson exchange.

$$ S_{fi} - \delta_{fi} = -iT_{fi} $$
$$ = \langle p_3 p_4|\,\frac{(-i)^2}{2!}\,T\Big[g\!\int d^4x\,\bar\psi(x)\Gamma\psi(x)\phi(x)\times g\!\int d^4y\,\bar\psi(y)\Gamma\psi(y)\phi(y)\Big]|p_1 p_2\rangle $$ (6.155)
$$ = \frac{(-ig)^2}{2!}\Big[\langle p_3|\!\int d^4x\,\bar\psi(x)\Gamma\psi(x)|p_1\rangle\,\langle 0|T[\phi(x)\phi(y)]|0\rangle\times\langle p_4|\!\int d^4y\,\bar\psi(y)\Gamma\psi(y)|p_2\rangle + (x\leftrightarrow y)\Big] $$

Interactions at x and y can be separated because particles 1 and 2 are two distinguishable particles and the interactions take place at separate space-time points. The second term, where x and y are exchanged, gives an identical contribution to the first; it merely compensates the factor 2!. Inserting

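$$ \langle p_3|\!\int d^4x\,\bar\psi(x)\Gamma\psi(x)|p_1\rangle = \int d^4x\,e^{-i(p_1-p_3)\cdot x}\,\bar u(p_3)\Gamma u(p_1) $$ (6.156a)

$$ \langle 0|T[\phi(x)\phi(y)]|0\rangle = \int\frac{d^4q}{(2\pi)^4}\,e^{-iq\cdot(x-y)}\,\frac{i}{q^2-\mu^2+i\epsilon} $$ (6.156b)

$$ \langle p_4|\!\int d^4y\,\bar\psi(y)\Gamma\psi(y)|p_2\rangle = \int d^4y\,e^{-i(p_2-p_4)\cdot y}\,\bar u(p_4)\Gamma u(p_2) $$ (6.156c)

into Eq. (6.155), we obtain

$$ T_{fi} = \int\frac{d^4q}{(2\pi)^4}\,(2\pi)^4\delta^4(p_1-p_3+q)\,(2\pi)^4\delta^4(p_2-p_4-q)\;\bar u(p_3)\Gamma u(p_1)\,\frac{g^2}{q^2-\mu^2+i\epsilon}\,\bar u(p_4)\Gamma u(p_2) $$
$$ = (2\pi)^4\delta^4(p_1+p_2-p_3-p_4)\;\bar u(p_3)\Gamma u(p_1)\,\frac{g^2}{q^2-\mu^2}\,\bar u(p_4)\Gamma u(p_2) $$ (6.157)

Scalar Meson Exchange

$\Gamma = 1$

To evaluate the amplitude in the nonrelativistic limit, we retain terms only to the lowest order in the 3-momenta:

$$ p_1 = (m, \boldsymbol p_1), \quad p_2 = (m, \boldsymbol p_2), \quad p_3 = (m, \boldsymbol p_3), \quad p_4 = (m, \boldsymbol p_4) $$ (6.158)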



(6.159a) 

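$$ q^2 - \mu^2 = -(|\boldsymbol q|^2 + \mu^2) = -\left[(\boldsymbol p_3 - \boldsymbol p_1)^2 + \mu^2\right] $$ (6.159b)

The amplitude becomes

$$ T_{fi} = -(2\pi)^4\delta^4(p_1+p_2-p_3-p_4)\,\frac{g^2(2m)^2}{|\boldsymbol q|^2+\mu^2} $$ (6.160)

The amplitude of a scattering by a potential V in quantum mechanics is given by

$$ T_{fi} = 2\pi\delta(E_i - E_f)\,V(\boldsymbol q), \qquad V(\boldsymbol q) = \int d^3x\,\psi_f^*(\boldsymbol x)V(\boldsymbol x)\psi_i(\boldsymbol x) = \int d^3x\,e^{-i\boldsymbol q\cdot\boldsymbol x}\,V(\boldsymbol x) $$ (6.161)

Therefore $V(\boldsymbol q)$ for the scalar meson exchange is given by

$$ V(\boldsymbol q) = -\frac{g^2}{|\boldsymbol q|^2+\mu^2} $$ (6.162)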

We have dropped $(2m)^2$ and the $\delta$-function for the momentum conservation. The factor $(2m)^2$ comes from the relativistic wave function normalization and should be dropped for comparison with the nonrelativistic formula. The $\delta$-function originates from integration over the total coordinate $\boldsymbol R$, where

$$ \int d^3R\;e^{i(\boldsymbol p_1+\boldsymbol p_2-\boldsymbol p_3-\boldsymbol p_4)\cdot\boldsymbol R}\int d^3r\;e^{-i\boldsymbol q\cdot\boldsymbol r} = (2\pi)^3\delta^3(\boldsymbol p_1+\boldsymbol p_2-\boldsymbol p_3-\boldsymbol p_4)\int d^3r\;e^{-i\boldsymbol q\cdot\boldsymbol r} $$

and means the CM frame is unaffected by the internal motion. In the nonrelativistic treatment, only the relative motion is treated, so it does not enter the equation.
The potential in the space coordinate is given by

$$ V(r) = \int\frac{d^3q}{(2\pi)^3}\,e^{i\boldsymbol q\cdot\boldsymbol r}\,V(\boldsymbol q) = -\frac{g^2}{4\pi}\,\frac{e^{-\mu r}}{r} $$

which gives an attractive force. Yukawa postulated the existence of a massive force carrier of the nuclear force and, knowing the force range to be approximately $10^{-15}$ m, he predicted its mass to be $\sim 200\,m_e$.
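
The order of magnitude of this estimate is easy to reproduce. The Fourier transform of Eq. (6.162) falls off as $e^{-\mu r}/r$, so the range of the force is $1/\mu$, i.e. $\hbar/(\mu c)$ in ordinary units. The sketch below simply inverts this relation; the constants are rounded standard values and the function name `mediator_mass_mev` is hypothetical.

```python
HBARC_MEV_FM = 197.327      # hbar * c in MeV fm (rounded)
ELECTRON_MASS_MEV = 0.511   # electron mass in MeV (rounded)

def mediator_mass_mev(range_fm):
    """Mass (MeV) of the exchanged meson for a Yukawa range 1/mu given in fm."""
    return HBARC_MEV_FM / range_fm

if __name__ == "__main__":
    for r in (1.0, 2.0):
        m = mediator_mass_mev(r)
        print(f"range {r} fm -> {m:.0f} MeV = {m / ELECTRON_MASS_MEV:.0f} electron masses")
```

A range between 1 and 2 fm gives a few hundred electron masses, which is the order of magnitude quoted above.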

For the pseudoscalar meson exchange, the nuclear force becomes repulsive. We know that the $\pi$ meson is actually a pseudoscalar meson. How then can the nuclear force be attractive so that it can form a bound state, the deuteron? This is related
to the charge exchange force, which will be discussed in connection with isospin 
exchange in Sect. 13.6. 



7 

QED: Quantum Electrodynamics



Quantum electrodynamics (QED) is a mathematical framework for treating the 
quantum dynamics of electrons with the quantized electromagnetic field. Howev- 
er, it can be applied to the electromagnetic interactions of any charged Dirac par- 
ticle, including quarks and leptons. It is a gauge theory based on the symmetry 
U(1) and shares many basic features with more complicated non-Abelian gauge
theories that govern the weak and the strong interactions. Thus learning QED is 
an indispensable step in understanding the dynamical aspect of particle physics. 



e-μ Scattering

As the next logical step after derivation of the Rutherford and Mott scattering, we treat the scattering process of a spin 1/2 particle by a target also having spin 1/2. When the recoil of the target is not negligible, as is the case for e-μ scattering, we have to use the quantized electromagnetic field. Experimentally, a viable choice would be e-p (electron-proton) scattering, but the proton is a hadron, which has a finite size as well as a large anomalous magnetic moment, and therefore is not a Dirac particle. So we consider e-μ scattering, which is impractical from the experimental point of view, but has applications to e-q (electron-quark) scattering and is useful for analyzing deep inelastic scattering data to probe the inner structure of the hadron. Besides, using crossing symmetry, the process can also be applied to $e^-e^+ \to \mu^-\mu^+$ and $e^-e^+ \to q\bar q$ (i.e. $e^-e^+ \to$ hadrons), which are widely used in collider experiments.

7.1.1 

Cross Section 

We define the kinematic variables of e-μ scattering as follows:

$$ e(p_1) + \mu(p_2) \to e(p_3) + \mu(p_4) $$ (7.1)

In $p_i$ (i = 1 to 4), we include spin orientation as well, unless otherwise specified.

Figure 7.1 e-μ scattering process.

In the lowest order, e-μ scattering occurs as shown in Fig. 7.1, by exchange of a photon. As there are two vertices in the diagram, it is a second-order process. The Hamiltonian contains electron as well as muon interactions (the subscripts e and μ label the particle, the index ν is a Lorentz index):

$$ \mathcal H_I = q\,(j_e^\nu + j_\mu^\nu)A_\nu = -e\,(\bar\psi_e\gamma^\nu\psi_e + \bar\psi_\mu\gamma^\nu\psi_\mu)A_\nu $$ (7.2)

The scattering matrix is given by

$$ S_{fi} - \delta_{fi} = \frac{(-i)^2}{2!}\int d^4x\int d^4y\;T\big[\langle p_3, p_4|\mathcal H_I(x)\mathcal H_I(y)|p_1, p_2\rangle\big] $$ (7.3)

Considering the e-transition $p_1 \to p_3$ and the μ-transition $p_2 \to p_4$ to occur at different space-time positions, they can be separated. We insert intermediate states n between the two Hamiltonians,



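$$ 1 = \sum_n |n\rangle\langle n| $$

where n represents any physical state that can be realized through the interaction. The intermediate state in Fig. 7.1 corresponds to one photon with energy-momentum $q^\mu$ and polarization $\lambda$:

$$ \sum_n|n\rangle\langle n| \to \sum_\lambda\int\frac{d^3q}{(2\pi)^3\,2\omega}\,|q\lambda\rangle\langle q\lambda|, \qquad \omega = |\boldsymbol q| $$ (7.5)

Inserting the intermediate states between the two Hamiltonians, T(···) in Eq. (7.3) is expressed as

$$ \sum_\lambda\int\frac{d^3q}{(2\pi)^3\,2\omega}\;T\big[\langle p_3|\mathcal H_I(x)|p_1, q\lambda\rangle\,\langle p_4, q\lambda|\mathcal H_I(y)|p_2\rangle\big] $$ (7.6)

There is also a term with $p_1 \leftrightarrow p_2$, $p_3 \leftrightarrow p_4$, but exchanging the integration variable $x \leftrightarrow y$ makes it identical to Eq. (7.6) and merely compensates the factor 2!. Therefore it is enough to consider Eq. (7.6) only. Referring to Eqs. (6.104a) and (6.104b), Eq. (7.6) can be expressed as

$$ \langle p_3|\mathcal H_I(x)|p_1, q\lambda\rangle = -e\,\bar u_s(p_3)\gamma^\mu u_r(p_1)\,e^{-i(p_1-p_3)\cdot x}\,\langle 0|A_\mu(x)|q\lambda\rangle $$
$$ \langle p_4, q\lambda|\mathcal H_I(y)|p_2\rangle = -e\,\bar u_m(p_4)\gamma^\nu u_l(p_2)\,e^{-i(p_2-p_4)\cdot y}\,\langle q\lambda|A_\nu(y)|0\rangle $$

$$ S_{fi} - \delta_{fi} = (-i)^2\int d^4x\int d^4y\;\{-e\,\bar u_s(p_3)\gamma^\mu u_r(p_1)\,e^{-i(p_1-p_3)\cdot x}\} $$
$$ \times\sum_\lambda\int\frac{d^3q}{(2\pi)^3\,2\omega}\;T\big[\langle 0|A_\mu(x)|q\lambda\rangle\langle q\lambda|A_\nu(y)|0\rangle\big] $$ (7.8)
$$ \times\{-e\,\bar u_m(p_4)\gamma^\nu u_l(p_2)\,e^{-i(p_2-p_4)\cdot y}\} $$

As the middle term is the Feynman propagator, we can insert Eq. (6.115) into it.

$$ S_{fi} - \delta_{fi} = (-i)^2\int d^4x\int d^4y\;\{-e\,\bar u_s(p_3)\gamma^\mu u_r(p_1)\,e^{-i(p_1-p_3)\cdot x}\} $$
$$ \times\frac{1}{(2\pi)^4}\int d^4q\;e^{-iq\cdot(x-y)}\,\frac{-ig_{\mu\nu}}{q^2+i\epsilon} $$ (7.9)
$$ \times\{-e\,\bar u_m(p_4)\gamma^\nu u_l(p_2)\,e^{-i(p_2-p_4)\cdot y}\} $$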

As integration over x, y gives

$$ \int d^4x \to (2\pi)^4\delta^4(p_1-p_3+q), \qquad \int d^4y \to (2\pi)^4\delta^4(p_2-p_4-q) $$ (7.10a)

$$ \frac{1}{(2\pi)^4}\int d^4q\left[\iint d^4x\,d^4y\right] \to (2\pi)^4\delta^4(p_1+p_2-p_3-p_4) $$ (7.10b)

the matrix element of the scattering matrix becomes

$$ S_{fi} = \delta_{fi} - (2\pi)^4\,i\,\delta^4(p_1+p_2-p_3-p_4)\,\mathcal M_{fi} $$
$$ = \delta_{fi} + (2\pi)^4\delta^4(p_1+p_2-p_3-p_4)\;\bar u_s(p_3)(ie\gamma^\mu)u_r(p_1)\left(\frac{-ig_{\mu\nu}}{q^2+i\epsilon}\right)\bar u_m(p_4)(ie\gamma^\nu)u_l(p_2) $$ (7.11)
$$ q = p_3 - p_1 = p_2 - p_4 $$

which gives the Lorentz invariant matrix element $\mathcal M_{fi}$

$$ \mathcal M_{fi} = -e^2\,\bar u_s(p_3)\gamma^\mu u_r(p_1)\,\frac{1}{q^2+i\epsilon}\,\bar u_m(p_4)\gamma_\mu u_l(p_2) $$ (7.12)

Under normal experimental situations, the polarization states of the particles are not measured. Then to obtain the transition amplitude, we have to take the spin sum (average of the initial- and sum of the final-state polarization). We have already done part of the calculation [see Eqs. (6.139)-(6.143)],

$$ L^{\mu\nu} = \frac12\sum_{r,s}[\bar u_s(p_3)\gamma^\mu u_r(p_1)][\bar u_r(p_1)\gamma^\nu u_s(p_3)] = 2\left[p_3^\mu p_1^\nu + p_3^\nu p_1^\mu + \frac{q^2}{2}g^{\mu\nu}\right] $$ (7.13a)

The same calculation for the muon part gives

$$ M_{\mu\nu} = 2\left[p_{4\mu}p_{2\nu} + p_{4\nu}p_{2\mu} + \frac{q^2}{2}g_{\mu\nu}\right] $$ (7.13b)

The current conservation law $\partial_\mu j^\mu = 0$ gives

$$ q_\mu L^{\mu\nu} = q_\nu L^{\mu\nu} = 0, \qquad q^\mu M_{\mu\nu} = q^\nu M_{\mu\nu} = 0 $$ (7.14)

which can be explicitly verified from expressions (7.13).



Problem 7.1 

Prove Eq. (7.14) by actually multiplying Eq. (7.13) by $q_\mu$.

Using the relation

$$ p_4 = p_4 - p_2 + p_2 = -q + p_2 $$

and Eq. (7.14), one sees that $p_4$ in $M_{\mu\nu}$ can be replaced by $p_2$:

$$ N = L^{\mu\nu}M_{\mu\nu} = 4\left[p_3^\mu p_1^\nu + p_3^\nu p_1^\mu + \frac{q^2}{2}g^{\mu\nu}\right]\left[2p_{2\mu}p_{2\nu} + \frac{q^2}{2}g_{\mu\nu}\right] $$
$$ = 4\left[4(p_1\cdot p_2)(p_2\cdot p_3) + q^2p_2^2 + q^2(p_1\cdot p_3) + (q^2)^2\right] $$

Since

$$ 2(p_1\cdot p_2) = (p_1+p_2)^2 - p_1^2 - p_2^2 = s - m^2 - M^2 $$
$$ 2(p_2\cdot p_3) = -(p_3-p_2)^2 + p_3^2 + p_2^2 = -(p_1-p_4)^2 + m^2 + M^2 = m^2 + M^2 - u $$
$$ 2(p_1\cdot p_3) = -(p_1-p_3)^2 + p_1^2 + p_3^2 = 2m^2 - t $$

N is rewritten as

$$ N = 4\left[(s-m^2-M^2)(m^2+M^2-u) + t(m^2+M^2) + \frac{t^2}{2}\right] $$
$$ = 2\left[(s-m^2-M^2)^2 + (u-m^2-M^2)^2 + 2t(m^2+M^2)\right] $$ (7.16b)

where we used

$$ s + t + u = 2(m^2+M^2), \qquad q^2 = t $$ (7.16c)

in deriving the last equality. Substituting Eqs. (7.16b) in the square of the matrix element Eq. (7.12), the invariant scattering amplitude (squared) becomes

$$ \overline{|\mathcal M_{fi}|^2} = \frac{e^4}{q^4}\,N = \frac{4e^4}{t^2}\left[(s-m^2-M^2)(m^2+M^2-u) + t(m^2+M^2) + \frac{t^2}{2}\right] $$ (7.17)

where $\overline{|\mathcal M_{fi}|^2}$ denotes that the average over the initial spin states and the sum over the final spin states were taken for $|\mathcal M_{fi}|^2$. In the LAB frame, neglecting the electron mass (m = 0), the invariant Mandelstam variables are expressed as

$$ s - M^2 = 2p_1M, \qquad M^2 - u = 2p_3M, \qquad t = -2p_1p_3(1-\cos\theta) $$ (7.18)

Inserting the above expression into Eq. (7.16b), the invariant matrix element is 



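$$ \overline{|\mathcal M_{fi}|^2}_{\rm LAB} = \frac{16e^4M^2p_1p_3}{q^4}\left[\cos^2\frac{\theta}{2} - \frac{q^2}{2M^2}\sin^2\frac{\theta}{2}\right] $$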

(7.20a) 



As the cross section in LAB is given by Eq. (6.91), 

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm Mott} = \frac{4\alpha^2E^2}{q^4}\cos^2\frac{\theta}{2} $$ (7.20c)

The first term in square brackets [···] represents scattering by the electric field created by the target and the second by the magnetic field. As $p_3 = p_1/[1 + p_1(1-\cos\theta)/M]$, the limit $M \to \infty$ reproduces the Mott formula.
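
The chain of rewritings above (dot products, Mandelstam variables, LAB variables) can be cross-checked numerically. The following sketch builds explicit four-vectors in the LAB frame for a massless electron and compares three expressions for $N = L^{\mu\nu}M_{\mu\nu}$: the contracted dot-product form, the Mandelstam form of Eq. (7.16b), and $16M^2p_1p_3[\cos^2(\theta/2) - (q^2/2M^2)\sin^2(\theta/2)]$. All function names are illustrative, and the numerical values of $p_1$, $\theta$ and $M$ are arbitrary test inputs.

```python
import math

def mdot(a, b):
    """Minkowski product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def lab_momenta(p1, theta, M):
    """Massless electron of momentum p1 scattering by angle theta on a mass-M target at rest."""
    p3 = p1 / (1.0 + p1 * (1.0 - math.cos(theta)) / M)
    k1 = (p1, 0.0, 0.0, p1)
    k2 = (M, 0.0, 0.0, 0.0)
    k3 = (p3, p3 * math.sin(theta), 0.0, p3 * math.cos(theta))
    k4 = tuple(a + b - c for a, b, c in zip(k1, k2, k3))  # momentum conservation
    return k1, k2, k3, k4

def mandelstam(k1, k2, k3, k4):
    add = lambda a, b: tuple(x + y for x, y in zip(a, b))
    sub = lambda a, b: tuple(x - y for x, y in zip(a, b))
    s = mdot(add(k1, k2), add(k1, k2))
    t = mdot(sub(k1, k3), sub(k1, k3))
    u = mdot(sub(k1, k4), sub(k1, k4))
    return s, t, u

def n_dot_products(k1, k2, k3, t):
    """4[4(p1.p2)(p2.p3) + q^2 p2^2 + q^2 (p1.p3) + q^4] with q^2 = t."""
    return 4.0 * (4.0 * mdot(k1, k2) * mdot(k2, k3) + t * mdot(k2, k2) + t * mdot(k1, k3) + t * t)

def n_mandelstam(s, t, u, m, M):
    """Second line of Eq. (7.16b)."""
    mm = m * m + M * M
    return 2.0 * ((s - mm) ** 2 + (u - mm) ** 2 + 2.0 * t * mm)

def n_lab(p1, theta, M):
    """16 M^2 p1 p3 [cos^2(theta/2) - (q^2 / 2M^2) sin^2(theta/2)]."""
    p3 = p1 / (1.0 + p1 * (1.0 - math.cos(theta)) / M)
    q2 = -2.0 * p1 * p3 * (1.0 - math.cos(theta))
    half = theta / 2.0
    return 16.0 * M * M * p1 * p3 * (math.cos(half) ** 2 - q2 / (2.0 * M * M) * math.sin(half) ** 2)

if __name__ == "__main__":
    p1, theta, M = 2.0, 0.9, 0.1057
    k1, k2, k3, k4 = lab_momenta(p1, theta, M)
    s, t, u = mandelstam(k1, k2, k3, k4)
    print(s + t + u, 2 * M * M)
    print(n_dot_products(k1, k2, k3, t), n_mandelstam(s, t, u, 0.0, M), n_lab(p1, theta, M))
```

The three numbers printed on the last line agree to rounding error, which is a quick way to catch sign or factor-of-two slips when manipulating these expressions by hand.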
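The cross section in CM is given by Eq. (6.90)

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm CM} = \frac{p_f}{p_i}\,\frac{1}{64\pi^2 s}\,\overline{|\mathcal M_{fi}|^2}_{\rm CM} $$ (7.21)

In many cases of high-energy scattering, where $s, u \gg M^2$, we can set $M^2 = 0$. Then

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm CM} = \frac{p_f}{p_i}\,\frac{1}{64\pi^2 s}\,\frac{(4\pi\alpha)^2\,2}{q^4}\,(s^2+u^2) = \frac{\alpha^2}{2s}\,\frac{s^2+u^2}{t^2} = \frac{\alpha^2}{2s}\,\frac{1 + \left(\frac{1+\cos\theta}{2}\right)^2}{\sin^4(\theta/2)} $$ (7.22)

This is a formula that appears frequently later in discussions of deep inelastic scattering.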

7.1.2 

Elastic Scattering of Polarized e-μ

We consider in this section longitudinally polarized electron-muon scattering. We use here chiral eigenstates to describe the processes. The advantage is that the chirality does not change during the interaction and the calculation is easier than that using the helicity eigenstate. Besides, it is also the eigenstate of the weak interaction and the derived formula can be applied directly to neutrino scattering. The disadvantage is that the chiral states are not eigenstates of the Hamiltonian, and the scattering amplitude written in terms of the chiral eigenstates does not describe the physical process accurately and hence does not necessarily reflect the properties of helicity eigenstates correctly. However, as was explained in Sect. 4.1.2, the massless particle is a Weyl particle and the chirality L(R) states and the helicity states are the same except for a sign change between particles and antiparticles. In most high-energy experiments, the energy is high enough to justify setting the mass of the particle to zero. This is certainly true for electrons (m ≈ 0.5 MeV), muons and light quarks (u, d, s) (m ≈ 5 to 100 MeV) for typical experiments where the energy is above ~1 GeV.

The conservation of chirality in vector-type interactions, which include all the 
strong, weak and electromagnetic interactions in the Standard Model, was already 
shown in Sect. 4.3.5. This means 

$$ \bar\psi\gamma^\mu\psi = \bar\psi_L\gamma^\mu\psi_L + \bar\psi_R\gamma^\mu\psi_R $$
$$ \bar\psi_L\gamma^\mu\psi_R = \bar\psi_R\gamma^\mu\psi_L = 0 $$

If we make use of the property that the chiral eigenstate does not change after interactions, the unpolarized e-μ scattering can be decomposed into the following four processes:

$$ e_L\mu_L \to e_L\mu_L \quad (LL) $$ (7.24a)
$$ e_R\mu_L \to e_R\mu_L \quad (RL) $$ (7.24b)
$$ e_L\mu_R \to e_L\mu_R \quad (LR) $$ (7.24c)
$$ e_R\mu_R \to e_R\mu_R \quad (RR) $$ (7.24d)

Let us consider $e_L\mu_L \to e_L\mu_L$ first. The scattering amplitude can be obtained from that of unpolarized e-μ scattering by the substitution $\gamma^\mu \to \gamma^\mu(1-\gamma^5)/2$, hence


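$$ \frac{d\sigma}{d\Omega}\Big|_{\rm CM} = \frac{1}{64\pi^2 s}\,|\mathcal M_{fi}|^2 $$ (7.25a)

$$ |\mathcal M_{fi}|^2 = \frac{e^4}{q^4}\,K_L^{\mu\nu}(p_1,p_3)\,K_{L\mu\nu}(p_2,p_4) $$ (7.25b)

$$ K_L^{\mu\nu}(p_1,p_3) = \frac14\sum_{r,s}\bar u_s(p_3)\gamma^\mu(1-\gamma^5)u_r(p_1)\;\bar u_r(p_1)\gamma^\nu(1-\gamma^5)u_s(p_3) $$ (7.25c)

$$ = \frac14\,{\rm Tr}\left[\gamma^\mu(1-\gamma^5)(\not p_1 + m_1)\gamma^\nu(1-\gamma^5)(\not p_3 + m_3)\right] $$ (7.25d)

To calculate the trace, we use the properties of the $\gamma^5$ matrix:

$$ \gamma^\mu\gamma^5 = -\gamma^5\gamma^\mu, \qquad \gamma^5 = -\frac{i}{4!}\,\epsilon_{\mu\nu\rho\sigma}\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma $$ (7.26a)

$$ \epsilon^{0123} = -\epsilon_{0123} = 1 $$ (7.26b)

$$ \epsilon_{\mu\nu\rho\sigma} = \begin{cases} -1 & \mu\nu\rho\sigma = \text{even permutation of } 0123 \\ +1 & \mu\nu\rho\sigma = \text{odd permutation of } 0123 \\ 0 & \text{otherwise} \end{cases} $$ (7.26c)

$$ \epsilon^{\mu\nu\rho\sigma}\epsilon_{\mu\nu}{}^{\gamma\delta} = -2\,(g^{\rho\gamma}g^{\sigma\delta} - g^{\rho\delta}g^{\sigma\gamma}) $$ (7.26d)

Then

$$ {\rm Tr}[\gamma^5] = {\rm Tr}[\gamma^5\gamma^\mu] = {\rm Tr}[\gamma^5\gamma^\mu\gamma^\nu] = {\rm Tr}[\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho] = 0 $$ (7.27a)

$$ {\rm Tr}[\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = -4i\,\epsilon^{\mu\nu\rho\sigma} $$ (7.27b)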

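Using these relations, Eq. (7.25d) can be simplified to give

$$ K_L^{\mu\nu}(p_1,p_3) = P^{\mu\nu} + Q^{\mu\nu} $$ (7.28a)
$$ = 2\left[\{p_1^\mu p_3^\nu + p_1^\nu p_3^\mu - g^{\mu\nu}(p_1\cdot p_3)\} + i\epsilon^{\mu\nu\rho\sigma}p_{1\rho}p_{3\sigma}\right] $$ (7.28b)

$K_L^{\mu\nu}(p_1,p_3)K_{L\mu\nu}(p_2,p_4)$ can be calculated using the following equalities:

$$ P^{\mu\nu}(p_1,p_3)P_{\mu\nu}(p_2,p_4) = 8\left[(p_1\cdot p_2)(p_3\cdot p_4) + (p_1\cdot p_4)(p_2\cdot p_3)\right] $$ (7.29a)
$$ Q^{\mu\nu}(p_1,p_3)Q_{\mu\nu}(p_2,p_4) = 8\left[(p_1\cdot p_2)(p_3\cdot p_4) - (p_1\cdot p_4)(p_2\cdot p_3)\right] $$ (7.29b)
$$ P^{\mu\nu}Q_{\mu\nu} = P_{\mu\nu}Q^{\mu\nu} = 0 $$ (7.29c)
$$ 4(p_1\cdot p_2)(p_3\cdot p_4) = \left[(p_1+p_2)^2 - p_1^2 - p_2^2\right]\left[(p_3+p_4)^2 - p_3^2 - p_4^2\right] = (s - m_1^2 - m_2^2)(s - m_3^2 - m_4^2) $$ (7.29d)

Similarly

$$ 4(p_1\cdot p_4)(p_2\cdot p_3) = (u - m_1^2 - m_4^2)(u - m_2^2 - m_3^2) $$ (7.29e)

$$ K_L^{\mu\nu}(p_1,p_3)K_{L\mu\nu}(p_2,p_4) = 16\,(p_1\cdot p_2)(p_3\cdot p_4) = 4(s - m_1^2 - m_2^2)(s - m_3^2 - m_4^2) \simeq 4s^2 $$ (7.30)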

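Inserting Eq. (7.30) into Eq. (7.25), we obtain

$$ d\sigma_{LL} \equiv \frac{d\sigma}{d\Omega}\Big|_{\rm CM}(LL) = \frac{\alpha^2}{t^2}\,s $$ (7.31)

In calculating the cross section for $e_R\mu_L$ scattering, we only need to replace $\gamma^5$ by $-\gamma^5$ in the electron part of $K_L(p_1,p_3)$. Then $Q^{\mu\nu}$ in $K_L(p_1,p_3)$ becomes $-Q^{\mu\nu}$, but that in the muon part does not change. Then $Q^{\mu\nu}Q_{\mu\nu}$ is changed to $-Q^{\mu\nu}Q_{\mu\nu}$ and the net effect is

$$ (p_1\cdot p_2)(p_3\cdot p_4) \to (p_1\cdot p_4)(p_2\cdot p_3) \quad\text{or}\quad s \leftrightarrow u $$ (7.32a)

$$ \therefore\; d\sigma_{RL} = \frac{\alpha^2}{t^2}\,\frac{u^2}{s} = \frac{\alpha^2}{t^2}\,s\left(\frac{1+\cos\theta}{2}\right)^2 $$ (7.32b)

Others can be calculated similarly:

$$ d\sigma_{LR} = d\sigma_{RL}, \qquad d\sigma_{RR} = d\sigma_{LL} $$ (7.33)

The unpolarized scattering cross section is simply the sum of all the above cross sections divided by 4 to account for average L, R fractions in the beam:

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm CM} = \frac14\left(d\sigma_{LL} + d\sigma_{RL} + d\sigma_{LR} + d\sigma_{RR}\right) = \frac{\alpha^2 s}{2t^2}\left[1 + \left(\frac{1+\cos\theta}{2}\right)^2\right] $$ (7.34)

As $t = q^2$, this agrees with Eq. (7.22).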
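
The bookkeeping of the four chirality combinations can be verified with a short numerical sketch. It assumes massless kinematics in the CM frame, $t = -s(1-\cos\theta)/2$ and $u = -s(1+\cos\theta)/2$, and natural units; the function names are illustrative only.

```python
ALPHA = 1.0 / 137.036  # fine-structure constant (rounded)

def t_and_u(s, cos_theta):
    """Massless CM kinematics: t = -s(1 - cos)/2, u = -s(1 + cos)/2."""
    return -0.5 * s * (1.0 - cos_theta), -0.5 * s * (1.0 + cos_theta)

def dsigma_ll(s, cos_theta):
    """LL (and RR) cross section, alpha^2 s / t^2."""
    t, _ = t_and_u(s, cos_theta)
    return ALPHA ** 2 * s / t ** 2

def dsigma_rl(s, cos_theta):
    """RL (and LR) cross section, obtained from LL by s <-> u in the numerator."""
    t, u = t_and_u(s, cos_theta)
    return ALPHA ** 2 * u ** 2 / (s * t ** 2)

def dsigma_unpolarized(s, cos_theta):
    """Average over the four chirality combinations."""
    return 0.25 * (2.0 * dsigma_ll(s, cos_theta) + 2.0 * dsigma_rl(s, cos_theta))

def dsigma_spin_summed(s, cos_theta):
    """Spin-averaged result (alpha^2 / 2s)(s^2 + u^2) / t^2."""
    t, u = t_and_u(s, cos_theta)
    return ALPHA ** 2 / (2.0 * s) * (s ** 2 + u ** 2) / t ** 2

if __name__ == "__main__":
    s, c = 10.0, 0.3
    print(dsigma_unpolarized(s, c), dsigma_spin_summed(s, c))
    print(dsigma_rl(s, c) / dsigma_ll(s, c), ((1 + c) / 2) ** 2)
```

The ratio of the RL to the LL cross section is $[(1+\cos\theta)/2]^2$, which vanishes for backward scattering.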

The above expression can be understood in terms of helicity conservation. In the CM frame, the $e_L\mu_L$ and $e_R\mu_L$ two-body systems are in the helicity eigenstates of the total angular momentum J = 0, 1. The system scattered by angle $\theta$ can be obtained from the incoming system by rotation if there are no other changes. The rotation matrix elements (see Appendix E) are given by

$$ d^J_{J'_z J_z} = \langle J, J'_z|\,e^{-iJ_y\theta}\,|J, J_z\rangle $$ (7.35a)

$$ d^0_{00} = 1, \qquad d^1_{11} = \frac12\,(1+\cos\theta) $$ (7.35b)

The helicity states are described in Fig. 7.2. For LL or RR scattering, the sum of the $J_z$ is zero and is conserved at any angle, but for RL or LR scattering $J_z = \pm 1$ and it is inverted at the scattering angle $\theta = \pi$, violating angular momentum conservation. The rotation matrix reflects this conservation law and gives the suppression factor $(1+\cos\theta)/2$ in Eq. (7.35).

Note that $e_L \leftrightarrow e_R$ exchange gives $s \leftrightarrow u$ in the scattering matrix. As $s \leftrightarrow u$ exchange can be alternatively realized by crossing ($p_1 \to -p_3$, $p_3 \to -p_1$), particle ↔ antiparticle exchange of one or other partner in the scattering and that of R ↔ L are equivalent as far as the kinematics of the scattering is concerned. In e-μ scattering the crossing can be distinguished by seeing the charge change of the particle, but for neutral particles, such as neutrinos, looking at the angular distribution difference is one way to tell particles from antiparticles.



Figure 7.2 The difference of the angular distributions by helicity conservation. Panels (a), (b): before scattering; panels (c), (d): after scattering ($\theta = \pi$). For LL scattering, the sum of the angular momentum of the e-μ two-body system is zero and is conserved at any angle. For RL scattering, $J_z = 1/2 + 1/2$ before scattering, but after scattering at the scattering angle $\theta = \pi$, $J_z = -1$ and the angular momentum is not conserved. The suppression factor $(1+\cos\theta)^2/4$ reflects this fact.

$e^-e^+ \to \mu^-\mu^+$ Reaction

It is possible and not so difficult to calculate the $e\bar e \to \mu\bar\mu$ production cross section using the standard procedure we have learned so far [see Eq. (C.17)]. Here, instead, we use crossing to derive the cross section. The e-μ scattering amplitude as shown in Fig. 7.3a can be converted to that of the $e\bar e \to \mu\bar\mu$ reaction (Fig. 7.3b) by crossing ($p_2 \to -p_3$, $p_3 \to -p_2$). We attach a subscript t to the crossed channel variables to distinguish them from those before crossing:

$$ s = (p_1+p_2)^2 \to (p_1-p_3)^2 = u_t $$ (7.36a)
$$ t = (p_1-p_3)^2 = q^2 \to (p_1+p_2)^2 = s_t $$ (7.36b)
$$ u = (p_1-p_4)^2 \to (p_1-p_4)^2 = t_t $$ (7.36c)

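The scattering angle $\theta_t$ in the crossed channel is defined by the angle between e and μ. The spin-averaged scattering matrix element (squared) is then converted to

$$ \overline{|\mathcal M_{fi}|^2}(e\mu \to e\mu) = 2e^4\,\frac{s^2+u^2}{t^2} \;\to\; \overline{|\mathcal M_{fi}|^2}(e\bar e \to \mu\bar\mu) = 2e^4\,\frac{u_t^2+t_t^2}{s_t^2} $$ (7.37)

Dropping the subscript, the cross section for $e\bar e \to \mu\bar\mu$ becomes

$$ \frac{d\sigma}{d\Omega}\Big|_{\rm CM} = \frac{\alpha^2}{2s}\,\frac{t^2+u^2}{s^2} = \frac{\alpha^2}{4s}\,(1+\cos^2\theta) $$ (7.38a)

$$ \sigma_{\rm TOT} = \frac{4\pi\alpha^2}{3s} = \frac{87}{s\,[{\rm GeV}^2]}\ {\rm nb} \qquad ({\rm nb} = 10^{-33}\ {\rm cm}^2) $$ (7.38b)

This is a basic cross section used in electron-positron colliding accelerator experiments.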
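
The numerical coefficient of 87 nb GeV² follows from converting natural units with $(\hbar c)^2$. A minimal sketch, using rounded standard values for $\alpha$ and $(\hbar c)^2$ and illustrative function names, is:

```python
import math

ALPHA = 1.0 / 137.035999          # fine-structure constant (rounded)
HBARC2_GEV2_NB = 0.3893794e6      # (hbar c)^2 in GeV^2 nb (rounded)

def sigma_total_nb(s_gev2):
    """Total cross section 4 pi alpha^2 / (3 s) in nb, with s in GeV^2."""
    return 4.0 * math.pi * ALPHA ** 2 / (3.0 * s_gev2) * HBARC2_GEV2_NB

def dsigma_domega_nb(s_gev2, cos_theta):
    """Differential cross section alpha^2 (1 + cos^2 theta) / (4 s) in nb/sr."""
    return ALPHA ** 2 / (4.0 * s_gev2) * (1.0 + cos_theta ** 2) * HBARC2_GEV2_NB

def integrate_over_solid_angle(s_gev2, steps=2000):
    """Midpoint-rule integral of the differential cross section over the full solid angle."""
    h = 2.0 / steps
    total = sum(dsigma_domega_nb(s_gev2, -1.0 + (i + 0.5) * h) for i in range(steps)) * h
    return 2.0 * math.pi * total

if __name__ == "__main__":
    print(sigma_total_nb(1.0))               # about 87 nb at s = 1 GeV^2
    print(integrate_over_solid_angle(1.0))   # same number from the angular distribution
    print(sigma_total_nb(100.0))             # sqrt(s) = 10 GeV
```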

Problem 7.2 

Show that if the mass of the produced particle is not neglected

$$ \sigma_{\rm TOT} = \frac{4\pi\alpha^2}{3s}\,F(\beta), \qquad F(\beta) = \frac{\beta(3-\beta^2)}{2}, \qquad \beta = \text{particle velocity in CM} $$ (7.39)



Compton Scattering 

Scattering Amplitude 

Photon-electron scattering is known as Compton scattering. Historically, it is famous as the process that showed the particle nature of the light wave and proved Einstein's light quantum hypothesis proposed in 1905. Here, we calculate the process with later application to QCD in mind. The variables in the reaction are defined as

$$ \gamma(k) + e(p) \to \gamma(k') + e(p') $$ (7.40)

In the lowest order, two processes shown in Fig. 7.4 contribute. The scattering matrix is expressed as

$$ S_{fi} - \delta_{fi} = \frac{(-iq)^2}{2!}\int d^4x\,d^4y\;\langle f|T[j^\mu(x)A_\mu(x)\,j^\nu(y)A_\nu(y)]|i\rangle $$ (7.41)

From now on we will omit $\delta_{fi}$ from the scattering matrix where there is no confusion. Separating the photon and electron parts

$$ S_{fi} = (-iq)^2\int d^4x\,d^4y\;T\big[\langle k'|A_\mu(x)|0\rangle\,\langle p'|j^\mu(x)j^\nu(y)|p\rangle\,\langle 0|A_\nu(y)|k\rangle\big] $$
$$ = (-iq)^2\int d^4x\,d^4y\;\varepsilon_\mu(k')^*\,\langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle\,\varepsilon_\nu(k)\,e^{i(k'\cdot x - k\cdot y)} $$ (7.42a)

Figure 7.4 Two diagrams …

$$ \langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle = \langle p'|T[\bar\psi(x)\gamma^\mu\psi(x)\,\bar\psi(y)\gamma^\nu\psi(y)]|p\rangle $$ (7.42b)

where the polarization index $\lambda$ has been suppressed for simplicity. The factor 1/2 is canceled by an identical contribution obtained by $x \leftrightarrow y$ in Eq. (7.41). Corresponding to two annihilation operators $\psi$, there are two ways of making the matrix element. When $x^0 - y^0 > 0$, writing the matrix element suffixes explicitly

$$ \langle p'|\,\bar\psi_\alpha(x)\gamma^\mu_{\alpha\beta}\psi_\beta(x)\;\bar\psi_\gamma(y)\gamma^\nu_{\gamma\delta}\psi_\delta(y)\,|p\rangle $$ (7.43)

where bracket (1) pairs the outer operators $\bar\psi_\alpha(x)$ and $\psi_\delta(y)$ with the external states and bracket (2) pairs the inner operators $\psi_\beta(x)$ and $\bar\psi_\gamma(y)$ with them.

In the bracket labeled (1), the incoming particle $|p\rangle$ is annihilated by $\psi_\delta$ at point y and the outgoing particle $\langle p'|$ is created by $\bar\psi_\alpha$ at point x. In bracket (2), $|p\rangle$ is annihilated by $\psi_\beta$ at point x and the outgoing particle $\langle p'|$ is created by $\bar\psi_\gamma$ at point y. Therefore, bracket (1) corresponds to diagram (a) of Fig. 7.4 and bracket (2) to (b). The matrix element of bracket (1) can be written as

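$$ \langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle\Big|_{(a),\,x^0>y^0} = \bar u_\alpha(p')\,\gamma^\mu_{\alpha\beta}\,\langle 0|\psi_\beta(x)\bar\psi_\gamma(y)|0\rangle\,\gamma^\nu_{\gamma\delta}\,u_\delta(p)\;e^{i(p'\cdot x - p\cdot y)} $$ (7.44)

When $x^0 - y^0 < 0$

$$ \langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle = \langle p'|j^\nu(y)j^\mu(x)|p\rangle $$ (7.45)

This is the same as Eq. (7.43) if the arguments are interchanged:

$$ x \leftrightarrow y, \qquad \mu \leftrightarrow \nu, \qquad \alpha\beta \leftrightarrow \gamma\delta $$ (7.46)

$$ \langle p'|\,\bar\psi_\gamma(y)\gamma^\nu_{\gamma\delta}\psi_\delta(y)\;\bar\psi_\alpha(x)\gamma^\mu_{\alpha\beta}\psi_\beta(x)\,|p\rangle $$ (7.47)

This time diagram (a) corresponds to bracket (2), and its matrix element is given by

$$ \langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle\Big|_{(a),\,x^0<y^0} = \bar u_\alpha(p')\,\gamma^\mu_{\alpha\beta}\,\langle 0|{-\bar\psi_\gamma(y)\psi_\beta(x)}|0\rangle\,\gamma^\nu_{\gamma\delta}\,u_\delta(p)\;e^{i(p'\cdot x - p\cdot y)} $$ (7.48)

Combining Eqs. (7.44) and (7.48)

$$ \langle p'|T[j^\mu(x)j^\nu(y)]|p\rangle\Big|_{(a)} = \bar u(p')\,\gamma^\mu\,\langle 0|T[\psi(x)\bar\psi(y)]|0\rangle\,\gamma^\nu\,u(p)\;e^{i(p'\cdot x - p\cdot y)} $$ (7.49)

$\langle 0|T[\cdots]|0\rangle$ is just the Feynman propagator. Using Eq. (6.126), the scattering matrix element corresponding to diagram (a) of Fig. 7.4 becomes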

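$$ S_{fi}\Big|_{(a)} = \int d^4x\,d^4y\;e^{i(k'+p')\cdot x - i(k+p)\cdot y}\;\varepsilon_\mu(k')^*\,\bar u(p')(-iq\gamma^\mu)\left[\int\frac{d^4l}{(2\pi)^4}\,e^{-il\cdot(x-y)}\,\frac{i(\not l + m)}{l^2 - m^2 + i\epsilon}\right](-iq\gamma^\nu)\,u(p)\,\varepsilon_\nu(k) $$
$$ = (2\pi)^4\delta^4(k + p - k' - p') $$ (7.50b)
$$ \times\;\varepsilon_\mu(k')^*\,\bar u(p')(-iq\gamma^\mu)\,\frac{i(\not p + \not k + m)}{(p+k)^2 - m^2 + i\epsilon}\,(-iq\gamma^\nu)\,u(p)\,\varepsilon_\nu(k) $$ (7.50c)

where $l$ denotes the momentum of the internal electron line. The scattering matrix for diagram (b) can be obtained similarly, but is more easily obtained by crossing diagram (a), i.e. by interchanging the variables $k \leftrightarrow -k'$, $\varepsilon(k')^* \leftrightarrow \varepsilon(k)$. Then the total scattering amplitude is given by

$$ \mathcal M_{fi} = \mathcal M_a + \mathcal M_b $$ (7.51a)
$$ = e^2\,\bar u(p')\left[\not\varepsilon'^*\,\frac{\not p + \not k + m}{(p+k)^2 - m^2}\,\not\varepsilon + \not\varepsilon\,\frac{\not p - \not k' + m}{(p-k')^2 - m^2}\,\not\varepsilon'^*\right]u(p) $$ (7.51b)

where $\not\varepsilon = \gamma^\mu\varepsilon_\mu(k)$, $\not\varepsilon' = \gamma^\mu\varepsilon_\mu(k')$ are used for brevity. The calculations that follow are familiar trace algebra. But what will become important for later applications is the high-energy behavior at $s \gg m^2$. So we will set m = 0, which considerably simplifies the calculation.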

Summation of the Photon Polarization  Assuming that neither photon nor electron is polarized, we take the average of the initial and sum of the final states. As discussed in Sect. 5.3.4, the polarization sum can be extended to all four states, giving

$$ \sum_{\lambda=\pm}\varepsilon_i(\lambda)\varepsilon_j(\lambda)^* = \delta_{ij} - \frac{k_ik_j}{|\boldsymbol k|^2} \;\to\; \sum_{\lambda}\varepsilon_\mu(\lambda)\varepsilon_\nu(\lambda)^* = -g_{\mu\nu} $$ (7.52)

This is because the polarization vector always appears coupled with the conserved current. Since the conserved current satisfies

$$ \partial_\mu j^\mu = 0 \;\to\; k_\mu j^\mu = k^0j^0 - |\boldsymbol k|\,j^3 = 0 $$ (7.53)

in the coordinate system where the z-axis is taken along the momentum direction, the polarization sum of the matrix elements (squared) is rewritten as

$$ \sum_{\lambda=\pm}|\mathcal M|^2 \sim \sum_{\lambda=\pm}|\varepsilon_\mu(\lambda)j^\mu|^2 = |j^1|^2 + |j^2|^2 = -\frac{k^2}{(k^0)^2}\,|j^3|^2 - g_{\mu\nu}\,j^\mu j^{\nu*} $$ (7.54)

The first term vanishes because for the real photon $k^2 = 0$.

As the degrees of freedom of both the photon polarization and electron spin are two, we divide by 4 in taking the spin-polarization average:

$$ \overline{|\mathcal M|^2} = \frac14\sum|\mathcal M_a + \mathcal M_b|^2 = \frac14\sum\left(|\mathcal M_a|^2 + |\mathcal M_b|^2 + 2\,{\rm Re}\,\mathcal M_a\mathcal M_b^*\right) = 2e^4\left(-\frac{u}{s} - \frac{s}{u}\right) $$ (7.55a)

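We prove the first term. Setting m = 0 and picking up only the numerators, we have

$$ \sum|\mathcal M_a|^2 \sim e^4\sum\bar u_s(p')\,\not\varepsilon'^*(\not p + \not k)\not\varepsilon\,u_r(p)\;\bar u_r(p)\,\not\varepsilon^*(\not p + \not k)\not\varepsilon'\,u_s(p') $$ (7.56a)

Using (see Appendix C for trace calculation formula)

$$ \sum_\lambda\not\varepsilon^*\cdots\not\varepsilon = \sum_\lambda\varepsilon_\mu^*\gamma^\mu\cdots\varepsilon_\nu\gamma^\nu = \Big(\sum_\lambda\varepsilon_\mu^*\varepsilon_\nu\Big)\gamma^\mu\cdots\gamma^\nu = -\gamma_\mu\cdots\gamma^\mu $$ (7.56b)
$$ \gamma_\mu\not q\,\gamma^\mu = -2\not q $$ (7.56c)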


: 2 j/ (7.56d) 



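$$ \sum_\lambda\not\varepsilon\Big(\sum_r u_r(p)\bar u_r(p)\Big)\not\varepsilon^* = -\gamma_\mu(\not p + m)\gamma^\mu \to 2\not p, \qquad \sum_{\lambda'}\not\varepsilon'\Big(\sum_s u_s(p')\bar u_s(p')\Big)\not\varepsilon'^* \to 2\not p' $$ (7.56e)

$$ \therefore\; \frac14\sum|\mathcal M_a|^2 = e^4\,{\rm Tr}\left[(\not p + \not k)\not p\,(\not p + \not k)\not p'\right] $$
$$ = e^4\,{\rm Tr}\left[\not p\not p\not p\not p' + \not p\not p\not k\not p' + \not k\not p\not p\not p' + \not k\not p\not k\not p'\right] $$
$$ = e^4\,{\rm Tr}\left[\not k\not p\not k\not p'\right] = 4e^4\left[2(k\cdot p)(k\cdot p') - k^2(p\cdot p')\right] = -2e^4su $$ (7.56f)

Here we have used the relations

$$ \not p\not p = p^2 = m^2 = 0, \qquad \not k\not k = k^2 = 0, \qquad s = 2(k\cdot p), \qquad u = -2(k\cdot p') $$ (7.56g)

Recovering other factors, and using that the denominators give $[(p+k)^2 - m^2]^2 \simeq s^2$

$$ \overline{|\mathcal M_a|^2} = -2e^4\,\frac{u}{s} $$ (7.56h)

The interference term vanishes. Then the cross section is obtained from Eq. (6.92)

$$ \frac{d\sigma}{dt} = \frac{\overline{|\mathcal M_{fi}|^2}}{16\pi(s-m^2)^2} = \frac{2\pi\alpha^2}{s^2}\left(-\frac{u}{s} - \frac{s}{u}\right) $$ (7.57)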
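
The trace identity used in Eq. (7.56f) can be confirmed by brute force with explicit Dirac matrices. The sketch below is an independent check, not the book's method: it builds the gamma matrices in the Dirac representation (an assumed but standard choice), evaluates ${\rm Tr}[\not k\not p\not k\not p']$ for massless CM kinematics, and compares it with $-2su$. All helper names are illustrative.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from four 2x2 blocks [[a, b], [c, d]]."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]

def scale(m, f):
    return [[f * x for x in row] for row in m]

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = ([[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]])

# Dirac representation: gamma^0 = diag(1, 1, -1, -1), gamma^i = [[0, sigma_i], [-sigma_i, 0]]
GAMMA = [block(I2, Z2, Z2, scale(I2, -1))] + [block(Z2, s, scale(s, -1), Z2) for s in SIGMA]
METRIC = (1, -1, -1, -1)

def mdot(a, b):
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))

def slash(p):
    """p-slash = gamma^mu p_mu for a contravariant four-vector p = (E, px, py, pz)."""
    out = [[0j] * 4 for _ in range(4)]
    for mu in range(4):
        for i in range(4):
            for j in range(4):
                out[i][j] += GAMMA[mu][i][j] * METRIC[mu] * p[mu]
    return out

def compton_trace(k, p, p_out):
    """Tr[k-slash p-slash k-slash p'-slash], real for real momenta."""
    m = matmul(matmul(slash(k), slash(p)), matmul(slash(k), slash(p_out)))
    return sum(m[i][i] for i in range(4)).real

def massless_cm(energy, theta):
    """Momenta k, p (incoming) and k', p' (outgoing) in the CM frame, all massless."""
    k = (energy, 0.0, 0.0, energy)
    p = (energy, 0.0, 0.0, -energy)
    k_out = (energy, energy * math.sin(theta), 0.0, energy * math.cos(theta))
    p_out = (energy, -energy * math.sin(theta), 0.0, -energy * math.cos(theta))
    return k, p, k_out, p_out

if __name__ == "__main__":
    k, p, k_out, p_out = massless_cm(1.3, 0.7)
    s = 2.0 * mdot(k, p)
    u = -2.0 * mdot(k, p_out)
    print(compton_trace(k, p, p_out), -2.0 * s * u)
```

Both numbers agree, which confirms the sign of the $-2e^4su$ result and hence the positivity of $-u/s$ for physical kinematics ($u < 0$).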



When Eq. (7.51) is written as $\varepsilon'^*_\mu\,\mathcal M^{\mu\nu}\,\varepsilon_\nu$, gauge symmetry requires (Sect. 5.3.4) invariance under the transformations

$$ \varepsilon_\nu \to \varepsilon_\nu + c\,k_\nu, \qquad \varepsilon'_\mu \to \varepsilon'_\mu + c'\,k'_\mu $$

Therefore the following equalities should hold:

$$ k'_\mu\,\mathcal M^{\mu\nu} = k_\nu\,\mathcal M^{\mu\nu} = 0 $$ (7.59)

Prove the equality explicitly using Eq. (7.51).

Problem 7.4

Show that gauge invariance does not hold with either $\mathcal M_a$ or $\mathcal M_b$ alone, but holds with both terms added together.



Show that if we assume that the incoming photon is virtual and has mass $k^2 = -Q^2$, then the interference term does not vanish, and Eq. (7.55a) becomes



This equation appears frequently in QCD. 



If we assume the electron mass is nonzero, the calculation can be carried out in the LAB frame. Writing the variables as

$$ k = (\omega, \boldsymbol k), \qquad k' = (\omega', \boldsymbol k'), \qquad p = (m, \boldsymbol 0), \qquad \varepsilon = (0, \boldsymbol\varepsilon) $$

then the following equalities are satisfied:

$$ \varepsilon\cdot p = \varepsilon'\cdot p = \varepsilon\cdot k = \varepsilon'\cdot k' = 0 $$ (7.61a)

$$ \varepsilon\cdot p' = -\varepsilon\cdot k', \qquad \varepsilon'\cdot p' = \varepsilon'\cdot k $$ (7.61b)

Taking the average of the electron spin but leaving the photon polarization as it is, derive after a straightforward but long calculation

This is known as the Klein-Nishina formula.
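
As a numerical illustration, the sketch below evaluates the polarization-averaged Klein-Nishina cross section in its standard textbook form, $d\sigma/d\Omega = (\alpha^2/2m^2)(\omega'/\omega)^2[\omega'/\omega + \omega/\omega' - \sin^2\theta]$ with $\omega' = \omega/[1 + (\omega/m)(1-\cos\theta)]$. This unpolarized form is quoted here as an assumption from general knowledge rather than taken from the problem statement above, and the function names are illustrative. Natural units are used, so cross sections come out in units of 1/energy².

```python
import math

ALPHA = 1.0 / 137.035999  # fine-structure constant (rounded)

def scattered_energy(omega, theta, m):
    """Compton relation for the outgoing photon energy in the LAB frame."""
    return omega / (1.0 + (omega / m) * (1.0 - math.cos(theta)))

def klein_nishina(omega, theta, m):
    """Unpolarized d(sigma)/d(Omega) in natural units (assumed standard form)."""
    r = scattered_energy(omega, theta, m) / omega
    return ALPHA ** 2 / (2.0 * m * m) * r * r * (r + 1.0 / r - math.sin(theta) ** 2)

def total_cross_section(omega, m, steps=20000):
    """Midpoint-rule integral over the solid angle."""
    h = 2.0 / steps
    total = 0.0
    for i in range(steps):
        c = -1.0 + (i + 0.5) * h
        total += klein_nishina(omega, math.acos(c), m) * h
    return 2.0 * math.pi * total

def thomson(m):
    """Low-energy (Thomson) limit 8 pi alpha^2 / (3 m^2)."""
    return 8.0 * math.pi * ALPHA ** 2 / (3.0 * m * m)

if __name__ == "__main__":
    m = 0.511  # electron mass in MeV
    for omega in (1e-6 * m, m, 10.0 * m):
        print(omega / m, total_cross_section(omega, m) / thomson(m))
```

The ratio to the Thomson cross section tends to 1 at low photon energy and falls steadily once $\omega$ becomes comparable to $m$.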



Scattering Amplitude

Bremsstrahlung (braking radiation in German) is a process in which a charged particle emits a photon when its path is bent by the electric field of the nucleus. It can be considered as Compton scattering by a virtual photon ($\gamma^*$) produced by the nucleus:

$$ \gamma^*(q) + e(p_i) \to \gamma(k) + e(p_f) $$ (7.63)

The scattering amplitude is obtained from that of the Compton scattering Eq. (7.51) by replacing the incoming photon (q) with an external field $A^\mu = (Ze/4\pi r, 0, 0, 0)$.

. s _;„., Ze Ze f 4 lq . y d u0 6(q°) 

9 = Pf-Pi + k 
S fi = +27td(Ei- E f - w)j^u(Pf) 

X [^^V7TJ^ {iey0) + ^p^J^n^] "W 
= 2md(Ei - E f - w)u(p f )eru( Pl )A° (7.65) 

* ( p f+k) 2_ m 2r +r {pi _ k) 2_ m 2? ' 

o _ Ze2 

A ~~w
The unpolarized cross section becomes

da( Pi -» p f + k) = 2jtd(E t -E f - ^^-yju^utf 

^_v 1J _AL (7M) 

X |g| 4 2£ / (2^) 3 2«(2jt) 3 

where $\sum$ takes the average of the initial and the sum of the final electron spin states and the sum of the photon polarization states. The trace we want to calculate is



Zr^--\ E ?((r^ r 



X (/; + in) 




•{?%',? <-<*il\ m ?)<*+- 


"J 


= 5 (T, + T 2 + T,) 


(7.67) 


We have rewritten the denominator using the relation

$$ (p_f + k)^2 - m^2 = 2(k\cdot p_f), \qquad (p_i - k)^2 - m^2 = -2(k\cdot p_i) $$ (7.68)

Choosing $\varepsilon^0 = 0$, the calculation of the traces yields
Ti = (Pfk)- 2 J2^ 

x {/(//+ £ + m)y V> + m)y V/+ £ + m) /(// + m)} 
= 8(fe- P/ )- 2 

x £ [2(e • p f ) 2 {m 2 + 2EiE f + lE.cn - (p { ■ p f ) - (k ■ Pl )\ 

+2(e • p f )(E ■ Pi )(k ■ Pf ) + 2E t a(k ■ p f ) - (k • Pi )(k ■ p f )] (y 69a) 
T 2 = T x ( Pl o -p f ) (7.69b) 

T 3 = -(p f ■ k)~ l ( Pi ■ k)- 1 £>[y V~ ]i + m) ftf + m)y° 

x tf f + ji+m) M f + m) + fa o -p f )\ 

= I6(k • ?/) _1 (k • Pi)" 1 E M e " *')(* " ^/>{( fe " Pi) ~ < fe " ?/) 
+ 2(p l -p f )-4E l Ef-2m 2 } 
+ (e ■ p f ) 2 (k ■ Pi ) - (e ■ Pl ) 2 (k ■ p f ) + (k ■ Vi )(k ■ p f ) - m 2 oj 2 



»{o»(j> ( -j. / )-B i (p / -k)-E / (k-p i )}
Figure 7.5 Bremsstrahlung. The emitted photon is in the z direction and the emitted electron is in the y-z plane. The incoming electron is in a plane rotated by angle $\varphi$ from the y-z plane.

The reaction does not take place in a plane, so we choose a coordinate frame as described in Fig. 7.5.

We take the z-axis along the photon momentum $\boldsymbol k$, the emitted electron momentum $\boldsymbol p_f$ in the y-z plane and the incoming electron momentum $\boldsymbol p_i$ in a plane rotated by an angle $\varphi$ from the y-z plane. In this coordinate system, the parameters are

$$ k^\mu = (\omega, 0, 0, k), \qquad \omega = |\boldsymbol k| = k $$
$$ p_f^\mu = (E_f, 0, p_f\sin\theta_f, p_f\cos\theta_f) $$
$$ p_i^\mu = (E_i, p_i\sin\theta_i\sin\varphi, p_i\sin\theta_i\cos\varphi, p_i\cos\theta_i) $$ (7.70)
$$ \varepsilon^1 = (0, 1, 0, 0), \qquad \varepsilon^2 = (0, 0, 1, 0) $$


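where $p_i = |\boldsymbol p_i|$, $p_f = |\boldsymbol p_f|$. Then the cross section is given by

$$ d\sigma = \frac{Z^2\alpha^3}{4\pi^2}\,\frac{p_f}{p_i\,|\boldsymbol q|^4}\,\frac{d\omega}{\omega}\,d\Omega_k\,d\Omega_f\;\Bigg[\frac{p_f^2\sin^2\theta_f}{(E_f - p_f\cos\theta_f)^2}\,(4E_i^2 - |\boldsymbol q|^2) + \frac{p_i^2\sin^2\theta_i}{(E_i - p_i\cos\theta_i)^2}\,(4E_f^2 - |\boldsymbol q|^2) $$
$$ \qquad + 2\omega^2\,\frac{p_i^2\sin^2\theta_i + p_f^2\sin^2\theta_f}{(E_f - p_f\cos\theta_f)(E_i - p_i\cos\theta_i)} - 2\,\frac{p_ip_f\sin\theta_i\sin\theta_f\cos\varphi}{(E_f - p_f\cos\theta_f)(E_i - p_i\cos\theta_i)}\,(4E_iE_f - |\boldsymbol q|^2 + 2\omega^2)\Bigg] $$

This is known as the Bethe-Heitler formula.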



7.3.1 

Soft Bremsstrahlung 

Now we consider the process of soft Bremsstrahlung. Here we are only interested in the soft photon limit $\omega = k^0 \to 0$. Then the photon momentum k in the numerator of the scattering amplitude Eq. (7.65) can be ignored. Then using the relations

$$ (\not p_i + m)\,\not\varepsilon^*(k)\,u(p_i) = \left[2(\varepsilon^*\cdot p_i) - \not\varepsilon^*(\not p_i - m)\right]u(p_i) = 2(\varepsilon^*\cdot p_i)\,u(p_i) $$
$$ \bar u(p_f)\,\not\varepsilon^*(k)\,(\not p_f + m) = \bar u(p_f)\left[2(\varepsilon^*\cdot p_f) - (\not p_f - m)\not\varepsilon^*\right] = 2\,\bar u(p_f)\,(\varepsilon^*\cdot p_f) $$ (7.72)

the Bremsstrahlung amplitude can be reexpressed as
S fi = 2jzid(E, -E f - co)u(pf)Y°u( P )A 

JJ (£*-Pf) (e*-Pi) Y\ A o = _^! < 7 ' 73 > 

L \(k- P f) (k- Pi ))\ \q\ 2 

If we approximate $q = p_f - p_i + k \simeq p_f - p_i$ in $A^0$, this is just the elastic (i.e. Mott) scattering amplitude [see Eq. (6.136)] times a factor for emission of a photon. Thus the Bremsstrahlung cross section for soft photon emission is the Mott scattering cross section times the probability $P_\gamma$ of emitting a photon:

$$ d\sigma[e(p_i) \to e(p_f) + \gamma(k)] = d\sigma_{\rm Mott}(p_i \to p_f)\,dP_\gamma $$

$$ dP_\gamma = \frac{d^3k}{2\omega\,(2\pi)^3}\;e^2\sum_{\lambda=1,2}\left|\frac{\varepsilon^*\cdot p_f}{k\cdot p_f} - \frac{\varepsilon^*\cdot p_i}{k\cdot p_i}\right|^2 $$

Since the polarization sum can be extended to all $\lambda$

$$ P_\gamma = \frac{\alpha}{\pi}\int\frac{d\omega}{\omega}\int\frac{d\Omega}{4\pi}\left[\frac{2\,(p_i\cdot p_f)}{(\hat k\cdot p_i)(\hat k\cdot p_f)} - \left\{\frac{m^2}{(\hat k\cdot p_i)^2} + \frac{m^2}{(\hat k\cdot p_f)^2}\right\}\right] $$ (7.75)

where $\hat k = k/|\boldsymbol k|$. The integrand goes to $\infty$ as $\omega \to 0$. This is known as the infrared divergence. The resolution of this question comes when the one-loop corrections are added properly. We shall discuss the infrared divergence in detail in Sect. 8.1.3. Note that the number of photons increases as $\omega \to 0$, but the emitted energy remains finite. Here, we simply cut off at $\omega = \mu$, assuming the photon has a small finite mass. The total photon emission probability also diverges formally if integrated over all the momentum range. But remember we have limited $|\boldsymbol k| \ll |\boldsymbol q| \simeq |\boldsymbol p_i - \boldsymbol p_f|$ in making the approximation, so the integral should have an upper limit around $|\boldsymbol q|$. Terms in the braces {···} are easily calculated to give

/dO m 2 m 2 C +l 1 
-^ = ^ dcosO -i - = 1 (7.76) 

4jt {k -p) 2 2£ 2 i_! (l-^cosfl) 2 v ' 
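The normalization in Eq. (7.76) can be checked numerically. The following is a minimal sketch, assuming only the relation $m^2/E^2 = 1-\beta^2$; the function name `angular_integral` and the midpoint rule are illustrative choices, not part of the text.

```python
import math

def angular_integral(beta: float, steps: int = 200_000) -> float:
    """Midpoint-rule estimate of (m^2 / 2E^2) * integral of dcos(theta) / (1 - beta*cos(theta))^2.

    Uses m^2/E^2 = 1 - beta^2, so the result depends on beta only.
    """
    m2_over_e2 = 1.0 - beta * beta
    h = 2.0 / steps                      # step in cos(theta) over [-1, 1]
    total = 0.0
    for i in range(steps):
        c = -1.0 + (i + 0.5) * h         # midpoint of the i-th cell
        total += h / (1.0 - beta * c) ** 2
    return 0.5 * m2_over_e2 * total

if __name__ == "__main__":
    for beta in (0.1, 0.5, 0.99):
        print(beta, angular_integral(beta))
```

The value stays at 1 for slow and fast electrons alike, which is why the two terms in the braces of Eq. (7.75) together contribute exactly $-1$ inside the square brackets.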

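The first term in square brackets $[\cdots]$ can be integrated using Feynman's parameter trick

$$
\frac{1}{ab} = \int_0^1\frac{dx}{[ax + b(1-x)]^2}
$$

Writing $(\hat k\cdot p_i) = E_i(1 - \hat{\boldsymbol k}\cdot\boldsymbol\beta_i)$, $\cdots$, and using $E_i \simeq E_f$, $|\boldsymbol\beta_i| \simeq |\boldsymbol\beta_f| = \beta$ in the soft photon limit,

$$
\begin{aligned}
\int\frac{d\Omega}{4\pi}\,\frac{(p_i\cdot p_f)}{(\hat k\cdot p_i)(\hat k\cdot p_f)}
&= \int\frac{d\Omega}{4\pi}\,\frac{1-\boldsymbol\beta_i\cdot\boldsymbol\beta_f}{(1-\hat{\boldsymbol k}\cdot\boldsymbol\beta_i)(1-\hat{\boldsymbol k}\cdot\boldsymbol\beta_f)}\\
&= \int_0^1 dx\int\frac{d\Omega}{4\pi}\,\frac{1-\boldsymbol\beta_i\cdot\boldsymbol\beta_f}{[\,1-\hat{\boldsymbol k}\cdot\{\boldsymbol\beta_f x+\boldsymbol\beta_i(1-x)\}\,]^2}\\
&= \int_0^1 dx\,\frac{1-\boldsymbol\beta_i\cdot\boldsymbol\beta_f}{1-|\boldsymbol\beta_f x+\boldsymbol\beta_i(1-x)|^2}\\
&= \int_0^1 dx\,\frac{1-\beta^2\cos\theta}{1-\beta^2+4\beta^2\sin^2(\theta/2)\,x(1-x)}\\
&= \int_0^1 dx\,\frac{m^2-q^2/2}{m^2-q^2x(1-x)}\\
&= \begin{cases}
1+[-q^2/(3m^2)]+O\big((-q^2/m^2)^2\big) & m^2/(-q^2)\gg 1\\[4pt]
\ln(-q^2/m^2)+O\big(m^2/(-q^2)\big) & m^2/(-q^2)\ll 1
\end{cases}\\
-q^2 &= -(p_f-p_i)^2 = |\boldsymbol q|^2 = 4p^2\sin^2(\theta/2)
\end{aligned}
$$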

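where the integration over $\Omega_k$ was carried out choosing the z-axis along $\boldsymbol\beta_f x + \boldsymbol\beta_i(1-x)$. The soft Bremsstrahlung cross section is then

$$
\begin{aligned}
d\sigma &= d\sigma_{\rm Mott}\,\frac{2\alpha}{\pi}\,\ln\frac{\omega_{\max}}{\mu}
\left[\int_0^1 dx\,\frac{m^2-q^2/2}{m^2-q^2x(1-x)} - 1\right]\\
\left[\int_0^1 dx\,\frac{m^2-q^2/2}{m^2-q^2x(1-x)} - 1\right]
&= \begin{cases}
\dfrac{-q^2}{3m^2}+O\!\left(\dfrac{q^4}{m^4}\right) & \text{Nonrelativistic}\\[8pt]
\ln\!\left(\dfrac{-q^2}{m^2}\right)-1+O\!\left(\dfrac{m^2}{q^2}\right) & \text{Extremely relativistic}
\end{cases}
\end{aligned}
\tag{7.79}
$$

where $\omega_{\max} \sim |\boldsymbol q|$ is the upper limit of the soft photon energy discussed above.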
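The two limits quoted for the Feynman-parameter integral can be checked numerically. The sketch below assumes the dimensionless variable $t = -q^2/m^2$, so that the integral becomes $\int_0^1 dx\,(1+t/2)/(1+t\,x(1-x))$; the function name and the midpoint rule are illustrative choices.

```python
import math

def feynman_parameter_integral(t: float, steps: int = 200_000) -> float:
    """Midpoint-rule estimate of the integral over x in [0, 1] of
    (1 + t/2) / (1 + t*x*(1-x)), where t = -q^2/m^2 >= 0."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += h * (1.0 + 0.5 * t) / (1.0 + t * x * (1.0 - x))
    return total

if __name__ == "__main__":
    # Nonrelativistic limit: expect 1 + t/3
    t = 0.01
    print(feynman_parameter_integral(t), 1.0 + t / 3.0)
    # Extremely relativistic limit: expect ln(t) up to O(1/t) corrections
    t = 1.0e4
    print(feynman_parameter_integral(t), math.log(t))
```

Subtracting 1 from the result reproduces the bracket in Eq. (7.79): about $-q^2/(3m^2)$ for small momentum transfer and about $\ln(-q^2/m^2) - 1$ for large momentum transfer.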

The angular distribution of the emitted photon is interesting. When the electron energy is much larger than its mass, i.e. $\beta \to 1$, the denominators of the expressions in the bracket of Eq. (7.75) go to zero as $\theta \to 0$ and the radiation is strongly peaked forward:

$$
1-\beta\cos\theta \simeq \frac{1}{\gamma^2(1+\beta)} + \frac{\theta^2}{2} \simeq \frac{1}{2\gamma^2}\left[1+(\gamma\theta)^2\right]
\tag{7.80}
$$

which means about 1/2 of the total radiation is within a narrow angle of $\theta \lesssim 1/\gamma$. The emission, therefore, is strongly concentrated in the initial and the final electron directions.
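The statement that about half of the radiation lies within $\theta < 1/\gamma$ can be illustrated with the angular weight $m^2/(\hat k\cdot p)^2$ of Eq. (7.76). The sketch below uses the elementary antiderivative $\int dc/(1-\beta c)^2 = 1/[\beta(1-\beta c)]$; the function name `fraction_within_cone` is an illustrative assumption.

```python
import math

def fraction_within_cone(gamma: float) -> float:
    """Fraction of the integral of m^2/(k_hat.p)^2 over solid angle that comes
    from polar angles theta < 1/gamma around the electron direction."""
    beta = math.sqrt(1.0 - 1.0 / gamma**2)
    theta0 = 1.0 / gamma

    def antiderivative(c: float) -> float:
        # d/dc [ 1 / (beta * (1 - beta*c)) ] = 1 / (1 - beta*c)^2
        return 1.0 / (beta * (1.0 - beta * c))

    inside = antiderivative(1.0) - antiderivative(math.cos(theta0))
    everything = antiderivative(1.0) - antiderivative(-1.0)
    return inside / everything

if __name__ == "__main__":
    for gamma in (10.0, 100.0, 1000.0):
        print(gamma, fraction_within_cone(gamma))
```

For large $\gamma$ the fraction approaches 1/2, independent of the electron energy, which is the sense in which the cone $\theta \lesssim 1/\gamma$ carries about half of the emission.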

Note, the soft photon emission probability Eq. (7.79) factorizes as the product of the elastic scattering and the photon emission probability. In the soft photon limit, the momentum loss of the electron is negligible. Then the electron does not change its state after the soft photon emission and the emission probability of the second soft photon is the same as the first. This means the soft emission is statistically independent of the elastic scattering and we expect a Poisson distribution for the n-photon emission probability. The phenomenon is treated in more detail in Sect. 8.1.4. The electron current is then a classical current source of the photon emission and we expect a semi-classical treatment would produce the same result.

Problem 7.7 

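Consider a classical source where the momentum of a moving charged particle changes instantaneously at $t = 0$. The current can be expressed as

$$
j^\mu(x) = e\int d\tau\,\frac{dy^\mu(\tau)}{d\tau}\,\delta^4\big(x-y(\tau)\big),
\qquad
y^\mu(\tau) = \begin{cases}(p_i^\mu/m)\,\tau & t < 0\\ (p_f^\mu/m)\,\tau & t > 0\end{cases}
\tag{7.81}
$$

(1) Show that

$$
\tilde j^\mu(k) = \int d^4x\,e^{ik\cdot x}\,j^\mu(x) = ie\left[\frac{p_f^\mu}{k\cdot p_f} - \frac{p_i^\mu}{k\cdot p_i}\right]
\tag{7.82}
$$

We may assume the integrand of Eq. (7.81) is multiplied by $e^{-\epsilon|\tau|}$ to assure convergence.

(2) Show that the one-photon emission probability agrees with $dP_\gamma$ given in Eq. (7.74).

Hint: Calculate $S_{fi} - \delta_{fi} = -i\int d^4x\,\langle\gamma(k)|A_\mu(x)|0\rangle\,j^\mu(x)$, treating the photon quantum mechanically.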



… the emitted photon energy would be limited to $\omega < 1/\Delta t$, providing a natural cutoff photon energy.

Feynman Rules

Until now, we have used Feynman diagrams to aid calculation, but there is a deeper, in fact one-to-one, correspondence between the diagrams and the scattering amplitudes. With some experience, the calculations are made much easier. We summarize here the correspondence so that one can reconstruct the scattering amplitude easily by simply looking at the Feynman diagrams. As the scattering amplitude is expressed as $S = T\exp\left(-i\int d^4x\,\mathcal H_I\right)$, once we know the interaction Hamiltonian, the corresponding terms are uniquely defined. The electromagnetic interaction Hamiltonian is obtained from the Lagrangian ($\mathcal H_I = -\mathcal L_I$) by the replacement $\partial_\mu \to \partial_\mu + iqA_\mu$:

$$
\begin{aligned}
\text{spin 1/2 particle}\quad & -i\mathcal H_I = -iq\,\bar\psi(x)\gamma^\mu\psi(x)A_\mu(x) && (7.83\text{a})\\
\text{scalar particle}\quad & -i\mathcal H_I = qA_\mu\big[\varphi^\dagger\partial^\mu\varphi - (\partial^\mu\varphi^\dagger)\varphi\big] + ie^2A_\mu A^\mu\varphi^\dagger\varphi && (7.83\text{b})
\end{aligned}
$$

(1) Feynman diagrams First draw all the topologically different diagrams for the 
desired process. 

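(2) External lines (Fig. 7.6)

Figure 7.6 Basic elements of the Feynman diagram: external lines. (a) Annihilation of incoming particles. (b) Creation of outgoing particles.

Assign wave functions to the external lines (incoming or outgoing particles):

$$
\begin{array}{lll}
 & \text{annihilation} & \text{creation}\\
\text{scalar boson} & \varphi(x) \to e^{-ik\cdot x} & \varphi^\dagger(x) \to e^{ik'\cdot x}\\
\text{electron} & \psi(x) \to u(p)\,e^{-ip\cdot x} & \bar\psi(x) \to \bar u(p')\,e^{ip'\cdot x}\\
\text{positron} & \bar\psi(x) \to \bar v(p)\,e^{-ip\cdot x} & \psi(x) \to v(p')\,e^{ip'\cdot x}\\
\text{photon} & A_\mu(x) \to \epsilon_\mu(k)\,e^{-ik\cdot x} & A_\mu(x) \to \epsilon_\mu(k')^*\,e^{ik'\cdot x}
\end{array}
$$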



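(3) Vertices (Fig. 7.7)

Figure 7.7 Basic elements of the Feynman diagram: vertices.

To the vertices assign

$$
\begin{array}{ll}
\text{scalar-photon 3-point vertex} & \to\ -iq(k+k')^\mu\\
\text{scalar-photon 4-point vertex} & \to\ 2iq^2g^{\mu\nu}\\
\text{electron-photon 3-point vertex} & \to\ -iq\gamma^\mu
\end{array}
$$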



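(4) Internal lines (Fig. 7.8) To the internal lines that connect two vertices, assign the Feynman propagators

Figure 7.8 Basic elements of the Feynman diagram: internal lines (Feynman propagators).

$$
\begin{array}{lll}
\text{scalar} & \langle 0|T[\varphi(x)\varphi^\dagger(y)]|0\rangle & \to\ i\Delta_F(k^2) = \dfrac{i}{k^2-m^2+i\varepsilon}\\[8pt]
\text{photon} & \langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle & \to\ iD_F(k^2) = \dfrac{-ig_{\mu\nu}}{k^2+i\varepsilon}\\[8pt]
\text{electron} & \langle 0|T[\psi(x)\bar\psi(y)]|0\rangle & \to\ iS_F(p) = \dfrac{i(\slashed p+m)}{p^2-m^2+i\varepsilon}\\[8pt]
\text{massive vector}^{2)} & \langle 0|T[W_\mu(x)W_\nu^\dagger(y)]|0\rangle & \to\ i\Delta_F^{\mu\nu}(k^2) = \dfrac{-i(g^{\mu\nu}-k^\mu k^\nu/m^2)}{k^2-m^2+i\varepsilon}
\end{array}
$$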



(5) Momentum assignment (Fig. 7.9)

Figure 7.9 Momentum assignment to loop Feynman diagrams, panels (a)-(c). From left to right: vertex correction, photon self-energy and fermion self-energy. For the fermion loops $-1$ should be multiplied for each loop. The variable $k$ in the figure is not constrained by $\delta$ functions at vertices, hence has to be integrated.

Assign $k_i$ to every boson and $p_i$ to every fermion.

external lines → determined by experimental conditions.

internal lines → Starting from external lines, assign $k_i$, $p_i$ to all the internal lines so as to make the total sum $= 0$ at the vertices.

loops → $\displaystyle\int\frac{d^4k}{(2\pi)^4}$

fermion loops → multiply by $-1$ for each fermion loop.

The momentum can be assigned to all the internal lines starting from the external lines, whose momenta are decided by experimental conditions. Space-time integrals produce $\delta$-functions at each vertex, which ensure that the sum of the energy momentum is conserved. The $\delta$-functions constrain all the internal momenta in tree diagrams 3) and disappear by integration, except the one that ensures the conservation of overall energy-momentum. When loops appear as higher order effects, their internal momentum is not restricted by the $\delta$-function and has to be integrated, hence the factor $\int d^4k/(2\pi)^4$. Examples are shown in Fig. 7.9a-c, where $k$ has to be integrated. For the fermion loops, $-1$ should be multiplied for each loop.

2) The difference from the photon is its mass and longitudinal polarization, which gives an extra term in the numerator of the propagator.

3) Tree diagrams are those which do not contain loop diagrams.
Radiative Corrections and Renormalization*

So far we have calculated the matrix elements only in the lowest order (tree approx- 
imation). Most of the essential features of the process are in it, and it is not the 
author's intention to get involved with a theoretical labyrinth by going into high- 
er orders. However, the appearance of the infinity was inevitable when we tried to 
calculate higher order corrections in the perturbation series. It is an inherent dif- 
ficulty deeply rooted in the structure of quantized field theory. We should at least 
understand the concept of how those infinities are circumvented to make QED a 
self-consistent mathematical framework. We may as well say that the history of 
theoretical particle physics has been the struggle to cope with and overcome the 
infinity problem. The quest is still going on. The standard remedy is "renormaliza- 
tion", which extracts finite results by subtracting infinity from infinity. The process 
is rather artificial, but has stood the long and severe test of observations. Today, 
renormalizability is considered an essential ingredient for the right theory. The 
higher order corrections are indispensable in comparing theoretical predictions 
with precision experiments and serve as crucial tests for the internal consistency 
of the theory. The author considers it important that the reader understand at least 
the philosophy behind it. For this purpose, we describe the essence of the radiative
corrections in order to understand the renormalization prescription. 1)

8.1.1 

Vertex Correction 

We consider first the effect of radiative corrections on the strength of the coupling 
constant. The strength of the electromagnetic constant is typically measured by a 
low-energy elastic scattering by nuclei referred to as Mott scattering. The scattering 

1) This chapter deals with the self-consistency of QED. The discussion is mathematical and rather 
formal, except for Sect. 8.2, which summarizes the validity of QED. Readers may skip this chapter; 
it is not compulsory for understanding later chapters. They may come back when they have 
gained enough understanding and when they are inclined to. 
matrix element can be expressed as [see Eq. (6.136)]



- iq u { p')yfu(p)^p[qj v (q)] 



■1) 



f is the source of an external field or that created by a static scattering center, 
which is symbolically denoted by a cross in Fig. 8.1. 

So far we have treated the interaction of an electron with the field only in the 
lowest order, which is depicted as diagram (b). Higher order effects include the 
vertex correction (c), the photon propagator correction (d), and electron propagator 
corrections (e,f). Emission of very low energy photons (g,h) is physically indistin- 
guishable from the process that radiates none and should be included in a consis- 
tent treatment. We first consider the vertex correction (c) to the process. Applying 
the Feynman rules described in the previous section, —iqy" in Eq. (8.1) has an 
extra contribution 



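$$
\begin{aligned}
-iq\gamma^\mu &\to -iq(\gamma^\mu + \Gamma^\mu)\\
-iq\Gamma^\mu &= \int\frac{d^4k}{(2\pi)^4}\,\frac{-ig_{\rho\sigma}}{k^2+i\varepsilon}\,
(-iq\gamma^\rho)\,\frac{i(\slashed p'-\slashed k+m)}{(p'-k)^2-m^2+i\varepsilon}\,
(-iq\gamma^\mu)\,\frac{i(\slashed p-\slashed k+m)}{(p-k)^2-m^2+i\varepsilon}\,(-iq\gamma^\sigma)
\end{aligned}
\tag{8.2}
$$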



As one can see easily by counting the powers of $k$, the integral diverges like $\sim \log k^2$ as $k \to \infty$, which is called the ultraviolet divergence. It also diverges as $k \to 0$, which is called the infrared divergence. The latter has a logical and physically justifiable interpretation and hence is not a fundamental difficulty.



Figure 8.1 Corrections to the vertex function (QED vertex higher order corrections, diagrams (a)-(h)). The effective vertex including overall effects is symbolically represented as the black circle in diagram (a), which in the lowest order is given by the vertex in (b). Higher order effects include the vertex correction (c), the photon propagator correction (d), and the fermion propagator corrections (e, f). Emissions of very low energy photons (g, h) are physically indistinguishable from the process that radiates none. Two kinds of divergences, ultraviolet and infrared, occur once diagrams involving loops are included. The former is treated by renormalization and the latter is canceled by (g, h).
8.1.2

Ultraviolet Divergence

The ultraviolet divergence is inherently connected with the infinite degrees of freedom of the field and rather complicated to handle. A prescription is the renormalization developed by Tomonaga, Schwinger and Feynman. 2) To understand it, let us calculate Eq. (8.2) and see how the difficulties arise. Throughout this calculation we tacitly assume that the vertex function is sandwiched by two on-shell spinors. A current operator sandwiched by on-shell spinors is generally expressed as

$$
\begin{aligned}
\langle p'|j^\mu(x)|p\rangle &\sim \langle p'|j^\mu(q)|p\rangle\,e^{iq\cdot x}, \qquad q^\mu = p'^\mu - p^\mu\\
&\sim \bar u(p')\left[\gamma^\mu F_1(q^2) + \frac{i\sigma^{\mu\nu}q_\nu}{2m}F_2(q^2)\right]u(p)\,e^{iq\cdot x}
\end{aligned}
\tag{8.3}
$$

From Lorentz invariance alone, terms proportional to $q^\mu$ and $p'^\mu + p^\mu$ should exist. But the current conservation constrains

$$
\partial_\mu j^\mu(x) \sim q_\mu j^\mu(q) = 0
\tag{8.4}
$$

So there should be no term proportional to $q^\mu$. $\gamma^\mu$ sandwiched with $\bar u(p')$ and $u(p)$ satisfies

$$
\begin{aligned}
2m\gamma^\mu &= \slashed p'\gamma^\mu + \gamma^\mu\slashed p = 2p'^\mu - \gamma^\mu(\slashed p' - \slashed p) = 2p'^\mu - \gamma^\mu\slashed q\\
&= 2p^\mu + (\slashed p' - \slashed p)\gamma^\mu = 2p^\mu + \slashed q\gamma^\mu = (p'^\mu + p^\mu) - \frac{1}{2}[\gamma^\mu, \slashed q]\\
&= (p'^\mu + p^\mu) + i\,\frac{i}{2}[\gamma^\mu, \gamma^\nu]\,q_\nu = (p'^\mu + p^\mu) + i\sigma^{\mu\nu}q_\nu
\end{aligned}
\tag{8.5}
$$

where we have used $\slashed p\,u(p) = m\,u(p)$ and $\bar u(p')\,\slashed p' = m\,\bar u(p')$. The relation is called the Gordon identity and shows that $p'^\mu + p^\mu$ can be expressed in terms of $\gamma^\mu$ and $i\sigma^{\mu\nu}q_\nu$. In the static limit, $p'^\mu = p^\mu = (m, \boldsymbol 0)$, $q^2 \to 0$, $iq_\mu = (0, -i\boldsymbol q) \sim (0, \nabla)$, and $\gamma^\mu$ coupled with the electromagnetic field reduces to



,<,./ 



(p'\ej d i xe^xp(x)y'^{x)A fl (x)\p) ► \e<p + —a- B\ (8.6) 

This expression means $\gamma^\mu$ includes both the electric charge and the magnetic moment of a Dirac particle with $g = 2$. Then $F_1(q^2)$ and $F_2(q^2)$ describe the dynamic part of the charge and the anomalous magnetic moment, which arise from the inner structure of the particle. They are constrained by Eq. (8.6)

$$
F_1(0) = 1, \qquad F_2(0) = \frac{g-2}{2}
\tag{8.7}
$$

From the above argument we expect $\Gamma^\mu$ defined by Eq. (8.2) to have the same structure as Eq. (8.3) with $F_1(0) = 0$ because $\Gamma^\mu$ is defined as a correction to $\gamma^\mu$.

2) Any standard textbook on quantized field theory has a detailed description of it. We quote some 
representative books: [63, 64, 215, 322, 338]. 
In order to calculate Eq. (8.2), we begin with the Feynman parameter trick:

$$
\frac{1}{a_1a_2a_3} = 2\int_0^1 dz_1\,dz_2\,dz_3\,\frac{\delta(1-z_1-z_2-z_3)}{[a_1z_1+a_2z_2+a_3z_3]^3}
$$

$$
a_1 = k^2-\mu^2+i\varepsilon, \quad a_2 = (p'-k)^2-m^2+i\varepsilon, \quad a_3 = (p-k)^2-m^2+i\varepsilon
\tag{8.9}
$$

where we have introduced a small photon mass $\mu$ temporarily to avoid the infrared divergence. Then Eq. (8.2) is expressed as

$$
\Gamma^\mu = -2ie^2\int\frac{d^4k}{(2\pi)^4}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\,\frac{N^\mu(k)}{(k'^2-M^2+i\varepsilon)^3}
\tag{8.10a}
$$

$$
\begin{aligned}
N^\mu(k) &= \gamma_\nu(\slashed p'-\slashed k+m)\gamma^\mu(\slashed p-\slashed k+m)\gamma^\nu\\
k' &= k - p'z_2 - pz_3\\
M^2 &= m^2(1-z_1)^2 + \mu^2z_1 - q^2z_2z_3
\end{aligned}
\tag{8.10b}
$$

Now we change the integration variable from $k$ to $k'$, and rewrite the numerator in terms of $k'$. Then terms in odd powers of $k'$ vanish because of symmetry. Rewriting $k'$ as $k$, the numerator becomes

$$
\begin{aligned}
N^\mu(k) \to N^\mu(k+p'z_2+pz_3) &= N_1 + N_2 + (\text{terms linear in } k)\\
N_1 &= -2\,[-k^2\gamma^\mu + 2k^\mu k^\nu\gamma_\nu]\\
N_2 &= \gamma_\nu[\slashed p'(1-z_2) - \slashed p z_3 + m]\,\gamma^\mu\,[-\slashed p'z_2 + \slashed p(1-z_3) + m]\,\gamma^\nu
\end{aligned}
$$

After straightforward but long calculations of the Dirac algebra, using relations

$$
\begin{aligned}
\gamma_\nu\gamma^\mu\gamma^\nu &= -2\gamma^\mu\\
\gamma_\nu\slashed A\slashed B\gamma^\nu &= 4(A\cdot B)\\
\gamma_\nu\slashed A\slashed B\slashed C\gamma^\nu &= -2\slashed C\slashed B\slashed A\\
\gamma^\mu\slashed q &= q^\mu - i\sigma^{\mu\nu}q_\nu\\
\slashed q\gamma^\mu &= q^\mu + i\sigma^{\mu\nu}q_\nu
\end{aligned}
$$

and Gordon decomposition, $N^\mu$ becomes

$$
\begin{aligned}
N^\mu = N_1 + N_2 = -2\,[&-k^2\gamma^\mu + 2k^\mu k^\nu\gamma_\nu + q^2(1-z_2)(1-z_3)\gamma^\mu\\
&+ m^2(1-4z_1+z_1^2)\gamma^\mu + z_1(1-z_1)\,i\sigma^{\mu\nu}mq_\nu\\
&+ (2-z_1)(z_2-z_3)\,mq^\mu\,]
\end{aligned}
$$

The term proportional to $q^\mu$ is forbidden by current conservation. Here, the coefficient of $q^\mu$ is odd for the exchange $z_2 \leftrightarrow z_3$ and therefore vanishes after integration over $z_2$ and $z_3$, as it must. The first two terms are divergent, as can be seen by counting the powers of $k$. Obviously we need to separate the divergent and finite parts and extract a meaningful result out of them. The simplest method is simply to stop integration at some finite cutoff value $\Lambda$. But usually that destroys the built-in symmetry. For QED, the Pauli-Villars regularization method which respects the Lorentz invariance has frequently been used. 3) But for non-Abelian gauge theory it is unsatisfactory because it does not respect the gauge symmetry and the preferred method is dimensional regularization, where the space-time dimensionality is analytically continued from the real physical value, four, to a complex value $D$. For real $D < 4$, the integral of the form $\int d^Dk/(k^2+\cdots)^2$ converges. When we recover the original dimension $D = 4-\epsilon$, $\epsilon \to 0$, the divergence reappears in the form of a pole at $D = 4$, which can be subtracted. Other poles appear at $D = 6, 8, \ldots$, but it turns out that the integral is analytic in the complex $D$-plane and we can make analytic continuation to any complex values except those poles. The relevant formulas in this case are [see Eq. (D.20) and discussions in Appendix D]

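$$
\int\frac{d^Dk}{[k^2-M^2+i\varepsilon]^n} = i\pi^{D/2}(-1)^n\,\frac{\Gamma(n-D/2)}{(M^2)^{n-D/2}\,\Gamma(n)}
\tag{8.15a}
$$

$$
\int\frac{k^2\,d^Dk}{[k^2-M^2+i\varepsilon]^n} = i\pi^{D/2}(-1)^{n-1}\,\frac{D}{2}\,\frac{\Gamma(n-1-D/2)}{(M^2)^{n-1-D/2}\,\Gamma(n)}
\tag{8.15b}
$$

$$
\int\frac{k^\mu k^\nu\,d^Dk}{[k^2-M^2+i\varepsilon]^n} = \frac{g^{\mu\nu}}{D}\int\frac{k^2\,d^Dk}{[k^2-M^2+i\varepsilon]^n}
\tag{8.15c}
$$

where $\Gamma$ is the well-known gamma function defined by

$$
\Gamma(x) = \int_0^\infty dt\,e^{-t}\,t^{x-1}
\tag{8.16}
$$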
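The pole of the gamma function near $D = 4$ is what carries the ultraviolet divergence in dimensional regularization. As a small numerical illustration (a sketch, not part of the derivation), the expansion $\Gamma(\epsilon/2) = 2/\epsilon - \gamma_E + O(\epsilon)$ can be checked with the standard library; the helper name `gamma_pole_remainder` is an illustrative assumption.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler constant gamma_E

def gamma_pole_remainder(eps: float) -> float:
    """Gamma(eps/2) minus its leading behaviour 2/eps - gamma_E.

    The remainder should shrink linearly with eps if the expansion
    Gamma(eps/2) = 2/eps - gamma_E + O(eps) holds.
    """
    return math.gamma(eps / 2.0) - (2.0 / eps - EULER_GAMMA)

if __name__ == "__main__":
    for eps in (1e-1, 1e-2, 1e-3):
        print(eps, math.gamma(eps / 2.0), gamma_pole_remainder(eps))
```

The remainder drops by roughly a factor of ten each time $\epsilon$ does, so the divergent piece is entirely in the $2/\epsilon$ term.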

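Setting $n = 3$, defining an infinitesimal number $\epsilon = 4 - D$ and using

$$
\Gamma\!\left(\frac{\epsilon}{2}\right) = \frac{2}{\epsilon} - \gamma_E + O(\epsilon)
\tag{8.17a}
$$

$$
x^{\epsilon/2} = e^{(\epsilon/2)\ln x} = 1 + \frac{\epsilon}{2}\ln x + O(\epsilon^2)
\tag{8.17b}
$$

we obtain

$$
\int\frac{d^4k\,k^2}{[k^2-M^2+i\varepsilon]^3} = i\pi^2\,[\Delta - \ln M^2], \qquad \Delta = \frac{2}{\epsilon} - \gamma_E + (\text{finite constant})
\tag{8.18a}
$$

$$
\int\frac{d^4k\,k^\mu k^\nu}{[k^2-M^2+i\varepsilon]^3} = \frac{g^{\mu\nu}}{4}\int\frac{d^4k\,k^2}{[k^2-M^2+i\varepsilon]^3}
\tag{8.18b}
$$

$$
\int\frac{d^4k}{[k^2-M^2+i\varepsilon]^3} = -\frac{i\pi^2}{2}\,\frac{1}{M^2}
\tag{8.18c}
$$

where $\gamma_E$ is the Euler constant. 4) The divergent part is included in $\Delta$. Inserting the above equalities, we obtain

3) In this method the propagator is replaced … made finite. After the calculation, $\Lambda \to \infty$ is taken to recover the original form. … terms appear as $\Lambda^2$ or $\ln\Lambda$.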

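$$
\begin{aligned}
\Gamma^\mu = \frac{\alpha}{2\pi}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\Bigg[&(\Delta - \ln M^2)\,\gamma^\mu\\
&+ \gamma^\mu\,\frac{m^2(1-4z_1+z_1^2) + q^2(1-z_2)(1-z_3)}{M^2}\\
&+ \frac{i\sigma^{\mu\nu}q_\nu}{2m}\,\frac{2m^2z_1(1-z_1)}{M^2}\Bigg]
\end{aligned}
\tag{8.20}
$$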

Though we could separate the divergent part, it is by no means unique. If we add some finite part to the infinity, it is still infinity. To determine the infinite part, we note that, in the static limit, the current has to have the form of Eq. (8.6) and the correction to the $\gamma^\mu$ term has to vanish in the limit $q^2 \to 0$. This suggests a prescription to subtract terms evaluated at $q^2 = 0$. Then we can write

$$
\gamma^\mu + \Gamma^\mu \to \gamma^\mu + [\Gamma^\mu(q^2) - \Gamma^\mu(0)] = \gamma^\mu F_1(q^2) + \frac{i\sigma^{\mu\nu}q_\nu}{2m}F_2(q^2)
\tag{8.21a}
$$

$$
\begin{aligned}
F_1(q^2) = 1 + \frac{\alpha}{2\pi}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\Bigg[&\ln\!\left(\frac{m^2(1-z_1)^2+\mu^2z_1}{m^2(1-z_1)^2-q^2z_2z_3+\mu^2z_1}\right)\\
&+ \frac{m^2(1-4z_1+z_1^2)+q^2(1-z_2)(1-z_3)}{m^2(1-z_1)^2-q^2z_2z_3+\mu^2z_1}\\
&- \frac{m^2(1-4z_1+z_1^2)}{m^2(1-z_1)^2+\mu^2z_1}\Bigg]
\end{aligned}
\tag{8.21b}
$$

$$
F_2(q^2) = \frac{\alpha}{2\pi}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\left[\frac{2m^2z_1(1-z_1)}{m^2(1-z_1)^2+\mu^2z_1-q^2z_2z_3-i\varepsilon}\right]
\tag{8.21c}
$$

The expressions no longer contain the ultraviolet divergence. The treatment of the finite part of $F_1(q^2)$ will be postponed until we discuss corrections to the electron propagator. The calculation of $F_2(q^2)$ is easier. It does not include the ultraviolet or infrared divergence. We can safely set $\mu = 0$. To evaluate the static anomalous magnetic moment, we can set $q^2 = 0$ and

$$
F_2(0) = \frac{\alpha}{2\pi}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\left[\frac{2m^2z_1(1-z_1)}{m^2(1-z_1)^2}\right]
= \frac{\alpha}{\pi}\int_0^1 dz_1\int_0^{1-z_1}dz_2\,\frac{z_1}{1-z_1} = \frac{\alpha}{2\pi}
\tag{8.22}
$$

Equation (8.7) tells us that the real value of the magnetic moment of the electron $\mu = g(e/2m)s$ is not given by $g = 2$ but $(g-2)/2 = F_2(0) = \alpha/2\pi \simeq 0.0011614$.

4) $\gamma_E = \lim_{N\to\infty}\left(\sum_{n=1}^{N}\frac{1}{n} - \ln N\right) = 0.57721566\cdots$
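The value $\alpha/2\pi \simeq 0.0011614$ can be reproduced by carrying out the Feynman-parameter integral of Eq. (8.22) numerically. The sketch below assumes $\alpha \approx 1/137.035999$ and integrates over $z_1$ with the midpoint rule after doing the trivial $z_2$, $z_3$ integration; the function name `f2_at_zero` is an illustrative choice.

```python
import math

ALPHA = 1.0 / 137.035999  # fine-structure constant (approximate value, an assumption of this sketch)

def f2_at_zero(steps: int = 1000) -> float:
    """Numerically evaluate F_2(0) from the Feynman-parameter integral of Eq. (8.22)."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        z1 = (i + 0.5) * h
        # Integrand 2 m^2 z1 (1 - z1) / [m^2 (1 - z1)^2]; the mass cancels.
        integrand = 2.0 * z1 * (1.0 - z1) / (1.0 - z1) ** 2
        # The delta function fixes z3 = 1 - z1 - z2, leaving z2 in [0, 1 - z1].
        z2_range = 1.0 - z1
        total += integrand * z2_range * h
    return ALPHA / (2.0 * math.pi) * total

if __name__ == "__main__":
    print(f2_at_zero())           # anomalous magnetic moment (g - 2)/2 at order alpha
    print(ALPHA / (2.0 * math.pi))
```

The parameter integral equals 1, so the whole anomaly at this order is carried by the prefactor $\alpha/2\pi$.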

8.1.3 

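Infrared Divergence

$F_1(q^2)$ needs more elaborate calculation. Here we concentrate on the dominant part as $\mu^2 \to 0$. It is given as

$$
\begin{aligned}
F_1(q^2) \simeq 1 + \frac{\alpha}{2\pi}\int_0^1 dz_1\,dz_2\,dz_3\,\delta(1-z_1-z_2-z_3)\Bigg[&\frac{m^2(1-4z_1+z_1^2)+q^2(1-z_2)(1-z_3)}{m^2(1-z_1)^2-q^2z_2z_3+\mu^2z_1}\\
&- \frac{m^2(1-4z_1+z_1^2)}{m^2(1-z_1)^2+\mu^2z_1}\Bigg]
\end{aligned}
\tag{8.23}
$$

The divergence occurs in the integration region where $z_1 \to 1$, $z_2, z_3 \to 0$. In this region, we can set $z_1 = 1$, $z_2 = z_3 = 0$ in the numerator of Eq. (8.23). We can also set $\mu^2z_1 \to \mu^2$. The integral then becomes

$$
F_1(q^2) \simeq 1 + \frac{\alpha}{2\pi}\int_0^1 dz_1\int_0^{1-z_1}dz_2\left[\frac{-2m^2+q^2}{m^2(1-z_1)^2-q^2z_2(1-z_1-z_2)+\mu^2} - \frac{-2m^2}{m^2(1-z_1)^2+\mu^2}\right]
\tag{8.24}
$$

By changing the variables

$$
\xi = \frac{z_2}{1-z_1}, \qquad \eta = 1-z_1, \qquad dz_1\,dz_2 = \eta\,d\xi\,d\eta
$$

we obtain

$$
\begin{aligned}
F_1(q^2) &\simeq 1 + \frac{\alpha}{2\pi}\int_0^1 d\xi\int_0^1 d\eta\,\eta\left[\frac{-2m^2+q^2}{[m^2-q^2\xi(1-\xi)]\,\eta^2+\mu^2} - \frac{-2m^2}{m^2\eta^2+\mu^2}\right]\\
&= 1 - \frac{\alpha}{2\pi}\int_0^1 d\xi\left[\frac{m^2-q^2/2}{m^2-q^2\xi(1-\xi)}\,\ln\!\left(\frac{m^2-q^2\xi(1-\xi)}{\mu^2}\right) - \ln\!\left(\frac{m^2}{\mu^2}\right)\right]
\end{aligned}
$$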
X Lm 2 -^(l-g) ln i J 2 j" 
In the limit $\mu^2 \to 0$, the parameter $\xi$ in the logarithm is not an important part of the integrand. We have already encountered the coefficient of the log function as part of the soft photon emission probability [see Eq. (7.79)]:

$$
\int_0^1 d\xi\,\frac{m^2-q^2/2}{m^2-q^2\xi(1-\xi)} - 1 =
\begin{cases}
[-q^2/(3m^2)] + O\{(-q^2/m^2)^2\} & -q^2 \ll m^2\\[4pt]
\ln(-q^2/m^2)\,\{1+O(m^2/(-q^2))\} - 1 & -q^2 \gg m^2
\end{cases}
\tag{8.27}
$$

The final expression for $F_1(q^2)$ becomes

$$
F_1(q^2) \simeq 1 - \frac{\alpha}{2\pi}\times
\begin{cases}
[-q^2/(3m^2)]\,\ln(m^2/\mu^2) & -q^2 \ll m^2\\[4pt]
\ln(-q^2/m^2)\,\ln(-q^2/\mu^2) & -q^2 \gg m^2
\end{cases}
$$

The $q^2$ dependence of the second line is known as the Sudakov double logarithm. Since this is a second-order correction to the vertex, the contribution to the cross section for the elastic scattering can be written (for the case $-q^2 \gg m^2$) as

$$
d\sigma(p \to p') = d\sigma_0\,|F_1(q^2)|^2 = d\sigma_0\left[1 - \frac{\alpha}{\pi}\ln\frac{-q^2}{m^2}\,\ln\frac{-q^2}{\mu^2} + O(\alpha^2)\right]
\tag{8.29}
$$

The cross section for the soft bremsstrahlung (contribution from Fig. 8.1g,h) was given in Eq. (7.79):

$$
d\sigma(p \to p'\gamma) = d\sigma_0\,\frac{\alpha}{\pi}\ln\frac{-q^2}{m^2}\,\ln\frac{\omega_{\max}^2}{\mu^2}
\tag{8.30}
$$

where $\omega_{\max}$ is the upper limit of the soft photon energy. When the two cross sections are added together, we find

$$
d\sigma(p \to p') = d\sigma_0\left[1 - \frac{\alpha}{\pi}\ln\frac{-q^2}{m^2}\,\ln\frac{-q^2}{\omega_{\max}^2} + O(\alpha^2)\right]
\tag{8.31}
$$

which is a finite, converging result independent of $\mu^2$, as desired.
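The cancellation of the photon mass between the virtual correction and the soft bremsstrahlung can be made concrete with a few lines of arithmetic. The sketch below evaluates the leading-logarithm factors of Eqs. (8.29) and (8.30) for $-q^2 \gg m^2$; the function names and the sample values of $Q^2 = -q^2$, $m^2$ and $\omega_{\max}^2$ are illustrative assumptions.

```python
import math

ALPHA = 1.0 / 137.035999  # fine-structure constant (approximate value)

def virtual_factor(Q2: float, m2: float, mu2: float) -> float:
    """Elastic factor of Eq. (8.29): 1 - (alpha/pi) ln(Q2/m2) ln(Q2/mu2), with Q2 = -q^2."""
    return 1.0 - (ALPHA / math.pi) * math.log(Q2 / m2) * math.log(Q2 / mu2)

def real_factor(Q2: float, m2: float, mu2: float, wmax2: float) -> float:
    """Soft-bremsstrahlung factor of Eq. (8.30): (alpha/pi) ln(Q2/m2) ln(wmax2/mu2)."""
    return (ALPHA / math.pi) * math.log(Q2 / m2) * math.log(wmax2 / mu2)

def observable_factor(Q2: float, m2: float, mu2: float, wmax2: float) -> float:
    """Sum of the two contributions, Eq. (8.31); mu2 should drop out."""
    return virtual_factor(Q2, m2, mu2) + real_factor(Q2, m2, mu2, wmax2)

if __name__ == "__main__":
    Q2, m2, wmax2 = 100.0, 1.0, 1.0   # arbitrary units, Q2 >> m2
    for mu2 in (1e-6, 1e-12, 1e-20):
        print(mu2, virtual_factor(Q2, m2, mu2), observable_factor(Q2, m2, mu2, wmax2))
```

Each piece separately drifts without bound as $\mu^2 \to 0$, while the sum stays fixed at $1 - (\alpha/\pi)\ln(Q^2/m^2)\ln(Q^2/\omega_{\max}^2)$.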

Summary of the Infrared Divergence The infrared divergence is induced by low-energy ($k \to 0$) photons (soft photons). Diagram (c) in Fig. 8.1 is a third-order [$O(e^3)$] effect but it is indistinguishable from the lowest order effect. Therefore, their contributions have to be added to the lowest order vertex diagram (b) and squared to derive a measurable quantity. Diagram (c) produces an $O(e^6)$ effect by itself, but also produces $O(e^4)$ by interfering with (b). This is what we have in Eq. (8.29). Bremsstrahlung processes that emit very low energy photons (g,h) cannot be distinguished from those which emit none, and their contributions have to be added (after squaring the amplitude). This effect is of $O(e^4)$. We have shown that the contribution from emission of a real soft photon and that from the soft-photon loops compensate each other and no infinities appear. This is true for all orders of perturbations in QED and is demonstrated in the next section. In fact the infrared compensation works for any gauge theory.

The reader may consider that the photon-emitting process is physically distinct from that with no emission. It is certainly true that high-energy photons can be measured and the process is distinct, but consider the experimental situation. The passage of the electron is measured typically using counters of some kind and its momentum by bending in a magnetic field. The counters have a finite size and finite energy resolution, and low-energy photons that fly close to the electron are measured together with the electron, and cannot be separated. If you say this is the problem of the detector and not the principle, I would argue that even in principle one could never make a detector with infinite accuracy. The detector cannot avoid counting 1, 2, 3, …, n extra photons, and the sum of all of them becomes finite theoretically. Therefore, for the calculation of the infrared contributions, knowledge of the experimental setup becomes necessary in principle. In practice, the calculation is carried out with some cutoff. If the soft photons are not observed, their contribution can be integrated over the variables. When one calculates the total cross section or inclusive processes (in which one measures some finite number of particles and does not observe the rest), one can derive a finite result independent of the experimental conditions. Thus, we can consider that the infrared divergence is not an inherent problem.

8.1.4 

Infrared Compensation to All Orders- 
Here we show that the infrared divergence due to photon emission is compensated by that of virtual photon exchange to all orders. The most comprehensive treatment of the infrared divergence is given in [397]. Multiple emission of soft photons was first considered by Bloch and Nordsieck [67] classically. The discussion here follows closely that of Weinberg [382].

We have learned that the soft bremsstrahlung process was well approximated by elastic scattering of an electron at a nucleus multiplied by the probability of emitting a soft photon [$e(p) \to e(p') + \gamma(k)$]. The cross section in the soft-photon limit was given by Eq. (7.74)

$$
\begin{aligned}
d\sigma(p \to p'+k) &= d\sigma(p \to p')\,P_\gamma\\
P_\gamma &= \int\frac{d^3k}{2\omega(2\pi)^3}\sum_\lambda e^2\left|\frac{(\epsilon_\lambda\cdot p_f)}{(k\cdot p_f)} - \frac{(\epsilon_\lambda\cdot p_i)}{(k\cdot p_i)}\right|^2\\
&\simeq \frac{\alpha}{\pi}\ln\frac{-q^2}{m^2}\,\ln\frac{\omega_{\max}^2}{\mu^2} \qquad -q^2 \gg m^2
\end{aligned}
\tag{8.32}
$$

where $\mu$ is the small photon mass introduced to avoid the infrared divergence and $\omega_{\max}$ is the maximum photon energy the detector is capable of detecting. We have seen that the infrared divergence vanishes to order $\alpha$ if we include the virtual photon contribution to the vertex.
Here we intend to show that the cancellation is effective at every order of photon emission. This may seem an impossible task, since there are a variety of ways in which the infrared divergence can enter the various Feynman diagrams. However, the problem is made tractable by extracting only those in which soft photons are emitted from the incoming or outgoing electrons on the mass shell.

The Feynman diagram where a total of n soft photons are emitted from on-shell electrons has the structure depicted in Fig. 8.2a. Only soft photons emitted from (nearly) on-shell electrons contribute, because the electron propagator before (or after) the photon emission is proportional to

$$
\frac{1}{(p\pm k_i)^2-m^2} = \frac{1}{(p^2-m^2)\pm 2(p\cdot k_i)}
\tag{8.33}
$$

and becomes large only when $p^2-m^2 = 0$ and $k_i \to 0$. Contributions of hard photons or soft photons emitted from deeply off-shell electrons, where $(p\pm k)^2-m^2$ is far from zero, do not contribute much.

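Figure 8.2 Diagrams that contribute to the infrared divergence. (a) Only soft photons emitted from on-shell electrons, namely those incoming and outgoing electrons at the outer legs, give large contribution. Soft photons or hard photons from deep off-shell electrons (denoted by the gray blob) do not contribute much. (b) Only soft-photon internal lines exchanged by on-shell electrons contribute to cancel the divergence created by real photons.

First we consider a process where all the soft photons are emitted from the outgoing electron. The amplitude has the following form, where we have numbered the photons as in Fig. 8.2:

$$
\bar u(p')\,(-ie\slashed\epsilon_1^*)\,\frac{i(\slashed p'+\slashed k_1+m)}{2(p'\cdot k_1)}\,(-ie\slashed\epsilon_2^*)\,\frac{i(\slashed p'+\slashed k_1+\slashed k_2+m)}{2\,p'\cdot(k_1+k_2)}\cdots
(-ie\slashed\epsilon_n^*)\,\frac{i(\slashed p'+\slashed k_1+\cdots+\slashed k_n+m)}{2\,p'\cdot(k_1+\cdots+k_n)}\,i\mathcal M_{\rm el}\,u(p)
\tag{8.34}
$$

We are only interested in the soft-photon limit ($\omega = |k^0| \to 0$). Therefore, we can neglect $\slashed k$ terms in the numerator. Next we move $\slashed p'$ to the left and use $\bar u(p')(\slashed p'-m) = 0$. Namely

$$
\begin{aligned}
\bar u(p')\,\slashed\epsilon^*(\slashed p'+m)\,\slashed\epsilon^*(\slashed p'+m)\cdots u(p)
&= \bar u(p')\,[\,2(\epsilon^*\cdot p') - (\slashed p'-m)\slashed\epsilon^*\,]\,\slashed\epsilon^*(\slashed p'+m)\cdots u(p)\\
&= \bar u(p')\,2(\epsilon^*\cdot p')\,\slashed\epsilon^*(\slashed p'+m)\cdots u(p)\\
&= \bar u(p')\,2(\epsilon^*\cdot p')\,2(\epsilon^*\cdot p')\cdots u(p)
\end{aligned}
\tag{8.35}
$$

Thus all the numerators can be replaced with $(2\epsilon^*\cdot p')$. Furthermore, summing over all the permutations of the photons, we use the formula

$$
\sum_{\rm perm.}\frac{1}{(p'\cdot k_1)}\,\frac{1}{(p'\cdot(k_1+k_2))}\cdots\frac{1}{(p'\cdot(k_1+\cdots+k_n))}
= \frac{1}{(p'\cdot k_1)}\,\frac{1}{(p'\cdot k_2)}\cdots\frac{1}{(p'\cdot k_n)}
$$

which can easily be proven for $n = 2$:

$$
\frac{1}{(p'\cdot k_1)\,(p'\cdot(k_1+k_2))} + \frac{1}{(p'\cdot k_2)\,(p'\cdot(k_1+k_2))} = \frac{1}{(p'\cdot k_1)(p'\cdot k_2)}
$$

The result can be proved for general n by induction. Then the scattering amplitude with subsequent n-photon emission that is effective at $k_j \to 0$ can be written as

$$
\bar u(p')\,i\mathcal M_{\rm el}\,u(p)\prod_{i=1}^{n}e\,\frac{(\epsilon_i^*\cdot p')}{(p'\cdot k_i)}
\tag{8.38a}
$$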
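The permutation identity used above is easy to test numerically for a small number of photons. In this sketch the positive numbers `a[i]` stand for the invariants $p'\cdot k_i$ (arbitrary sample values, not taken from the text), and the function names are illustrative.

```python
import math
from itertools import permutations

def eikonal_sum(a):
    """Sum over all orderings of 1/[a1 (a1+a2) ... (a1+...+an)],
    where a[i] stands for the invariant p'.k_i of the i-th soft photon."""
    total = 0.0
    for ordering in permutations(a):
        partial_sum = 0.0
        term = 1.0
        for value in ordering:
            partial_sum += value      # p'.(k_1 + ... + k_j) for this ordering
            term /= partial_sum
        total += term
    return total

def eikonal_product(a):
    """Right-hand side of the identity: the product of 1/(p'.k_i)."""
    return 1.0 / math.prod(a)

if __name__ == "__main__":
    sample = [0.3, 1.1, 2.5, 0.7]     # arbitrary positive values of p'.k_i
    print(eikonal_sum(sample), eikonal_product(sample))
```

The agreement for any choice of positive inputs reflects the inductive proof: the sum over which photon is attached last reproduces a common factor $1/(a_1+\cdots+a_n)$ times $(a_1+\cdots+a_n)/\prod a_i$.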



Next we assume all the soft photons are emitted from the incoming electron and repeat the same procedure. We obtain

$$
\bar u(p')\,i\mathcal M_{\rm el}\,u(p)\prod_{i=1}^{n}\left(-e\,\frac{(\epsilon_i^*\cdot p)}{(p\cdot k_i)}\right)
\tag{8.38b}
$$

which can be obtained from Eq. (8.38a) by the substitutions $p' \to p$, $k_i \to -k_i$, as can be seen in the lower part of Fig. 8.2a. In reality, the n photons can be emitted from either the outgoing or the incoming electron. For this case, we have

$$
\bar u(p')\,i\mathcal M_{\rm el}\,u(p)\prod_{i=1}^{n}e\left[\frac{(\epsilon_i^*\cdot p')}{(p'\cdot k_i)} - \frac{(\epsilon_i^*\cdot p)}{(p\cdot k_i)}\right]
\tag{8.39}
$$

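Then the probability of emitting n photons is obtained by squaring the amplitude, taking the polarization sum, which gives $\sum_\lambda(\epsilon_\lambda\cdot A)(\epsilon_\lambda^*\cdot B) = -(A\cdot B)$, and multiplying by the final-state phase space volume. Note $1/n!$ comes in because of the Bose statistics of the photon. Summing over all n's, we obtain

$$
\begin{aligned}
d\sigma &= d\sigma(p \to p')\sum_n\frac{1}{n!}R^n = d\sigma(p \to p')\exp[R]\\
R &= \int\frac{d^3k}{(2\pi)^3\,2\omega}\,e^2\sum_\lambda\left|\frac{(\epsilon_\lambda^*\cdot p')}{(p'\cdot k)} - \frac{(\epsilon_\lambda^*\cdot p)}{(p\cdot k)}\right|^2\\
&\simeq \frac{\alpha}{\pi}\ln\!\left(\frac{-q^2}{m^2}\right)\ln\!\left(\frac{\omega_{\max}^2}{\mu^2}\right) \qquad \text{for } -q^2 = -(p'-p)^2 \gg m^2
\end{aligned}
\tag{8.40}
$$

where $\omega_{\max}$ is the maximum energy the detector is capable of seeing and $\mu$ is the photon mass temporarily introduced to avoid the infrared divergence. Note the one-photon emission probability $R$ was already given as $P_\gamma$ of the bremsstrahlung cross section $d\sigma(p \to p'+\gamma) = d\sigma(p \to p')\,P_\gamma$ in Eqs. (7.74) and (7.79).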

Next we consider the corresponding internal soft-photon contributions. Corresponding to the n real-photon emission cross section, we need to consider an interference term of order 2n (n soft internal lines) with the 0th-order diagram. First we arrange 2n external photons arbitrarily on either outgoing or incoming electron lines. Then we pick up a pair $(k_i, k_j)$ of photon lines and connect them to make an internal line. Recollecting that to each external photon line a factor $(\epsilon^*\cdot p')/(p'\cdot k_i)$ or $-(\epsilon^*\cdot p)/(p\cdot k_i)$ is attached, depending on whether the i, j th line is emitted from the outgoing or the incoming electron, we have an amplitude for each internal line

$$
e^2\int\frac{d^4k}{(2\pi)^4}\,\frac{-i}{k^2+i\varepsilon}\left(\frac{p_i}{(p_i\cdot k)}\right)\cdot\left(\frac{p_j}{-(p_j\cdot k)}\right)
\tag{8.41}
$$

where the dot product is produced by $-g_{\mu\nu}$ $(=\sum\epsilon_\mu\epsilon_\nu)$ in the numerator of the photon propagator. $p_i$ and $p_j$ stand for $p'$ or $p$ in the numerator and $p'$ or $-p$ in the denominator, depending on whether the photon i, j is emitted from the outgoing or incoming line. The extra minus sign in the second denominator appears because if we define $k$ as the photon momentum emitted by the line i then $k$ must be absorbed by line j.

In making n internal lines out of 2n external lines for all possible combinations,
we use Eq. (8.39) to express 2n external photon lines, choose lines i and j ($i \neq j$)



and connect pairwise to give 

& tJ.fJJp'-h) \ (p-kj)) 

where $\sum\epsilon_i^\mu\epsilon_j^\nu\big|_{i=j} = -g^{\mu\nu}$ was used in going to the third line. We have to divide the sum by $2^n n!$ because, on integrating over $k_i$, interchanging n virtual photons and two directions of a pair with each other produces the same diagrams. Therefore the sum of all different combinations gives

$$
A_{\rm virtual} = \frac{V^n}{n!}
\tag{8.43}
$$

$$
\begin{aligned}
V &= \frac{e^2}{2}\int\frac{d^4k}{(2\pi)^4}\,\frac{-i}{k^2+i\varepsilon}\left(\frac{p'}{(p'\cdot k)} - \frac{p}{(p\cdot k)}\right)\cdot\left(\frac{p'}{-(p'\cdot k)} - \frac{p}{-(p\cdot k)}\right)\\
&\simeq -\frac{\alpha}{2\pi}\ln\!\left(\frac{-q^2}{m^2}\right)\ln\!\left(\frac{-q^2}{\mu^2}\right) \qquad \text{for } -q^2 \gg m^2
\end{aligned}
\tag{8.44}
$$

Derivation of the last line can be done following the same logic as was done to calculate the one loop vertex correction function of Eq. (8.2). In fact, one can recognize that $V$ is just its infrared limit which resulted in Eq. (8.28). Virtual photon exchange diagrams may contain m extra external photon emissions, but they factorize and constitute a part of Eq. (8.40). Therefore adding 0, 1, 2, …, n, … virtual photon exchange contributions, we can obtain the correction factor $A_{\rm virtual}$ that multiplies the elastic scattering amplitude. The scattering cross section for emission of any number of photons becomes

$$
\begin{aligned}
\sum_n d\sigma(p \to p'+n\gamma) &= d\sigma_0\left(\sum_n\frac{R^n}{n!}\right)\left|\sum_m\frac{V^m}{m!}\right|^2\\
&= d\sigma_0\exp(R)\exp(2V)\\
&= d\sigma_0\exp\!\left[-\frac{\alpha}{\pi}\ln\!\left(\frac{-q^2}{m^2}\right)\ln\!\left(\frac{-q^2}{\omega_{\max}^2}\right)\right] \qquad \text{for } -q^2 \gg m^2
\end{aligned}
\tag{8.45}
$$
This expression depends on the detector sensitivity, but no longer on the infrared
cutoff $\mu$. Unlike the $O(\alpha)$ correction, which could go to $-\infty$, 5) the correction factor
is finite and lies between 0 and 1. If we set $\omega_{\min}$ instead of $\omega_{\max}$ as the minimum
energy the detector can detect, the cross section for scattering with no detected
photon emission is

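$$d\sigma(p \to p' + 0\gamma) = d\sigma_0 \exp\left[-\frac{\alpha}{\pi}\ln\left(\frac{-q^2}{m^2}\right)\ln\left(\frac{-q^2}{\omega_{\min}^2}\right)\right] \tag{8.46}$$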

which decreases faster than any power of $q^2$. The exponential correction factor
contains the Sudakov double logarithm and is known as the Sudakov form factor.
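
To get a feeling for the size of the double logarithm, the exponential factor of Eq. (8.46) can be evaluated numerically. The following is a minimal sketch, assuming the leading-log form $\exp[-(\alpha/\pi)\ln(Q^2/m^2)\ln(Q^2/\omega_{\min}^2)]$ with $Q^2 = -q^2$; the function name and the sample energies are illustrative choices, not taken from the text.

```python
import math

ALPHA = 1 / 137.035999  # fine structure constant at zero momentum transfer


def sudakov_factor(Q2, m, omega_min, alpha=ALPHA):
    """Leading-log suppression factor exp[-(alpha/pi) ln(Q2/m^2) ln(Q2/omega_min^2)].

    Q2 = -q^2 is the spacelike momentum transfer squared, m the electron mass
    and omega_min the smallest photon energy the detector can see (same units).
    The formula is only meaningful for Q2 >> m^2.
    """
    if Q2 <= m * m or Q2 <= omega_min * omega_min:
        raise ValueError("leading-log form assumes Q2 > m^2 and Q2 > omega_min^2")
    exponent = (alpha / math.pi) * math.log(Q2 / m**2) * math.log(Q2 / omega_min**2)
    return math.exp(-exponent)


if __name__ == "__main__":
    m_e = 0.511e-3  # GeV
    for Q in (1.0, 10.0, 100.0):  # GeV, hypothetical momentum transfers
        print(f"Q = {Q:6.1f} GeV  factor = {sudakov_factor(Q * Q, m_e, 0.1):.4f}")
```

The factor stays between 0 and 1 and decreases as $Q^2$ grows, which is the behavior described above.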

n-Photon Emission 

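If we set up a detector to observe any number of photons with energy $\omega_{\min} < \omega < \omega_{\max}$,
the probability of finding $n$ photons would be proportional to $R^n/n!$, where
$R$ is given by integrating the photon energy from $\omega_{\min}$ to $\omega_{\max}$ in Eq. (8.40)

$$P_n \propto \frac{1}{n!}R^n$$

$$R = \int \frac{d^3k}{(2\pi)^3 2\omega}\, e^2 \sum_\lambda \left|\frac{\epsilon_\lambda^*\cdot p'}{p'\cdot k} - \frac{\epsilon_\lambda^*\cdot p}{p\cdot k}\right|^2 \tag{8.47}$$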



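As the total probability of observing any number of photons should equal 1, we
calculate the normalization constant $N$ from

$$1 = N\sum_n \frac{R^n}{n!} = N e^{R} \tag{8.48a}$$

$$P_n = \frac{R^n}{n!}e^{-R}, \qquad R = \frac{\alpha}{\pi}\ln\left(\frac{-q^2}{m^2}\right)\ln\left(\frac{\omega_{\max}^2}{\omega_{\min}^2}\right) \tag{8.48b}$$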

This gives a Poisson distribution, as we expected from discussions in Sect. 7.3.1. 
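
The normalization argument can be checked numerically. The sketch below assumes the leading-log expression for $R$ quoted in Eq. (8.48b) and uses hypothetical detector thresholds; it simply confirms that the $P_n$ sum to one and that $P_0 = e^{-R}$.

```python
import math


def mean_photon_number(Q2, m, omega_min, omega_max, alpha=1 / 137.035999):
    """R = (alpha/pi) ln(Q2/m^2) ln(omega_max^2/omega_min^2), with Q2 = -q^2 >> m^2."""
    return (alpha / math.pi) * math.log(Q2 / m**2) * math.log(omega_max**2 / omega_min**2)


def photon_number_probabilities(R, n_max):
    """Poisson probabilities P_n = R^n e^{-R} / n! for n = 0 .. n_max."""
    return [R**n * math.exp(-R) / math.factorial(n) for n in range(n_max + 1)]


if __name__ == "__main__":
    # Hypothetical setup: Q = 100 GeV, photons detected between 0.1 and 10 GeV.
    R = mean_photon_number(100.0**2, 0.511e-3, 0.1, 10.0)
    probs = photon_number_probabilities(R, 40)
    print(f"R = {R:.4f}, P_0 = {probs[0]:.4f}, sum = {sum(probs):.12f}")
```

The mean number of detected photons is $R$ itself, so for these sample values roughly half a photon is radiated on average into the detector window.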

8.1.5 

Running Coupling Constant 

We now turn to other higher order corrections that involve the self-energy of the 
photon and the electron. Let us consider the photon propagator (Fig. 8.3). 

5) The whole perturbation argument breaks down when $\alpha\ln(m/\mu)$ becomes comparable to
unity, which occurs at $\mu \sim e^{-137} m \sim 10^{-60} m$. Physically this is an absurd situation, but

Figure 8.3 When the higher order radiative corrections are added to the lowest propagator $D_F$,
it is dressed to become $iD'_F = iD_F + iD_F(-i\Sigma_{\gamma\gamma})iD_F + \cdots$. Here only ladder-type corrections
are summed and included.

When the contribution of the loop correction (second diagram on the right hand
side, rhs) is added, the photon propagator is modified from the lowest order expression



' £ + h^ inXu ^ = IV" + 7i(- i77 '"')l i < 8 - 49a > 



J (2. 






(8.49b) 

There is an extra minus sign multiplied for each fermion loop. It originates from
the anticommutativity of the fermion field. The first term in the first line is the
lowest order photon propagator, which is the Feynman propagator of the free field.
Counting the powers of $k$, the integral seems to diverge as $\sim k^2$. However, because
of gauge invariance, $\Pi^{\mu\nu}$ satisfies the constraint

$$q_\mu \Pi^{\mu\nu} = q_\nu \Pi^{\mu\nu} = 0 \tag{8.50}$$

which means its functional form has to be of the type

$$\Pi^{\mu\nu} = (q^2 g^{\mu\nu} - q^\mu q^\nu)\,\Pi_\gamma \tag{8.51}$$

Therefore, the powers of $k$ are effectively reduced by 2 in $\Pi_\gamma$, and it only diverges
as $\log k^2$, just like the loop integral of the diagram in Fig. 8.1c. For the sake of later

S yy = q 2 n y (8.52a) 

_►#*"= ^-J^Syy (8.52b) 

If we expand $\Pi_\gamma(q^2)$ in Taylor series 6)

$$\Pi_\gamma(q^2) = \Pi_\gamma(0) + q^2\left.\frac{\partial}{\partial q^2}\Pi_\gamma(q^2)\right|_{q^2=0} + \cdots \equiv \Pi_\gamma(0) + \hat\Pi_\gamma(q^2) \tag{8.53}$$

By definition $\hat\Pi_\gamma(0) = 0$. The power of $k^2$ in the integral Eq. (8.49b) which
contributes to the coefficients of the Taylor series reduces as the power of $q^2$ increases.
Since the constant term $\Pi_\gamma(0)$ contains the highest power of $k^2$, if it diverges at
most logarithmically, we expect that coefficients of the higher $q^2$ powers do not
diverge and are finite.

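Indeed, actual calculation (see Appendix D for details) shows that the divergent
term is included only in $\Pi_\gamma(0)$:

$$\Pi_\gamma(q^2) = \Pi_\gamma(0) - \frac{2\alpha}{\pi}\int_0^1 dz\, z(1-z)\ln\left[1 - \frac{q^2}{m^2}z(1-z)\right] \tag{8.54}$$

$$\Pi_\gamma(0) = \frac{\alpha}{3\pi}\left(\Delta - \ln m^2\right) \tag{8.55}$$

$\Delta$ is the divergent term and is given by Eq. (8.18). If we use dimensional regularization,
and carry out the integration in $D$-dimensional space, the integral converges
for $D = 4 - \epsilon$, $\epsilon > 0$. The divergence comes back when we make $\epsilon \to 0$, as $\Delta$ contains
a pole term $1/\epsilon$. The second term is well defined and finite and behaves like

$$\hat\Pi_\gamma(q^2) \simeq \begin{cases} \dfrac{\alpha}{15\pi}\dfrac{q^2}{m^2} & |q^2| \ll m^2 \\[2mm] -\dfrac{\alpha}{3\pi}\ln\left(\dfrac{-q^2}{m^2}\right) & -q^2 \gg m^2 \end{cases} \tag{8.56}$$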

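Including the second-order effect, the lowest order Mott scattering amplitude
Eq. (8.1) is modified to

$$ie\,\bar u(p')\gamma_\mu u(p)\left[\frac{g^{\mu\nu}}{iq^2} + \frac{1}{iq^2}\left(-i\Pi^{\mu\nu}\right)\frac{1}{iq^2}\right]\left[e j_\nu(q)\right] \tag{8.57a}$$

The term proportional to $q^\mu q^\nu$ vanishes when coupled with the conserved current
($q_\mu j^\mu = 0$ or $q_\mu \bar u\gamma^\mu u = 0$). The above expression gives

$$ie\,\bar u(p')\gamma^\mu u(p)\frac{g_{\mu\nu}}{iq^2}\left[1 - \Pi_\gamma(0) - \hat\Pi_\gamma(q^2)\right]\left[e j^\nu(q)\right]$$

$$\simeq ie\,\bar u(p')\gamma^\mu u(p)\frac{g_{\mu\nu}}{iq^2}\left[1 + \Pi_\gamma(0)\right]^{-1}\times\left[1 - \hat\Pi_\gamma(q^2)\right]\left[e j^\nu(q)\right] \tag{8.57b}$$

Then defining

$$\Pi_\gamma(0) = \frac{\alpha}{3\pi}\left(\Delta - \ln m^2\right) = \frac{1}{Z_3} - 1 \equiv \delta Z_3 \tag{8.58a}$$

$$e_R = e\left[1 + \Pi_\gamma(0)\right]^{-1/2} = Z_3^{1/2} e \tag{8.58b}$$

we rewrite Eq. (8.57) to give

$$\to ie_R\,\bar u(p')\gamma^\mu u(p)\frac{g_{\mu\nu}}{iq^2}\left[1 - \hat\Pi_\gamma(q^2)\right]\left[e_R j^\nu(q)\right] \tag{8.59a}$$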



We shall call $e$ the bare coupling constant and $e_R$ the renormalized coupling
constant. The bare coupling constant, which appears in the original Lagrangian, is
then not the true coupling constant. The true coupling constant should include all
the higher order corrections and is an observable because it appears in measurable
quantities such as cross sections. Here we have treated $\delta Z_3 = \Pi_\gamma(0)$ as a small
number in the perturbation series in terms of $\alpha$, though actually it is divergent.

In effect, we have subtracted infinity from infinity to obtain a finite expression, 
which is not allowed because it is an ambiguous procedure. That is what we learned 
in high-school mathematics. The justification is as follows. Higher order correc- 
tions for the photon come in because of the interactions with charged particles. 
But if $q^2 = 0$, the photon is on shell, and it is a freely propagating wave. The field
is completely decided by the equation of motion without a source. We know the
propagator for the free photon. It is the Green's function to the free Klein-Gordon
equation, which is $-ig^{\mu\nu}/q^2$. Whatever form the modified propagator takes, it has
to respect this condition. Therefore, in principle, the correction to the lowest order
term must vanish at $q^2 = 0$, or $\Pi_\gamma(0) = 0$ must hold. In practice it does not, so we
subtract it as redundant.

Now, the photon can make an electron-positron pair at any time and acquire an
extra self-energy. So applying the replacement Eq. (8.49) to the photon propagator
successively, the net effect is to change $D_F \to D'_F$ (to dress the photon), the diagrams
of which are depicted in Fig. 8.3. Only ladder-type diagrams are collected
here. More complex diagrams have to be included if we go to higher orders.

Then the expression for $D'_F$ is given by

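$$iD'^{\mu\nu}_F(q) = \frac{g^{\mu\nu}}{iq^2} + \frac{g^{\mu\nu}}{iq^2}\left(-i\Sigma_{\gamma\gamma}\right)\frac{1}{iq^2} + \frac{g^{\mu\nu}}{iq^2}\left(-i\Sigma_{\gamma\gamma}\right)\frac{1}{iq^2}\left(-i\Sigma_{\gamma\gamma}\right)\frac{1}{iq^2} + \cdots$$

$$= \frac{g^{\mu\nu}}{i\left[q^2 + \Sigma_{\gamma\gamma}(q^2)\right]} = \frac{g^{\mu\nu}}{iq^2}\,\frac{1}{1 + \Pi_\gamma(q^2)}$$

$$= \frac{g^{\mu\nu}}{iq^2}\,\frac{1}{1 + \Pi_\gamma(0) + \hat\Pi_\gamma(q^2)} = \frac{g^{\mu\nu}}{iq^2}\,\frac{1}{Z_3^{-1} + \hat\Pi_\gamma(q^2)}$$

$$= \frac{g^{\mu\nu}}{iq^2}\,\frac{Z_3}{1 + Z_3\hat\Pi_\gamma(q^2)} \tag{8.60}$$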
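
The resummation leading to Eq. (8.60) is a geometric series: each extra self-energy insertion multiplies the lowest order propagator by $-\Pi_\gamma$, since $\Sigma_{\gamma\gamma} = q^2\Pi_\gamma$. A minimal numerical sketch of this step, treating $\Pi_\gamma$ as an ordinary small number (an assumption made only for illustration, as the bare quantity is divergent):

```python
def dressed_factor_partial_sum(pi_gamma, n_terms):
    """Sum of the first n_terms insertions: 1 - Pi + Pi^2 - ... (ladder-type chain)."""
    return sum((-pi_gamma) ** n for n in range(n_terms))


def dressed_factor_closed_form(pi_gamma):
    """Closed form 1 / (1 + Pi) multiplying the lowest order propagator."""
    return 1.0 / (1.0 + pi_gamma)


if __name__ == "__main__":
    pi_gamma = 0.1  # hypothetical value, chosen so that the series converges quickly
    for n_terms in (1, 2, 5, 20):
        print(n_terms, dressed_factor_partial_sum(pi_gamma, n_terms))
    print("closed form:", dressed_factor_closed_form(pi_gamma))
```

The partial sums converge to $1/(1+\Pi_\gamma)$ whenever $|\Pi_\gamma| < 1$, which is the factor appearing in the second line of Eq. (8.60).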

As the propagator is always sandwiched between two vertices, it accompanies e 2 
[see Eq. (8.49)]. Therefore, we move the Z 3 factor to them and r< 



$\hat\Pi_\gamma$ itself is proportional to $e^2$ and appears as multiplied by $Z_3$. Therefore we can
replace $e^2$ in $\hat\Pi_\gamma$ by $e_R^2$, too.

$$Z_3\hat\Pi_\gamma(q^2, e^2) = \hat\Pi_\gamma(q^2, e_R^2) \tag{8.62}$$

Although $Z_3$ is a diverging constant, if we consider $Z_3$ and consequently $e_R$ to
be finite, the photon propagator can still be expressed as $g^{\mu\nu}/q^2$, but with the coupling
modified to

$$e_R^2 \to e_R^2(q^2) \equiv \frac{Z_3 e^2}{1 + Z_3\hat\Pi_\gamma(q^2, e^2)} = \frac{e_R^2}{1 + \hat\Pi_\gamma(q^2, e_R^2)} \tag{8.63}$$
$e_R^2$ is now a function of $q^2$ and is referred to as the effective or running coupling
constant. Apart from the treatment of the infinite constants, all the higher order
corrections to the photon propagator result in simply making the coupling constant
run. From the above argument, the ultraviolet problem, as far as the photon
correction is concerned, can be contained in the coupling constant, a procedure
called renormalization.

Finally, we give a form frequently quoted as the running coupling constant of
QED at high energy. Substituting Eq. (8.56) in Eq. (8.63) we have at large $Q^2 = -q^2 \gg m^2$

$$\alpha(Q^2) = \frac{\alpha}{1 - \dfrac{\alpha}{3\pi}\ln\left(\dfrac{Q^2}{m^2}\right)} \tag{8.64}$$

where $\alpha = \alpha(0)$. The screening due to vacuum polarization gives the smallest
value of $\alpha$ at infinite distance. Conversely $\alpha$ becomes larger at larger value of $Q^2$
or at shorter distance.
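
The running can be made concrete with a short numerical sketch. It assumes only the electron loop, integrates the finite part of Eq. (8.54) for spacelike $q^2 = -Q^2$ with a simple midpoint rule, and compares the result with the leading-log form; the function names and the choice of $Q = 91.1876$ GeV are illustrative.

```python
import math

ALPHA0 = 1 / 137.035999  # alpha at Q^2 = 0
M_E = 0.51099895e-3      # electron mass in GeV


def pi_hat(Q2, m=M_E, alpha=ALPHA0, steps=20000):
    """Finite part of the vacuum polarization for spacelike q^2 = -Q2.

    Evaluates -(2 alpha/pi) * int_0^1 dz z(1-z) ln[1 + (Q2/m^2) z(1-z)]
    with the midpoint rule.
    """
    r = Q2 / m**2
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        z = (i + 0.5) * h
        w = z * (1.0 - z)
        total += w * math.log1p(r * w)
    return -(2.0 * alpha / math.pi) * total * h


def alpha_running(Q2, m=M_E, alpha=ALPHA0):
    """alpha(Q2) = alpha / (1 + pi_hat(Q2)) with the one-loop electron contribution."""
    return alpha / (1.0 + pi_hat(Q2, m, alpha))


def alpha_running_leading_log(Q2, m=M_E, alpha=ALPHA0):
    """Leading-log form alpha / (1 - (alpha / 3 pi) ln(Q2/m^2)), valid for Q2 >> m^2."""
    return alpha / (1.0 - alpha / (3.0 * math.pi) * math.log(Q2 / m**2))


if __name__ == "__main__":
    Q2 = 91.1876**2  # GeV^2
    print("1/alpha (integral)    :", 1.0 / alpha_running(Q2))
    print("1/alpha (leading log) :", 1.0 / alpha_running_leading_log(Q2))
```

The two results differ by the constant $-5/3$ that accompanies the logarithm in the large-$Q^2$ expansion of the integral. A realistic estimate would also have to include the other charged leptons and the quarks, which this sketch leaves out.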

8.1.6 

Mass Renormalization 

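Next let us consider the radiative corrections to the electron (fermion) propagator.
As an example consider the Compton scattering amplitude. Its inner line corresponds
to the propagator

$$iS_F(p) = \frac{i}{\not p - m + i\epsilon} \tag{8.65}$$

The next higher order correction is a process to emit and reabsorb a photon, as
shown in Fig. 8.4. Using the same argument as for the photon propagator, we may
sum all the ladder-type repetitions of the self-energy:

$$iS_F(p) \to iS'_F(p) = \frac{i}{\not p - m} + \frac{i}{\not p - m}\left(i\Sigma_e(p)\right)\frac{i}{\not p - m} + \cdots \tag{8.66a}$$

$$i\Sigma_e(p) = \int \frac{d^4k}{(2\pi)^4}\,\frac{-ig^{\mu\nu}}{k^2 - \mu^2 + i\epsilon}\,(ie\gamma_\mu)\frac{i}{\not p - \not k - m + i\epsilon}(ie\gamma_\nu) \tag{8.66b}$$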

where we introduced a small photon mass $\mu$ to avoid the infrared divergence. As
we can see by counting the powers of $k$, the integral diverges as $k \to \infty$. $\Sigma_e(p)$
can be calculated again using the same regularization method as the photon (see
Appendix D). Then $S'_F(p)$ can be rewritten as

$$iS'_F(p) = \frac{i}{\not p - m + \Sigma_e(p)} \tag{8.67}$$

Figure 8.4 $iS'_F = iS_F + iS_F(-i\Sigma_e)iS_F + \cdots$.



Note, the pole of the propagator has shifted, meaning the mass value has changed
as the result of higher order corrections. Then the original mass term $m$, called the
bare mass, that appeared in the Lagrangian is not the true mass, but a parameter
having no physical meaning. The true mass should include all the corrections and
the propagator should have the pole at the correct (or observed) mass, which we
denote by $m_R$. Then we perform the Taylor expansion around $\not p = m_R$ in the same
spirit as we did for the photon propagator around $q^2 = 0$:

$$\Sigma_e(p) = \delta m + \left[\delta Z_2 + \Pi_e(\not p)\right](\not p - m_R) \tag{8.68a}$$

$$\delta m = \Sigma_e(p)\big|_{\not p = m_R} = \Sigma_e(m) + O(\alpha^2) \simeq -\frac{3\alpha}{4\pi} m\Delta \tag{8.68b}$$

$$\delta Z_2 = \frac{1}{Z_2} - 1 = \left.\frac{\partial \Sigma_e(p)}{\partial \not p}\right|_{\not p = m_R} \simeq +\frac{\alpha}{4\pi}\Delta \tag{8.68c}$$

$$\Pi_e(m_R) = 0 \tag{8.68d}$$

where $\simeq$ means that some finite contributions are omitted. The diverging quantities
are contained in $\delta m$ and $\delta Z_2$. After they are subtracted, the remaining $\Pi_e$
is finite. $\Pi_e(m_R) = 0$ since it is the third term of the Taylor expansion around
$\not p = m_R$. Using the above quantities, $S'_F$ (8.66) can be rewritten as

$$iS'_F(p) = \frac{i}{\not p - m + \Sigma_e(p)} = \frac{i}{\left[Z_2^{-1} + \Pi_e(p^2)\right](\not p - m_R)} = \frac{iZ_2}{\left[1 + Z_2\Pi_e(p^2)\right](\not p - m_R)} \tag{8.69}$$

As the propagator accompanies $e^2$ also, the replacement $e^2 \to e'^2_R = Z_2 e^2$
removes the diverging $Z_2$, in the same way as we removed $Z_3$. Therefore

$$e^2\, iS'_F(p) \to e'^2_R\,\frac{i}{1 + \Pi_e(p^2)}\,\frac{1}{\not p - m_R} \tag{8.70a}$$

$$e'^2_R = Z_2 e^2, \qquad m_R = m - \delta m \tag{8.70b}$$

Thus we have shown that the renormalization effect is to make the replacement
$m \to m_R$ and multiply the electron propagator by $Z_2$ and the photon propagator
by $Z_3$. $Z_2$ and $Z_3$ are absorbed in the definition of the running coupling constant



(i + n e (p 2 )(i + n Y (q 2 ) 
where $F_1$ is a vertex correction Eq. (8.24). Note, when the higher order corrections
are applied to the external lines, $p^2 = p'^2 = m_R^2$ or $q^2 = 0$ for the photon, and
$\Pi_e(m_R) = \hat\Pi_\gamma(0) = 0$ holds. Therefore, the effect of the renormalization on the
external line is only to change $m \to m_R$ and multiply the wave functions by $Z_2$ or
$Z_3$ (see arguments in Sect. 8.1.8).

8.1.7 

Ward-Takahashi Identity 

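The correction to the vertex function was calculated from Eq. (8.2)

$$\bar u(p')\gamma^\mu u(p) \to \bar u(p')\left[\gamma^\mu + \Gamma^\mu\right]u(p) \tag{8.72}$$

where $\Gamma^\mu$ is given in Eq. (8.20) and includes a divergent term. It can be separated
and removed in a similar way as was done for the propagator. Namely, when we
take the limit

$$q = p' - p \to 0, \qquad \not p' \to m, \qquad \not p \to m \tag{8.73}$$

$$\Gamma^\mu(p'^2, p^2) = \Gamma^\mu(m^2, m^2) + (\text{finite term}) = (Z_1^{-1} - 1)\gamma^\mu + (\text{finite term}) \tag{8.74}$$

As a consequence, the vertex is modified by higher order corrections to

$$\gamma^\mu \to Z_1^{-1}\gamma^\mu(1 + \text{finite term}) \tag{8.75}$$

Actually, $Z_1$ can be obtained without explicitly calculating Eq. (8.2). The trick is to
use the following relation:

$$\frac{\partial S_F(p)}{\partial p_\mu} = -S_F(p)\gamma^\mu S_F(p) \tag{8.76}$$

$$S_F(p)S_F^{-1}(p) = 1 \tag{8.77}$$

Differentiating Eq. (8.77) and using Eq. (8.76), we obtain

$$\frac{\partial}{\partial p_\mu}S_F^{-1}(p) = -S_F^{-1}(p)\left(\frac{\partial}{\partial p_\mu}S_F(p)\right)S_F^{-1}(p) = \gamma^\mu \tag{8.78}$$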

The meaning of Eqs. (8.76) and (8.78) is that the derivative of a propagator with
respect to the momentum is equivalent to attaching a zero-energy photon or converting
the propagator to a vertex with zero momentum transfer. The argument
can be generalized and any vertex function can be obtained by differentiating the
corresponding propagator:

$$\Gamma^\mu(p, p) = \frac{\partial \Sigma_e(p)}{\partial p_\mu} \tag{8.79}$$

which is referred to as the Ward-Takahashi identity. It is easily seen that the relation
holds between the lowest corrections Eq. (8.2) and Eq. (8.66b). Differentiating
Eq. (8.68a) and comparing with Eq. (8.74), we immediately see

$$Z_1 = Z_2 \tag{8.80}$$

8.1.8

Renormalization of the Scattering Amplitude

To see where all the renormalized constants $Z_1$, $Z_2$, $Z_3$ and $\delta m$ end up in the final
scattering amplitude, we consider an example, Compton scattering (Fig. 8.5).




Figure 8.5 (a) Compton scattering in the lowest order, (b) Up to fourth order with dressing. See 
Fig. 8.6 for dressing. 



The scattering amplitude corresponding to the first Feynman diagram on the rhs
in Fig. 8.5a is expressed as

$$\mathcal{M} \sim \epsilon'^{*}_{\mu}\,\bar u'\,(ie\gamma^\mu)\,\frac{i}{\not p + \not k - m}\,(ie\gamma^\nu)\,u\,\epsilon_\nu \tag{8.81}$$

where we have suppressed the spin and polarization suffixes and have written
$u_s(p') \to u'$, $u_r(p) \to u$, $\epsilon_\mu(k', \lambda') \to \epsilon'$. Taking into account higher order corrections,
the internal as well as external lines of the electron propagator acquire $Z_2$ in
addition to $m \to m - \delta m$, the photon propagator $Z_3$, and the vertex function $Z_1^{-1}$.
Namely, after including the corrections, the scattering amplitude is modified to

$$\mathcal{M} \sim (Z_3\epsilon'^{*}_{\mu})(Z_2\bar u')(iZ_1^{-1}e\gamma^\mu)\left(\frac{iZ_2}{\not p + \not k - m + \delta m}\right)(iZ_1^{-1}e\gamma^\nu)(Z_2 u)(Z_3\epsilon_\nu)(1 + \cdots) \tag{8.82}$$

where $\cdots$ are contributions from $\Pi_e(p^2)$, $\hat\Pi_\gamma(q^2)$, finite terms in the vertex correction
and others. We divide $Z_2$, $Z_3$ into two as $Z_2^{1/2} \times Z_2^{1/2}$, $Z_3^{1/2} \times Z_3^{1/2}$ and move
them to combine them with the wave function and the coupling constant $e$. Then

$$\mathcal{M} \sim (Z_3^{1/2}\epsilon'^{*}_{\mu})(Z_2^{1/2}\bar u')(iZ_1^{-1}Z_2Z_3^{1/2}e\gamma^\mu)\,\frac{i}{\not p + \not k - m_R}\,(iZ_1^{-1}Z_2Z_3^{1/2}e\gamma^\nu)(Z_2^{1/2}u)(Z_3^{1/2}\epsilon_\nu)(1 + \cdots) \tag{8.83}$$

The modified expression does not contain diverging constants ($Z_i$'s, $\delta m$). In summary,
by the replacements

$$e \to e_R = Z_1^{-1}Z_2Z_3^{1/2}e = Z_3^{1/2}e \tag{8.84a}$$

$$m \to m_R = m - \delta m \tag{8.84b}$$

$$\psi \to \psi_R = Z_2^{1/2}\psi \tag{8.84c}$$

$$A_\mu \to A_{R\mu} = Z_3^{1/2}A_\mu \tag{8.84d}$$

all the diverging constants are absorbed in the redefinition of the coupling
constant, mass, and wave function normalizations. Corrections to the second Feynman
diagram in Fig. 8.5a can be treated similarly. We call the propagators and the
vertex functions with corrections "dressed". Then the two lowest order amplitudes
written in terms of dressed particles are modified to the two diagrams written in
bold lines in Fig. 8.5b. The dressed diagram contains diagrams as depicted in
Fig. 8.6.
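
The redistribution of the $Z$ factors between Eqs. (8.82) and (8.83) is pure bookkeeping and can be checked by counting powers. The sketch below is an illustrative check only; the dictionaries encode the powers of $Z_1$, $Z_2$, $Z_3$ carried by each factor of the two expressions as read from the text, and the helper name is hypothetical.

```python
from collections import Counter
from fractions import Fraction as F


def total_powers(factors):
    """Add up the powers of each Z_i over a list of {name: power} factors."""
    total = Counter()
    for factor in factors:
        for name, power in factor.items():
            total[name] += power
    return {name: power for name, power in total.items() if power != 0}


# Eq. (8.82): photon, electron, vertex, propagator, vertex, electron, photon.
before = [{"Z3": 1}, {"Z2": 1}, {"Z1": -1}, {"Z2": 1}, {"Z1": -1}, {"Z2": 1}, {"Z3": 1}]

# Eq. (8.83): half of each Z2, Z3 is moved from the lines into the adjacent vertices.
vertex = {"Z1": -1, "Z2": 1, "Z3": F(1, 2)}
after = [{"Z3": F(1, 2)}, {"Z2": F(1, 2)}, vertex, {}, vertex, {"Z2": F(1, 2)}, {"Z3": F(1, 2)}]

if __name__ == "__main__":
    print("before:", total_powers(before))
    print("after :", total_powers(after))
```

Both expressions carry $Z_1^{-2}Z_2^{3}Z_3^{2}$ in total, so nothing is gained or lost in the rearrangement. With the Ward-Takahashi result $Z_1 = Z_2$, the vertex combination $Z_1^{-1}Z_2Z_3^{1/2}$ reduces to $Z_3^{1/2}$, as in Eq. (8.84a).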

Figure 8.6 Irreducible diagrams such as the first term on the rhs can be dressed by taking
the self-energy and vertex corrections. Reducible diagrams to the right of irreducible
diagrams can be included in the irreducible diagram simply by dressing the propagators
and the vertices of the irreducible diagram.

There are other higher order effects that cannot be expressed by simply dressing
the particle, for example the third and fourth diagrams in Fig. 8.5b. Their contribution
is finite and is included in $(1 + \cdots)$. They are referred to as irreducible diagrams,
which are defined as ones that cannot be separated by cutting a single line.
They have to be calculated separately. Examples of reducible diagrams are shown
on the rhs of Fig. 8.6. The reducible diagrams in Fig. 8.6 (diagrams on the rhs) can
be included in the irreducible diagram (the first diagram) by simply dressing its
propagators and vertices.
The prescription developed above seems rather artificial. All the infinities are
contained in the mass $m_R$, the coupling constant $e_R$ and the wave function normalization.
We simply swept the dust under the carpet. However, what we measure in
experiments are masses and coupling constants with all the theoretical corrections
included. Then $m$, $\alpha$ that appear in the Lagrangian should be regarded as mere parameters,
and measured numbers, such as $m = 0.511$ MeV, $\alpha = 1/137$, ... should
be assigned to $m_R = m - \delta m$ and $e_R = e - \delta e$. As renormalized theory is written
purely in terms of $e_R$ and $m_R$, which are observed quantities, it does not include
any infinities and is mathematically consistent in itself. QED, or quantum electrodynamics,
thus established by Tomonaga, Schwinger and Feynman was the first
successful gauge field theory that could give consistent and unambiguous predictions
starting from first principles. Nevertheless, ad hoc treatments of infinities,
such as renormalization, can only be justified if they agree with experiment.



8.2.1 
Lamb Shift 

The first term in Eq. (8.59) of the Mott scattering amplitude with radiative correction
represents the usual Coulomb force in the static limit. The second term is
the QED correction and is referred to as the vacuum polarization effect. In quantized
field theories, the vacuum is not an empty space but is constantly creating and
annihilating charged particle pairs. Under the influence of the applied external
electromagnetic field, those pairs naturally polarize. The vacuum behaves like a
dielectric material and its effect appears as a part of the radiative correction. One
phenomenon in which this correction appears is the energy shift of the hydrogen
levels. They can be calculated by including the Coulomb potential in the Dirac
equation, but without using quantized field theory the levels $2s_{1/2}$ and $2p_{1/2}$ are degenerate.
Experimentally, there is a gap between the two energy levels (see Fig. 4.1),
which is called the Lamb shift [248]. The energy levels are calculated by taking matrix
elements of the effective Coulomb potential between the two hydrogen wave
functions. The effective Coulomb potential can be obtained by considering higher
order corrections to the Mott scattering amplitude. In addition to the vacuum polarization
effect, they also include vertex correction, together with the infrared and
the anomalous magnetic moment effects, as we saw in Sect. 8.1.2. Altogether, the
net effect on the Mott amplitude is given by

W) \ Y ,U_J^( ln ^_l_l)\ + ^Li^3l] 1 {p) (8 . 8 5) 
KP) \f \ 3W V ix 8 5J\ in 2m J |,p [F > [ ' 

The first term in the braces $\{\cdots\}$ represents the first-order Mott scattering amplitude.
The first and second terms in the parentheses $(\cdots)$ are the vertex corrections,
$-1/5$ is the vacuum polarization and the last term in the square brackets $[\cdots]$ is
the contribution due to the anomalous magnetic moment. The correction includes
terms proportional to $q^2$, which for the static field is $\nabla^2$ acting on the space coordinate
of the field. The operation of $\nabla^2$ on the Coulomb potential produces $\delta^3(\boldsymbol{x})$.
Namely, only the wave function at the origin contributes, hence it affects the s state
but not the p state. Therefore, the correction removes the degeneracy between $s_{1/2}$
and $p_{1/2}$. In configuration space, the effective potential shift becomes [215]

$$\Delta V_{\text{eff}} = \frac{4\alpha^2}{3m^2}\Big(\underbrace{\ln\frac{m}{\mu}}_{(1)}\ \underbrace{-\ \frac{3}{8}}_{(2)}\ \underbrace{-\ \frac{1}{5}}_{(3)}\ \underbrace{+\ \frac{3}{8}}_{(4)}\Big)\delta^3(\boldsymbol{x}) + \underbrace{\frac{\alpha^2}{4\pi m^2 r^3}\,\boldsymbol{\sigma}\cdot\boldsymbol{L}}_{(4')} \tag{8.86}$$

The vertex correction terms [(1) and (2)] are larger than the vacuum polarization
effect (3) in the Lamb shift. The infrared cut is determined by extrapolating its
contribution and connecting it smoothly with that obtained by nonrelativistic treatment,
which is finite. The contribution from the anomalous magnetic moment
[terms (4) and (4')] can be calculated by extracting the lowest order matrix element
of the effective Hamiltonian

X = -K^P(x)\^o oi F Q ^\xp(x), k=^- (8.87) 

If we express the energy shift by its corresponding frequency ($\Delta E = h\Delta\nu$), the
vertex correction amounts to 1010 MHz, the vacuum correction to $-27.1$ MHz, and
the anomalous magnetic moment to 68 MHz, giving a total of 1051 MHz which is
accurate to about 6 MHz. Since then, higher order corrections have been calculated,
so the accuracy is now ~ 0.01 MHz.
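
The arithmetic behind these numbers is easy to reproduce. The following sketch adds the three lowest order contributions quoted above and compares the theory and experiment values listed below in units of their combined uncertainty; the variable names are illustrative and adding the errors in quadrature is an assumption about how they combine.

```python
import math

# Lowest order contributions to the 2s1/2 - 2p1/2 splitting, in MHz (values from the text).
contributions_mhz = {
    "vertex correction": 1010.0,
    "vacuum polarization": -27.1,
    "anomalous magnetic moment": 68.0,
}
lowest_order_total = sum(contributions_mhz.values())

# Values with higher order corrections included: (central value, uncertainty) in MHz.
theory = (1057.8445, 0.0026)
experiment = (1057.8450, 0.0090)

# Difference in units of the combined uncertainty (errors added in quadrature).
pull = (experiment[0] - theory[0]) / math.hypot(theory[1], experiment[1])

if __name__ == "__main__":
    print(f"lowest order total = {lowest_order_total:.1f} MHz")
    print(f"(experiment - theory) / sigma = {pull:.2f}")
```

The lowest order sum reproduces the quoted 1051 MHz, and the refined theory and experiment agree well within their uncertainties.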

Theory total 1057.8445 ± 0.0026 MHz [279] 
Experiment 1057.8450 ± 0.0090 MHz 

As one can see, the measurement is precise enough to test the correction. It was 
QED's first major success to convince people that it is the right theory for the elec- 
tromagnetic interactions of the electron. 



8.2.2

$g - 2$

In the radiative correction to the vertex, a term proportional to $q_\nu\sigma^{\mu\nu}$ appears. This
is just $\boldsymbol{\sigma}\cdot\nabla\times\boldsymbol{A} \sim \boldsymbol{\mu}\cdot\boldsymbol{B}$ and is a higher order correction to the magnetic moment
that alters the value of $g - 2$ from zero. The statement that the Dirac particle
has $g = 2$ is no longer correct when higher order QED effects are included. We
have calculated the first loop correction to the vertex and obtained a correction that
amounts to $\alpha/2\pi$ [see Eq. (8.22)]. Calculations of even higher orders have since
been performed [219, 232] and have been compared with the experimental values:

$$\left(\frac{g-2}{2}\right)_e = \frac{1}{2}\left(\frac{\alpha}{\pi}\right) - 0.328\,478\,444\,002\,90\,(60)\left(\frac{\alpha}{\pi}\right)^2 + 1.181\,234\,016\,828\,(19)\left(\frac{\alpha}{\pi}\right)^3$$

$$- 1.9144\,(35)\left(\frac{\alpha}{\pi}\right)^4 + 0.0\,(3.8)\left(\frac{\alpha}{\pi}\right)^5 + \cdots \tag{8.88a}$$

Inserting the value of the fine structure constant, the theoretical value can be compared
with the experimental data:

$$(1\,159\,652\,180.85 \pm 0.76) \times 10^{-12} \quad \text{experiment [219, 311]} \tag{8.88b}$$

The experimental value nowadays is so accurate that it perfectly agrees with the
theoretical value. In fact, the theoretical error is larger due to uncertainty in the
fine structure constant. Therefore, the experimental value is used to determine the
most accurate value of the fine structure constant [219]:

$$\alpha^{-1} = 137.035\,999\,069\,(96) \quad [\text{relative error} = 0.70\ \text{ppb}^{7)}] \tag{8.89}$$

The experimental value of $g - 2$ is no longer a value to be tested but a standard.
Similar calculations have also been made for the muon [311] 

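$$\left(\frac{g-2}{2}\right)_\mu = \frac{1}{2}\left(\frac{\alpha}{\pi}\right) + 0.765\,857\,410\left(\frac{\alpha}{\pi}\right)^2 + 24.050\,509\left(\frac{\alpha}{\pi}\right)^3 + 130.805\left(\frac{\alpha}{\pi}\right)^4 + \cdots \tag{8.90a}$$

  = (116 584 718.10 ± 0.16) × 10^{-11}   QED only
    (+154 ± 2) × 10^{-11}                electroweak
    (+6916 ± 46) × 10^{-11}              hadronic
  = (116 591 788 ± 46) × 10^{-11}        theory total        (8.90b)

  = (116 592 080 ± 54) × 10^{-11}        experiment          (8.90c)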

The different value of the muon $g - 2$ reflects the difference of the mass. The
error in the muon is much larger than that of the electron. This is because for the
muon the hadronic as well as weak interaction correction is sizeable (Fig. 8.7b-e).
The hadronic correction enters in the process in which a virtual photon becomes
hadrons (i.e. quark pairs). The hadronic correction uses the high-energy $e^-e^+ \to$
hadrons data.
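
The perturbation series of Eqs. (8.88a) and (8.90a) can be summed directly. The sketch below is a plain evaluation of the quoted coefficients with $\alpha^{-1}$ from Eq. (8.89); it uses only the terms listed in the text, so small contributions beyond them (and the hadronic and electroweak parts) are not included.

```python
import math

ALPHA_INV = 137.035999069  # Eq. (8.89)

# Coefficients of (alpha/pi)^n, n = 1, 2, ..., as quoted in Eqs. (8.88a) and (8.90a).
ELECTRON_COEFFS = [0.5, -0.32847844400290, 1.181234016828, -1.9144, 0.0]
MUON_COEFFS = [0.5, 0.765857410, 24.050509, 130.805]


def qed_anomaly(coeffs, alpha_inv=ALPHA_INV):
    """Sum c_n (alpha/pi)^n over the listed coefficients."""
    x = 1.0 / (alpha_inv * math.pi)
    return sum(c * x ** (n + 1) for n, c in enumerate(coeffs))


if __name__ == "__main__":
    a_e = qed_anomaly(ELECTRON_COEFFS)
    a_mu = qed_anomaly(MUON_COEFFS)
    print(f"a_e  (QED series) = {a_e:.12e}")
    print(f"a_mu (QED series) = {a_mu:.12e}")
    # Sum of the three pieces of Eq. (8.90b), in units of 1e-11.
    print("muon theory total =", 116584718.10 + 154 + 6916)
```

The electron sum lands on the measured value of Eq. (8.88b) to the digits printed, which is expected since $\alpha$ was extracted from that measurement. The muon sum reproduces the "QED only" line of Eq. (8.90b) up to the terms omitted here.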

Although the procedure of subtracting infinity from infinity is aesthetically unsatisfactory,
a theory that can reproduce the experimental result to the accuracy of
$10^{-10}$ must have some truth in it. Our trust in the renormalization procedure
thus has a firm experimental foundation. In fact, the confidence in the gauge theory
starting from QED is so solid that any deviation of the experimental measurement
of $g - 2$ is interpreted as an effect of unknown new physics. Indeed, a small discrepancy
from the theoretical prediction was detected in measurements of the $g - 2$ of
the muon [281]. It could be an experimental error, but if real it could signal possible
new physics, for instance, supersymmetry (see Chap. 19), and is one of the hot
topics of the day (as of 2010). Nowadays, the precision measurement of a physical
object to look for a possible deviation from the theoretical prediction is considered
a valid and powerful method to search for new physics.

7) ppb: parts per billion = $10^{-9}$.

Figure 8.7 Feynman diagrams for $g - 2$ of the muon. (a) Schwinger term [$O(\alpha)$ correction],
(b) hadronic contribution, (c) hadronic light-by-light contribution, (d,e) lowest order $W^\pm$, $Z$
contribution.



8.2.3 

Limit of QED Applicability 

QED is a quantized version of the well-established classical Maxwell theory. At the
macroscopic level, it can be applied to the magnetic field in the periphery of Jupiter
and to the galactic magnetic field. Namely, it is valid at the scale of at least a few
hundred light years. What about at the microscopic level? The fact that there was a
limit to the applicability of classical electromagnetism at the turn of the 20th century
led to the introduction of quantum theory. We can test the validity of QED at
small distances or at high energy by performing $e^- + e^+ \to \mu^- + \mu^+$ and Bhabha
scattering ($e^- + e^+$ scattering), etc. If we parametrize the difference of the electron
propagator between the theory and the experiment by $F = 1 + s/\Lambda^2$, where $s$ is the
total energy squared, we can estimate quantitatively the applicability limit of QED
by the value of $\Lambda$. At the present experimental level, we see no deviation from QED.
The present experimental lower limit of $\Lambda$ is larger than 1 TeV. In other words, we
know that QED is valid down to the length scale of $10^{-20}$ m. Therefore, QED is
valid in scales over the range of 40 orders of magnitude and is still extending its
validity. QED's triumph is remarkable.
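
The conversion between the cutoff $\Lambda$ and a length scale uses $\hbar c \approx 197.327$ MeV fm. The sketch below is an order-of-magnitude illustration; the sample values of $\Lambda$ and the choice of 300 light years for "a few hundred light years" are assumptions made here.

```python
import math

HBAR_C_GEV_M = 1.973269804e-16  # hbar * c in GeV * m
LIGHT_YEAR_M = 9.4607e15        # one light year in metres


def length_scale_m(lambda_gev):
    """Distance hbar*c / Lambda probed at an energy scale Lambda (in GeV)."""
    return HBAR_C_GEV_M / lambda_gev


def orders_of_magnitude(large_m, small_m):
    """Number of decades between two length scales."""
    return math.log10(large_m / small_m)


if __name__ == "__main__":
    macroscopic = 300 * LIGHT_YEAR_M  # assumed macroscopic scale
    for lam in (1.0e3, 1.0e4):        # 1 TeV and 10 TeV, in GeV
        d = length_scale_m(lam)
        print(f"Lambda = {lam:8.0f} GeV  ->  {d:.2e} m, "
              f"{orders_of_magnitude(macroscopic, d):.1f} decades")
```

A limit of 1 TeV corresponds to about $2\times10^{-19}$ m, and limits of order 10 TeV to about $2\times10^{-20}$ m, so the span between the macroscopic and microscopic tests comes out at close to 40 decades.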



8.2.4 

E821 BNL Experiment 



Although the measured value $g - 2$ of the electron is much more precise and is used
to define the prime fine structure constant $\alpha$, the
effects other than QED, including the electroweak effect (i.e. the contribution of
Feynman diagrams that contain $W^\pm$, $Z^0$) and hadronic interactions and possibly
new physics. The contribution of the physics of the energy scale $M$ to $g - 2$ is
proportional to $(m/M)^2$ and hence the effect is $\sim (m_\mu/m_e)^2 \sim 4.3 \times 10^4$ larger for
the muon. Even though $a_\mu = (g - 2)/2$ is ~ 800 times less precise than $a_e$, it is
still ~ 50 times more sensitive to new physics.
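
The two numbers in this estimate follow from the mass ratio alone. A minimal sketch, assuming the $(m/M)^2$ scaling stated above and taking the factor 800 in precision from the text:

```python
M_MU_MEV = 105.6583755  # muon mass in MeV
M_E_MEV = 0.51099895    # electron mass in MeV


def relative_sensitivity(precision_ratio=800.0):
    """Return ((m_mu/m_e)^2, net sensitivity gain of a_mu over a_e).

    precision_ratio is how many times less precisely a_mu is measured than a_e.
    """
    enhancement = (M_MU_MEV / M_E_MEV) ** 2
    return enhancement, enhancement / precision_ratio


if __name__ == "__main__":
    enhancement, gain = relative_sensitivity()
    print(f"(m_mu/m_e)^2 = {enhancement:.3e}, net gain = {gain:.0f}")
```

This gives an enhancement of about $4.3\times10^4$ and a net gain of about 50, as quoted.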

Since hadronic as well as weak effects on g — 2 can be calculated accurately using 
the now established Standard Model (SM), the deviation, if any, of a measurement 
from the prediction of SM could be a possible clue to new physics. For this reason, 
the precise measurement of g — 2 of the muon not only serves as a crucial test of 
QED but is also at the forefront of experimental research in high-energy physics. 
We describe briefly how to measure the value of g — 2 experimentally. 

Principle of the Measurement A charged elementary particle with half-integer
spin has a magnetic dipole moment $\boldsymbol{\mu}$ with spin $\boldsymbol{s}$:

$$\boldsymbol{\mu} = g\left(\frac{q}{2m}\right)\boldsymbol{s} \tag{8.91}$$

where $q = \pm e$ is the charge of the particle and the proportionality constant $g$
is referred to as the Landé g-factor. In a homogeneous magnetic field where the
magnetic energy is given by $H = -\boldsymbol{\mu}\cdot\boldsymbol{B}$, the spin receives a torque from the field,
which is given by $\boldsymbol{N} = \boldsymbol{\mu}\times\boldsymbol{B}$, and the spin makes a precessive motion on the axis
parallel to the magnetic field. In the particle's rest frame the precession (Larmor)
frequency is given by

$$\omega_L = g\,\frac{qB}{2m} \tag{8.92}$$

but when the particle is moving, a relativistic effect called Thomas precession is
added and the net frequency is given by

$$\omega_s = g\,\frac{qB}{2m} - (\gamma - 1)\frac{qB}{\gamma m} \tag{8.93}$$

The particle makes a circular orbit perpendicular to the magnetic field. Assuming
$\boldsymbol{v}\cdot\boldsymbol{B} = 0$, its angular velocity is given by the cyclotron frequency $\omega_c = qB/\gamma m$ and the spin
precession frequency relative to the orbit becomes

$$\omega_a = \omega_s - \omega_c = \frac{g-2}{2}\,\frac{qB}{m} = a_\mu\frac{qB}{m} \tag{8.94}$$

Since $m$ (the muon mass) is known and the strength of the magnetic field can be
fixed by experimental conditions, what we need to determine the anomaly $a_\mu$ is
to measure the spin precession frequency. But at the precision level of parts per
billion (ppb), $B$ and $m$ have to be measured with full care, too.
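
A short numerical sketch makes the cancellation in $\omega_a = \omega_s - \omega_c$ explicit. It assumes the magnitudes $\omega_c = qB/\gamma m$ and $\omega_s = g\,qB/2m - (\gamma-1)qB/\gamma m$ described above, SI units, and a field of 1.45 T as a representative storage-ring value (an assumption, not a number given in this section).

```python
import math

E_CHARGE = 1.602176634e-19  # elementary charge in C
M_MU_KG = 1.883531627e-28   # muon mass in kg
A_MU = 1.16592080e-3        # measured anomaly, used here as input
B_FIELD = 1.45              # tesla, assumed storage-ring field


def cyclotron(gamma, B=B_FIELD):
    """Orbital angular frequency qB / (gamma m)."""
    return E_CHARGE * B / (gamma * M_MU_KG)


def spin_precession(gamma, B=B_FIELD, a=A_MU):
    """Larmor term g qB/2m plus the Thomas term -(gamma - 1) qB / (gamma m)."""
    g = 2.0 * (1.0 + a)
    larmor = g * E_CHARGE * B / (2.0 * M_MU_KG)
    thomas = -(gamma - 1.0) * E_CHARGE * B / (gamma * M_MU_KG)
    return larmor + thomas


def anomalous_precession(gamma, B=B_FIELD, a=A_MU):
    """omega_a = omega_s - omega_c; should equal a * qB / m for any gamma."""
    return spin_precession(gamma, B, a) - cyclotron(gamma, B)


if __name__ == "__main__":
    for gamma in (1.5, 10.0, 29.3):
        omega_a = anomalous_precession(gamma)
        print(f"gamma = {gamma:5.1f}  f_a = {omega_a / (2 * math.pi) / 1e3:.2f} kHz")
```

The printed frequency is the same for every $\gamma$, about 229 kHz for this field, which illustrates why $\omega_a$ depends only on $a_\mu$, $B$ and $m$.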
Thomas precession may be understood as follows. Imagine an airplane flying
in a circular orbit. Its motion may be considered as a series of longitudinal
advances by $dl_\parallel \simeq r\Delta\theta$ accompanied by a transverse shift $dl_\perp = \Delta\theta\, dl_\parallel$.
Inside the airplane (variables denoted by a prime), the longitudinal distance is
Lorentz contracted ($dl'_\parallel = dl_\parallel/\gamma$), but the transverse distance is not ($dl'_\perp = dl_\perp$),
giving the angular shift ($\Delta\theta' = \gamma\Delta\theta$). When the airplane comes back
to the original point, the rotation angle in the airplane frame (comoving frame)
has an excess $2\pi(\gamma - 1)$ relative to the ground frame. Therefore, the rate of
extra advance rotation per period (angular frequency) is given by

$$\frac{\Delta\omega}{\omega} = \frac{2\pi(\gamma - 1)}{2\pi} = \gamma - 1 \tag{8.95}$$

This is a shift of a reference axis with which the rotation angle of a spinning
object is measured. The advance shift of the reference means the spin
precession frequency of a muon moving in a circular orbit with the velocity
$v = 2\pi r\omega$ in the laboratory frame is smaller by $\gamma - 1$. Since the circular
motion of the muon is due to the Lorentz force of the magnetic field and its angular
frequency (cyclotron frequency $\omega_c$) itself becomes smaller by $\gamma$ because
of the relativistic mass increase, the net change in the angular frequency due
to the Thomas precession becomes $\Delta\omega_{\text{Thomas}} = -(qB/m)(\gamma - 1)/\gamma$.
Note: The Thomas precession is a purely kinematical effect independent of
the dynamics of the spinning object and is common to objects in quantum
mechanics and classical tops or gyroscopes.



Experimental Method 

(i) Measurement of $\omega_a$: Firstly, we have to prepare a large enough number of polarized
muons and a method to measure their direction. Polarized muons can be
obtained by collecting decay muons emitted in the forward direction of flying pions.
The weak interaction mediated by the gauge boson $W$ acts only on the left-handed
particles. Because of this, the decay neutrino from $\pi^+$ in the reaction

$$\pi^+ \to \mu^+ + \nu_\mu \tag{8.96a}$$

$$\mu^+ \to e^+ + \nu_e + \bar\nu_\mu \tag{8.96b}$$

has helicity $-1$ and by angular momentum conservation $\mu^+$ has also helicity $-1$,
namely is 100% polarized in the pion rest frame. Consequently, muons emitted in
the forward direction in the laboratory frame along the high-energy pion beam are
also highly polarized. The left-handed nature of the weak interaction also works in
muon decay and $e^\pm$ are preferentially emitted antiparallel (parallel) to the polarization
of $\mu^\pm$. Therefore the electron intensity at a fixed direction relative to the
muon momentum is modulated by the muon spin precession (see Fig. 8.8). Namely,
the muon has a built-in polarization analyzer. The modulation frequency directly
reflects $\omega_a$ in Eq. (8.94).

(ii) Measurement of $B$: The magnetic field $B$ is measured by observing the Larmor
frequency of stationary protons, $\omega_p = g_p(eB/2m_p)$, in a nuclear magnetic resonance
(NMR) probe. This requires the value of $\mu_\mu/\mu_p$ to extract $a_\mu$ from the data.
Fortunately, it can be determined precisely from the measurement of the hyperfine
structure in muonium ($\mu^+e^-$ bound state). From Eq. (8.94) one obtains

$$a_\mu = \frac{R}{\lambda - R}, \qquad R = \frac{\omega_a}{\omega_p}, \quad \lambda = \frac{\mu_\mu}{\mu_p} \tag{8.97}$$



(iii) Focusing of the muon beam: To store muons in a circular orbit and retain them
for the duration of the measurement, focusing of the beam is necessary. This can
be achieved by making gradients on the magnetic field, but then the homogeneity
of the field is sacrificed, which is the most critical factor in the experiment to
obtain high precision at ppb level. Therefore, E821 (and also previous CERN experiments
[36]) used an electrostatic quadrupole field focusing. With the presence of
the electric field, the precession frequency is modified to [216, 276]

$$\boldsymbol{\omega}_a = -\frac{q}{m}\left[a_\mu\boldsymbol{B} - \left(a_\mu - \frac{1}{\gamma^2 - 1}\right)\frac{\boldsymbol{\beta}\times\boldsymbol{E}}{c}\right] \tag{8.98}$$

For $\gamma_{\text{magic}} = 29.3$ ($p_\mu = 3.09$ GeV/c), the second term vanishes and $\omega_a$ reduces to
Eq. (8.94) again.
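
The magic momentum follows from requiring the coefficient of the electric field term to vanish, $a_\mu = 1/(\gamma^2 - 1)$. A minimal sketch of this condition, with the measured $a_\mu$ as input and illustrative function names; it also evaluates the spin advance per turn, $2\pi a_\mu\gamma$, which follows from $\omega_a/\omega_c = a_\mu\gamma$.

```python
import math

A_MU = 1.16592080e-3      # anomaly used as input
M_MU_GEV = 0.1056583755   # muon mass in GeV


def electric_term_coefficient(gamma, a=A_MU):
    """Coefficient a - 1/(gamma^2 - 1) multiplying the beta x E term."""
    return a - 1.0 / (gamma**2 - 1.0)


def magic_gamma(a=A_MU):
    """Lorentz factor at which the electric field term drops out."""
    return math.sqrt(1.0 + 1.0 / a)


def magic_momentum_gev(a=A_MU, m=M_MU_GEV):
    """p = m * sqrt(gamma^2 - 1) = m / sqrt(a) at the magic gamma."""
    return m / math.sqrt(a)


def precession_per_turn_deg(gamma, a=A_MU):
    """Spin advance relative to the momentum per orbit: 360 deg * a * gamma."""
    return 360.0 * a * gamma


if __name__ == "__main__":
    g = magic_gamma()
    print(f"gamma_magic = {g:.3f}")
    print(f"p_magic     = {magic_momentum_gev():.3f} GeV/c")
    print(f"precession  = {precession_per_turn_deg(g):.2f} deg per turn")
```

The values come out close to $\gamma = 29.3$, $p = 3.09$ GeV/c and roughly $12^\circ$ of spin advance per turn.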

Figure 8.9a shows schematically the muon injection and storage in the g — 2 
ring. The trajectory of the injected beam is modified by three kicker modules to 
become a circle that stays in the ring. The spin precesses ~ 12° /circle (Fig. 8.9b). 
The decay electrons are measured by 24 lead-scintillator calorimeters distributed 
along the inner circle of the ring. Figure 8.8 shows the measured distribution of 
electron counts. The result gives 

$$a_\mu^{\rm exp} = 1.165\,920\,80(63) \times 10^{-3} \qquad (8.99)$$



Figure 8.8 Distribution of electron counts versus time for $3.6 \times 10^9$ muon decays in the
R01 $\mu^-$ data-taking period. The data are wrapped around modulo 100 μs. Data from [281].

Figure 8.9 (a) Schematic diagram of the E821 
muon storage ring [281]. Polarized muons 
with ~ 3.1 GeV/c enter the storage ring 
through a field-free channel. The three kick- 
er modulators provide a short current pulse, 
which gives the muon bunch a transverse 
10 mrad kick and places it in the ring orbit. 

where the number in parentheses denotes the error in the last two digits. The
experimental value is compared with the theoretical prediction



$$a_\mu^{\rm theor} = 1.165\,917\,93(68) \times 10^{-3}$$



which is $3.2\sigma$ ($\sigma$ = standard deviation) off. Whether the deviation is a possible sign
of new physics is under much debate.



9 
Symmetries 



When one does not know the principles or the fundamental equations that govern 
the phenomena one observes, one often obtains clues to them by looking at the 
symmetries or selection rules they exhibit. Historically, Maxwell constructed his 
unified theory of electromagnetism by assuming the existence of the displacement 
current, which was required by current conservation. Discovery of special relativity 
was inspired by Lorentz invariance of Maxwell's equations. History also shows that 
disclosed symmetries, more often than not, turn out to be approximate. Isospin and 
flavor SU(3), which led to the discovery of the quark model, are such examples.
Today, the conservation laws believed to be strict are those based on gauge symmetries.
Conservation of energy-momentum, angular momentum and charge is in this category.
The conservation of baryon and lepton numbers, although phenomenologically valid,
is not considered a strict law. Their mixing is an essential feature
of the grand unified theories (GUTs). Symmetries and conservation laws are inti- 
mately connected. Whether based on gauge symmetry or not, apparent symmetries 
and conservation laws can be powerful tools for discovering new physics. 

Symmetry means one cannot tell the difference before and after a certain trans- 
formation. For the case of continuous space-time translation, this means there 
is no special position or absolute coordinate system in setting space-time coordi- 
nates for describing a physical phenomenon. The same experiments done at two 
different places (say in Tokyo and in New York) or done at different times (today 
or tomorrow) should produce the same result. In other words, translational in- 
variance means the existence of certain nonobservables, i.e. absolute position and 
time. So far, we have elaborated on the relation between space-time translational
symmetry and the resultant energy-momentum conservation. This connection,
however, is a common feature of symmetries: similar relations can be derived for a
variety of symmetry transformations. We list some of them in Table 9.1 before
proceeding further.

Symmetries come in two categories, first, those related to space-time structure, 
which are often referred to as external, and second, those related to internal sym- 
metries, such as electric charge, isospin and color charge. Mathematically speak- 
ing, the latter leave the Lagrangian density invariant after the symmetry operation 
apart from a total derivative. It gives a surface integral that vanishes under normal 
circumstances. Space-time symmetries change the coordinate values, so only the
action, which is an integral of the Lagrangian density over space-time, is invariant.
In other classifications, symmetries are either continuous or discrete. We discuss the
continuous space-time symmetries first.

Elementary Particle Physics, Volume 1: Quantum Field Theory and Particles. Yorikiyo Nagashima
GmbH & Co. KGaA, Weinheim

Table 9.1 Example symmetries and conservation laws.

Nonobservables             Symmetry transformation           Conserved observables or selection rules

Space-time symmetry
Absolute position          $x \to x + a$                     Momentum
Absolute time              $t \to t + b$                     Energy
Absolute direction         $\theta \to \theta + \alpha$      Angular momentum
Absolute velocity          Lorentz transformation            Lorentz invariance
Right or left              $x \to -x$                        Parity

Internal symmetry
Particle identity          Permutation                       Bose-Einstein or Fermi-Dirac statistics
Charge + or −              $Q \to -Q$                        C invariance
Absolute phase             $\psi \to e^{-iQ\alpha}\psi$      Charge
 - among quarks            SU(3) gauge transformation        Color charge

Approximate symmetry
 - among leptons a)        $\psi \to e^{-iL\alpha}\psi$      Lepton number
 - among quarks            $\psi \to e^{-iB\alpha}\psi$      Baryon number
Near equal mass of u, d    Flavor SU(2)                      Isospin
 - of u, d, s              Flavor SU(3)                      Unitary spin

a) Phenomenologically, lepton and bary…



9.1

Continuous Symmetries

Where there is a symmetry, there exists a corresponding conservation law. What 
this means is that when an equation of motion or the Lagrangian is invariant under 
some symmetry operation, there exists a physical observable that does not change 
as a function of time. We have already shown some examples in Sect. 5.1.4 in the 
form of Noether's theorem. Here, we start by comparing how the symmetry and the 
invariance of the equation of motion are related in classical, quantum mechanical 
and quantum field theoretical treatments. 
9.1.1

Space and Time Translation 

Classical Mechanics In the Lagrangian formalism, the equation of motion is given
as the Euler-Lagrange formula

$$\frac{d}{dt}\left(\frac{\partial L(q_i,\dot q_i)}{\partial \dot q_i}\right) - \frac{\partial L(q_i,\dot q_i)}{\partial q_i} = 0 \qquad (9.1)$$

If the Lagrangian is translationally invariant, namely,

$$L(q_i) = L(q_i + a_i) \qquad (9.2)$$

or equivalently $\partial L/\partial q_i = 0$, the Euler-Lagrange formula Eq. (9.1) tells us that the
conjugate momentum defined by

$$p_i = \frac{\partial L}{\partial \dot q_i} \qquad (9.3)$$

is constant in time. Therefore, translational invariance in a space coordinate leads
to conservation of the conjugate momentum. Since the Lagrangian formalism is valid
in a general coordinate system, this statement is general. If $q_i$ is an angular
variable, the conservation of angular momentum follows. The time derivative of the
Hamiltonian is expressed as

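$$\frac{dH}{dt} = \frac{d}{dt}\left(\sum_i \frac{\partial L}{\partial \dot q_i}\dot q_i - L\right) = \sum_i\left[\ddot q_i\frac{\partial L}{\partial \dot q_i} + \dot q_i\frac{d}{dt}\left(\frac{\partial L}{\partial \dot q_i}\right)\right] - \left[\sum_i\left(\frac{\partial L}{\partial q_i}\dot q_i + \frac{\partial L}{\partial \dot q_i}\ddot q_i\right) + \frac{\partial L}{\partial t}\right] = -\frac{\partial L}{\partial t} \qquad (9.4)$$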

where the Euler-Lagrange equation was used to arrive at the last equation. The 
above expression tells us that the Hamiltonian is a constant of motion if the La- 
grangian does not contain time variables explicitly. Namely, translational invariance 
in time leads to energy conservation. 
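As a numerical illustration of these statements, the following sketch integrates a particle in a two-dimensional potential $V = kr^2/2$, which is invariant under rotations and has no explicit time dependence but is not invariant under translations. The potential, the initial conditions and the velocity-Verlet integrator are choices made here for illustration and are not taken from the text. One expects the energy and $L_z$ to stay constant (up to integration error) while $p_x$ does not.

```python
def force(x, y, k=1.0):
    # Isotropic harmonic potential V = k r^2 / 2: it depends on r only, so it is
    # invariant under rotations but not under translations of x or y.
    return -k * x, -k * y

def simulate(steps=10000, dt=1e-3, m=1.0, k=1.0):
    x, y, vx, vy = 1.0, 0.0, 0.0, 0.5
    fx, fy = force(x, y, k)
    history = []
    for _ in range(steps):
        # one velocity-Verlet step
        vx += 0.5 * dt * fx / m
        vy += 0.5 * dt * fy / m
        x += dt * vx
        y += dt * vy
        fx, fy = force(x, y, k)
        vx += 0.5 * dt * fx / m
        vy += 0.5 * dt * fy / m
        energy = 0.5 * m * (vx * vx + vy * vy) + 0.5 * k * (x * x + y * y)
        l_z = m * (x * vy - y * vx)
        history.append((energy, l_z, m * vx))
    return history

history = simulate()
energy_spread = max(h[0] for h in history) - min(h[0] for h in history)
lz_spread = max(h[1] for h in history) - min(h[1] for h in history)
px_spread = max(h[2] for h in history) - min(h[2] for h in history)
print("energy spread:", energy_spread)
print("L_z spread:   ", lz_spread)
print("p_x spread:   ", px_spread)
```

The energy and angular momentum spreads should be tiny compared with their values, whereas the momentum component oscillates with an amplitude of order one.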

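Quantum Mechanics Observables in quantum mechanics are expressed as matrix
elements of hermitian operators. If there is a unitary operator $\hat U$ 1) that does not
have explicit time dependence, and if the transformed wave function $\psi' = \hat U\psi$
obeys the same Schrödinger equation, then

$$\begin{aligned}
i\frac{\partial}{\partial t}\psi = \hat H\psi \;&\Rightarrow\; i\frac{\partial}{\partial t}\hat U\psi = \hat U\hat H\hat U^{-1}\hat U\psi \\
\therefore\; i\frac{\partial}{\partial t}\psi' = \hat H\psi' \;&\Rightarrow\; \hat U\hat H\hat U^{-1} = \hat H \quad\text{or}\quad [\hat H, \hat U] = 0
\end{aligned} \qquad (9.5)$$

1) The symmetry operation has to be unitary because the transition probability of the state before
and after the symmetry transformation has to be the same:
$\langle\psi'|\psi'\rangle = \langle\psi|\hat U^\dagger\hat U|\psi\rangle = \langle\psi|\psi\rangle$.
We attach a hat (^) to differentiate operators from numbers when we discuss objects in quantum
mechanics.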




The operator $\hat U$ commutes with the Hamiltonian. If $\hat U$ is a continuous function of
some parameter $\epsilon$ and $\hat U \to 1$ when $\epsilon \to 0$, then for infinitesimal $\epsilon$

$$\hat U \simeq 1 - i\epsilon\hat Q \qquad (9.6)$$

As $\hat U$ is unitary, $\hat Q$ is hermitian. The operator $\hat Q$ in $\hat U = e^{-i\hat Q\alpha}$, where $\alpha$ is a
continuous parameter, is called the generator of the transformation. Since a hermitian
operator in quantum mechanics is an observable, $\hat Q$ is an observable. As $\hat Q$
commutes with the Hamiltonian, the time derivative of its expectation value is given
by

$$\frac{d\langle\hat Q\rangle}{dt} = \frac{d}{dt}\langle\psi|\hat Q|\psi\rangle = i\langle\psi|[\hat H, \hat Q]|\psi\rangle = 0 \qquad (9.7)$$

Therefore, if there is a continuous symmetry represented by a unitary operator $\hat U$
that leaves the equation of motion invariant, its generator $\hat Q$ is a constant of motion.
The Hamiltonian is the time translation operator. This is obvious from

$$\psi(t) = e^{-i\hat H t}\psi(0) \qquad (9.8)$$

To find the corresponding space translation operator $\hat U(a) = e^{-ia\hat Q}$ that is defined
by

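$$\psi(x) \to \psi(x + a) = \hat U(a)\psi(x) \qquad (9.9a)$$

we make the displacement infinitesimal:

$$\begin{aligned}
\hat U(\epsilon)\psi(x) &= [1 - i\epsilon\hat Q + O(\epsilon^2)]\psi(x) = \psi(x + \epsilon) = \psi(x) + \epsilon\frac{\partial}{\partial x}\psi(x) + O(\epsilon^2) \\
\therefore\; \hat Q &= i\frac{\partial}{\partial x} = -\hat p_x
\end{aligned} \qquad (9.9b)$$

A Lorentz invariant expression can be obtained by combining Eqs. (9.8) and (9.9):

$$\psi(x^\mu + a^\mu) = e^{-ia_\mu\hat P^\mu}\psi(x^\mu) \qquad (9.10a)$$

$$\hat P^\mu = i(\partial_0, -\nabla) \qquad (9.10b)$$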
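The statement that the momentum operator generates space translations can be made concrete for polynomial wave functions, where the exponential series terminates. The sketch below, in which the helper names and the sample polynomial are purely illustrative, applies $e^{a\,d/dx} = e^{ia\hat p_x}$ (with $\hat p_x = -i\,d/dx$) term by term and compares the result with a direct evaluation of $\psi(x + a)$.

```python
from math import factorial

def derivative(coeffs):
    """Coefficients of d/dx of sum_k coeffs[k] * x**k."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    return sum(c * x**k for k, c in enumerate(coeffs))

def translate(coeffs, a):
    """Apply exp(a d/dx) = sum_n a^n/n! (d/dx)^n to a polynomial."""
    result = [0.0] * len(coeffs)
    term = list(coeffs)            # n-th derivative, starting with n = 0
    n = 0
    while term:                    # the series stops once the derivative vanishes
        for k, c in enumerate(term):
            result[k] += a**n / factorial(n) * c
        term = derivative(term)
        n += 1
    return result

psi = [1.0, -2.0, 0.5, 3.0]        # psi(x) = 1 - 2x + 0.5 x^2 + 3 x^3 (arbitrary example)
shifted = translate(psi, 0.7)
print(evaluate(shifted, 1.2), evaluate(psi, 1.2 + 0.7))
```

The two printed numbers should agree to rounding accuracy, which is simply Taylor's theorem written in operator form.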

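Problem 9.1

For a translation generated by the momentum operator $\hat{\boldsymbol p}$,

$$\hat O(x + a) = e^{i\boldsymbol a\cdot\hat{\boldsymbol p}}\,\hat O(x)\,e^{-i\boldsymbol a\cdot\hat{\boldsymbol p}} \qquad (9.11a)$$

where $\hat O(x)$ is a function of $\hat{\boldsymbol p}$ and $x$, prove

$$\nabla\hat O(x) = i[\hat{\boldsymbol p}, \hat O(x)] \qquad (9.11b)$$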
Translational Invariance in Quantum Field Theory

The covariant version of the Heisenberg equation of motion is expressed as

$$\partial_\mu O = i[P_\mu, O] \qquad (9.12a)$$

$$O(x + a) = e^{ia^\mu P_\mu}\,O(x)\,e^{-ia^\mu P_\mu} \qquad (9.12b)$$

The field theoretical version of the energy-momentum operator is given by

$$P^\mu = \int d^3x\, T^{\mu 0} \qquad (9.13)$$

where $T^{\mu\nu}$ is the energy-momentum tensor that can be derived from the
Lagrangian using Noether's theorem. Here, $P^\mu$ and $O(x)$ are functions of the
field operators. Note that, corresponding to the negative sign of the space variables in
the Lorentz-invariant expression ($a^\mu P_\mu = a^0P^0 - \boldsymbol a\cdot\boldsymbol P$), $P_\mu = (P^0, -\boldsymbol P)$ is used.

We first prove that Eq. (9.12b) is equivalent to the Heisenberg equation (9.12a).
To prove the equivalence, it is enough to show that each can be derived from the
other. Consider $a^\mu$ in Eq. (9.12b) as a small number $\epsilon^\mu$, expand both
sides of the equation and compare terms of $O(\epsilon)$:

$$(1 + i\epsilon^\mu P_\mu)\,O(x)\,(1 - i\epsilon^\mu P_\mu) \simeq O(x) + \epsilon^\mu\partial_\mu O \qquad (9.14a)$$

$$\therefore\; i[P_\mu, O(x)] = \partial_\mu O(x) \qquad (9.14b)$$

Conversely, to derive Eq. (9.12b) from Eq. (9.12a), we use the Baker-Campbell-
Hausdorff (BCH) formula.



Baker-Campbell-Hausdorff formulae Let A, B be operators.

Theorem 9.1

$$e^A B e^{-A} = B + [A, B] + \frac{1}{2!}[A, [A, B]] + \cdots + \frac{1}{n!}\underbrace{[A, [A, [A, \cdots [A}_{n\ A\text{'s}}, B]]]] + \cdots \qquad (9.15)$$

Corollary 9.1

$$[B, e^{-A}] = e^{-A}\left([A, B] + \frac{1}{2!}[A, [A, B]] + \cdots\right) \qquad (9.16a)$$

$$[e^A, B] = \left([A, B] + \frac{1}{2!}[A, [A, B]] + \cdots\right)e^A \qquad (9.16b)$$
Theorem 9.2

Assuming $[A, [A, B]] = [B, [B, A]] = 0$,

$$e^{A+B} = e^A e^B e^{-\frac{1}{2}[A,B]} = e^B e^A e^{\frac{1}{2}[A,B]} \qquad (9.17a)$$

$$e^A e^B = e^B e^A e^{[A,B]} \qquad (9.17b)$$


Corollary 9.2

$$e^A e^B e^{-A} = e^B e^{[A,B]} \qquad (9.18)$$



Problem 9.2 

Prove Theorem 9.1 and Theorem 9.2 

Replace $A$ by $iP_\mu a^\mu$ and $B$ by $O$, and make use of Eq. (9.12a); then

$$e^{iP_\mu a^\mu}\,O(x)\,e^{-iP_\mu a^\mu} = O(x) + a^\mu\partial_\mu O(x) + \cdots + \frac{1}{n!}(a^\mu\partial_\mu)^n O(x) + \cdots = O(x + a) \qquad (9.19)$$
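The operator identities used in this step can be checked numerically on small matrices. The sketch below is only an illustration with arbitrarily chosen matrices and hand-written helper functions: it compares $e^A B e^{-A}$ with the nested-commutator series of Theorem 9.1, and it tests the standard form $e^{A+B} = e^A e^B e^{-[A,B]/2}$ of Theorem 9.2 on strictly upper-triangular $3\times3$ matrices, for which $[A, B]$ commutes with both $A$ and $B$.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def add(a, b, sb=1.0):
    return [[x + sb * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, s):
    return [[s * x for x in row] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def comm(a, b):
    return add(matmul(a, b), matmul(b, a), -1.0)

def expm(a, terms=40):
    """Matrix exponential from its Taylor series (adequate for small matrices)."""
    result, term = identity(len(a)), identity(len(a))
    for n in range(1, terms):
        term = scale(matmul(term, a), 1.0 / n)
        result = add(result, term)
    return result

def max_diff(a, b):
    return max(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))

# Theorem 9.1: e^A B e^{-A} = B + [A,B] + [A,[A,B]]/2! + ...
A = [[0.0, 0.3], [-0.2, 0.1]]
B = [[1.0, 2.0], [0.5, -1.0]]
lhs = matmul(matmul(expm(A), B), expm(scale(A, -1.0)))
rhs, nested = [row[:] for row in B], [row[:] for row in B]
for n in range(1, 40):
    nested = scale(comm(A, nested), 1.0 / n)   # (1/n!) [A,[A,...[A,B]]] with n A's
    rhs = add(rhs, nested)
err_91 = max_diff(lhs, rhs)

# Theorem 9.2 on matrices whose commutator is central
A3 = [[0.0, 1.0, 0.0], [0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
B3 = [[0.0, 0.0, 0.0], [0.0, 0.0, 1.0], [0.0, 0.0, 0.0]]
C3 = comm(A3, B3)
lhs2 = expm(add(A3, B3))
rhs2 = matmul(matmul(expm(A3), expm(B3)), expm(scale(C3, -0.5)))
err_92 = max_diff(lhs2, rhs2)
print(err_91, err_92)
```

Both printed differences should be at the level of floating-point rounding.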

It remains to prove Eqs. (9.12) with $P_\mu$ expressed as Eq. (9.13). We consider the
case of a complex scalar field. Generalization to other fields is straightforward. The
energy-momentum operator is given in Eq. (5.93):

$$H = \int d^3x\left[\frac{\partial\varphi^\dagger}{\partial t}\frac{\partial\varphi}{\partial t} + (\nabla\varphi^\dagger\cdot\nabla\varphi) + m^2\varphi^\dagger\varphi\right] \qquad (9.20a)$$

$$\boldsymbol P = -\int d^3x\left[\frac{\partial\varphi^\dagger}{\partial t}\nabla\varphi + \nabla\varphi^\dagger\frac{\partial\varphi}{\partial t}\right] \qquad (9.20b)$$

Using the equal time commutation relation

$$[\varphi(x), \pi(y)]_{t_x = t_y} = [\varphi^\dagger(x), \pi^\dagger(y)]_{t_x = t_y} = i\delta^3(\boldsymbol x - \boldsymbol y) \qquad (9.21)$$

it is easy to verify Eq. (9.12a) if $O = \varphi(x)$ or $O = \pi(x)$.
Then for any operators A, B that satisfy

$$\partial_\mu A = i[P_\mu, A], \quad \partial_\mu B = i[P_\mu, B] \qquad (9.22a)$$

$$i[P_\mu, AB] = i[P_\mu, A]B + A\,i[P_\mu, B] = (\partial_\mu A)B + A\,\partial_\mu B = \partial_\mu(AB) \qquad (9.22b)$$

it follows that the equation is valid if $AB$ is replaced with any polynomial of
A and B. Since $P_\mu$ is a conserved operator that does not include any space-time
coordinates, applying any function of space-time differentiation $g(\partial)$ on both sides
of the equation gives

$$i[P_\mu, \{g(\partial)O(A, B)\}] = \partial_\mu\{g(\partial)O(A, B)\} \qquad (9.23)$$

Therefore, if O is a local operator, i.e. a polynomial of $\varphi(x)$, $\pi(x)$ and their
derivatives, it satisfies

$$\partial_\mu O = i[P_\mu, O] \qquad (9.24)$$

q.e.d.
In summary, the notion that for the space-time translational symmetry there
exists a corresponding conserved quantity, the energy-momentum, is valid in quantum
field theory as it was in classical field theory. In quantum field theory, the
energy-momentum operators are the generators of the translational transformation.
Note also that the Heisenberg equation holds even if the fields are interacting, be- 
cause the equal time commutation relation, which was the basis of the proof, is 
also valid for interacting fields. The validity is ensured by the fact that they are 
connected by a unitary transformation [see Eq. (6.7)]. 

9.1.2 

Rotational Invariance in the Two-Body System 

Discussions in the previous section have shown that energy-momentum conservation
is a result of translational invariance in Cartesian coordinates. As the Lagrangian
formalism is valid for a general coordinate system, translational invariance
in polar coordinates, $\phi \to \phi + \alpha$, results in another conserved quantity, the
angular momentum. Expressed in the original Cartesian coordinates,

$$p_\phi = -i\partial_\phi = -i(x\partial_y - y\partial_x) = (\boldsymbol r\times\boldsymbol p)_z = L_z \qquad (9.25)$$

In Chap. 3 we have shown that Lorentz invariance leads to the existence of spin an- 
gular momentum, which acts on spin components of the field and is an intrinsic 
property of particles, the quanta of the field. Determining the spin is an essential 
step in the study of elementary particle physics and the method is based on the 
consideration of rotational invariance. Experimentally, the most frequently used 
reactions are two-body scatterings. If rotational invariance holds, angular momen- 
tum is conserved. Then the scattering amplitude can be decomposed into partial 
waves of definite angular momentum, and by analyzing the angular distributions 
of the two-body scattering state the spin of the system can be determined. We have 
already learned in quantum mechanics that the scattering amplitude in the center 
of mass (CM) frame can be expanded in partial waves: 

$$\frac{d\sigma}{d\Omega} = |f(\theta)|^2 \qquad (9.26a)$$

$$f(\theta) = \frac{1}{2ip}\sum_J (2J + 1)\,[S_J(E) - 1]\,P_J(\theta) \qquad (9.26b)$$

where $p$ is the momentum in the CM frame, $S_J(E)$ is the partial-wave element of the
scattering matrix with angular momentum $J$ and $P_J(\theta)$ is the Legendre function of $\cos\theta$.
The above formula is valid for spinless particles. It can be extended to particles
having spin [217]. For this treatment, it is convenient to work in a basis
where the incoming and outgoing states are specified by their helicity
instead of $J_z$, the component of the angular momentum along the fixed
z-axis. The helicity is by its definition rotationally invariant.

What we are dealing with is a two-particle system having momentum and helicity
$p_1, \lambda_1$ and $p_2, \lambda_2$, respectively:

$$|\Psi\rangle = |p_1\lambda_1; p_2\lambda_2\rangle = \psi_1(p_1, \lambda_1)\,\psi_2(p_2, \lambda_2) \qquad (9.28)$$

Let us first separate the motion of the two-body reaction

$$A(p_1) + B(p_2) \to C(p_3) + D(p_4) \qquad (9.29)$$

into that of the CM frame itself and the relative motion in it. Defining the total
momentum $P$ of the system and the relative momentum $p$ in the initial and final
states by

$$P_i = p_1 + p_2, \quad P_f = p_3 + p_4 \qquad (9.30a)$$

$$p_i = \tfrac{1}{2}(p_1 - p_2), \quad p_f = \tfrac{1}{2}(p_3 - p_4) \qquad (9.30b)$$

and working in the CM frame, in which $P_i = P_f = (E, \boldsymbol 0)$, state vectors can
be expressed by their relative momentum in polar coordinates. We rewrite the
two-particle state as

$$|p\theta\phi; \lambda_1\lambda_2\rangle = \psi_1(p\lambda_1)\,\psi_2(p\lambda_2) \qquad (9.31)$$

where particle 1 has momentum $\boldsymbol p = (p, \theta, \phi)$ in polar coordinates and helicity $\lambda_1$,
and particle 2 has momentum $-\boldsymbol p$ and helicity $\lambda_2$. Sometimes we will use the total helicity

$$\lambda = \lambda_1 - \lambda_2, \quad \mu = \lambda_3 - \lambda_4 \qquad (9.32)$$

to denote the two-particle helicity state, i.e. $|\lambda\rangle = |\lambda_1\lambda_2\rangle$, when there is no danger
of confusion. The central task is to expand the plane-wave state in terms of states
that have definite angular momentum $J, M$.

If rotational invariance holds, angular momentum is conserved and the scatter- 
ing matrix elements can be expressed as 

$$\langle J'M'\lambda_3\lambda_4|S|JM\lambda_1\lambda_2\rangle = \delta_{JJ'}\delta_{MM'}\langle\lambda_3\lambda_4|S^J|\lambda_1\lambda_2\rangle \qquad (9.33)$$

where $|JM\lambda_1\lambda_2\rangle$ is a state of definite angular momentum constructed out of states
containing particles of definite helicity. The essence of the partial-wave expansion
is condensed in this equality.

We note that the final state, where the particle is scattered by angle $\theta$, can be
obtained by rotation from the initial state, where the momentum is along the
z-axis ($\theta = \phi = 0$). The rotation in this case is around the y-axis by $\theta$ and then
around the z-axis by $\phi$:

$$|p\theta\phi; \lambda\rangle = e^{-iJ_z\phi}e^{-iJ_y\theta}|p00; \lambda\rangle = R(\theta, \phi)|p00; \lambda\rangle \qquad (9.34)$$

where $R(\theta, \phi)$ denotes the rotation operator. Conventionally the alternative form

$$R(\theta, \phi) = e^{-iJ_z\phi}e^{-iJ_y\theta}e^{iJ_z\phi} \qquad (9.35)$$

is often used, which we will adopt here. It differs from Eq. (9.34) only in the phase.
Now we define the rotation matrix by

$$\langle J'M'\lambda|R(\theta, \phi)|JM\lambda\rangle = \delta_{JJ'}e^{-i(M'-M)\phi}\langle JM'\lambda|e^{-i\theta J_y}|JM\lambda\rangle = \delta_{JJ'}e^{-i(M'-M)\phi}d^J_{M'M}(\theta) \qquad (9.36)$$

The two states have the same helicity, as it is conserved by rotation. The rotation
matrix $d^J_{MM'}(\theta)$ satisfies the orthogonality condition and is normalized by

$$\int_{-1}^{1} d\cos\theta\; d^J_{MM'}(\theta)\,d^{J'}_{MM'}(\theta) = \delta_{JJ'}\frac{2}{2J + 1} \qquad (9.37)$$

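General properties and expressions for $d^J_{M'M}(\theta)$ are given in Appendix E.
Multiplying Eq. (9.34) by $\langle JM\lambda|$ from the left gives

$$\begin{aligned}
\langle JM\lambda|p\theta\phi; \lambda\rangle &= \sum_{J'M'}\langle JM\lambda|R(\theta, \phi)|J'M'\lambda\rangle\langle J'M'\lambda|p00; \lambda\rangle \\
&= \sum_{M'} e^{i(M'-M)\phi}\,d^J_{MM'}(\theta)\,\langle JM'\lambda|p00; \lambda\rangle \\
&= e^{i(\lambda-M)\phi}\,d^J_{M\lambda}(\theta)\,\langle J\lambda\lambda|p00; \lambda\rangle \\
&= N_J\,e^{i(\lambda-M)\phi}\,d^J_{M\lambda}(\theta)
\end{aligned} \qquad (9.38)$$

The penultimate equality follows because the momentum of the state $|p00; \lambda\rangle$ is
along the z-axis and its total helicity $\lambda = \lambda_1 - \lambda_2$ is exactly the same as $M = J_z$,
hence

$$\langle JM'\lambda|p00; \lambda\rangle = \delta_{M'\lambda}N_J \qquad (9.39)$$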
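$N_J$ is calculated to be

$$N_J = \sqrt{\frac{2J + 1}{4\pi}} \qquad (9.40)$$

To derive $N_J$, we use the normalization conditions

$$\langle J'M'\lambda_1'\lambda_2'|JM\lambda_1\lambda_2\rangle = \delta_{JJ'}\delta_{MM'}\delta_{\lambda_1\lambda_1'}\delta_{\lambda_2\lambda_2'} \qquad (9.41a)$$

$$\langle p'\theta'\phi'; \lambda'|p\theta\phi; \lambda\rangle = \delta(\cos\theta - \cos\theta')\,\delta(\phi - \phi')\,\delta_{\lambda_1\lambda_1'}\delta_{\lambda_2\lambda_2'} \qquad (9.41b)$$

and apply them to

$$\begin{aligned}
1 = \langle JM\lambda|JM\lambda\rangle &= \sum_\mu\int d\cos\theta\,d\phi\,\langle JM\lambda|p\theta\phi; \mu\rangle\langle p\theta\phi; \mu|JM\lambda\rangle \\
&= \int d\cos\theta\,d\phi\,|\langle JM\lambda|p\theta\phi; \lambda\rangle|^2 = \frac{4\pi}{2J + 1}N_J^2
\end{aligned} \qquad (9.41c)$$

Next we expand the state Eq. (9.34) in terms of angular momentum eigenstates:

$$|p\theta\phi; \lambda\rangle = \sum_{J,M}|JM\lambda\rangle\langle JM\lambda|p\theta\phi; \lambda\rangle = \sum_{J,M}N_J\,d^J_{M\lambda}(\theta)\,e^{i(\lambda-M)\phi}\,|JM\lambda\rangle \qquad (9.42)$$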

Then the scattering matrix between the two states expressed in polar coordinates 
can be written 

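$$\begin{aligned}
\langle p_f\theta\phi; \mu|S|p_i00; \lambda\rangle &= \sum_J\frac{2J + 1}{4\pi}\,e^{i(\lambda-\mu)\phi}\,d^J_{\lambda\mu}(\theta)\,\langle\mu|S^J(E)|\lambda\rangle \\
\langle\mu|S^J(E)|\lambda\rangle &= \langle\lambda_3\lambda_4|S^J(E)|\lambda_1\lambda_2\rangle, \quad \lambda = \lambda_1 - \lambda_2, \quad \mu = \lambda_3 - \lambda_4
\end{aligned} \qquad (9.43)$$

To normalize the scattering amplitude, we compare the scattering matrix in linear
momentum space and in polar coordinates in the CM system:

$$\begin{aligned}
\langle p_3p_4; \mu|S|p_1p_2; \lambda\rangle &= \delta_{fi} - i(2\pi)^4\delta^4\Big(\sum p_i - \sum p_f\Big)M_{fi} \\
&= (2\pi)^4\delta^4(P_i - P_f)\times(4\pi)^2\sqrt{\frac{s}{p_ip_f}}\times\langle p_f\theta\phi; \mu|S|p_i00; \lambda\rangle
\end{aligned} \qquad (9.44)$$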
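where $s = (p_1 + p_2)^2$. The factor in the second line results from the normalization
conditions Eq. (9.41). To prove it, set $S = 1$ and use

$$\begin{aligned}
\langle p_3p_4|p_1p_2\rangle &= (2\pi)^6(16E_1E_2E_3E_4)^{1/2}\,\delta^3(p_1 - p_3)\,\delta^3(p_2 - p_4) \\
\delta^3(p_1 - p_3)\,\delta^3(p_2 - p_4)\,d^3p_3\,d^3p_4 &= \delta^3(P_i - P_f)\,\delta^3(p_i - p_f)\,d^3P_f\,d^3p_f \\
&= \left[\frac{1}{p_f^2}\frac{\partial E_f}{\partial p_f}\,\delta^4(P_i - P_f)\,\delta(\cos\theta - \cos\theta')\,\delta(\phi - \phi')\right]\left[\left(\frac{\partial E_f}{\partial p_f}\right)^{-1}p_f^2\,d^4P_f\,d\cos\theta\,d\phi\right]
\end{aligned} \qquad (9.45)$$

and symmetrize between the initial and the final state. We define the
matrix $T^J(E)$ by 2)

$$\langle\lambda_3\lambda_4|S^J(E)|\lambda_1\lambda_2\rangle = \delta_{\lambda_1\lambda_3}\delta_{\lambda_2\lambda_4} + i\langle\lambda_3\lambda_4|T^J(E)|\lambda_1\lambda_2\rangle \qquad (9.46)$$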

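Then using Eq. (9.44) and comparing with the cross section formula (6.90) for
$d\sigma/d\Omega|_{\rm CM}$, we obtain

$$\frac{d\sigma}{d\Omega}\bigg|_{\rm CM} = |f_{\lambda\mu}(\theta, \phi)|^2 \qquad (9.47a)$$

$$f_{\lambda\mu}(\theta, \phi) = \frac{1}{2p}\sum_J (2J + 1)\,\langle\lambda_3\lambda_4|T^J(E)|\lambda_1\lambda_2\rangle\,d^J_{\lambda\mu}(\theta)\,e^{i(\lambda-\mu)\phi} \qquad (9.47b)$$

$$= \frac{1}{2ip}\sum_J (2J + 1)\,\langle\lambda_3\lambda_4|(S^J(E) - 1)|\lambda_1\lambda_2\rangle\,d^J_{\lambda\mu}(\theta)\,e^{i(\lambda-\mu)\phi} \qquad (9.47c)$$

The equation is an extension of Eq. (9.26). In fact, referring to Eq. (E.21) in
Appendix E,

$$d^J_{00}(\theta) = P_J(\theta) \qquad (9.48)$$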

This agrees with Eq. (9.26) when the particles have no spin. 
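The properties of the rotation matrix quoted in this section can be verified numerically. The sketch below uses the standard closed-form sum for the Wigner small-d function (Appendix E is not reproduced here, so this particular formula and its phase convention are an assumption), with $2J$, $2M'$, $2M$ passed as integers so that half-integer spins are covered. It checks $d^J_{00}(\theta) = P_J(\cos\theta)$, the value $d^J_{M'M}(\pi) = (-1)^{J+M'}\delta_{M',-M}$, and the orthogonality integral with the normalization $2/(2J+1)$.

```python
from math import acos, cos, sin, factorial, sqrt, pi

def wigner_d(two_j, two_mp, two_m, beta):
    """d^j_{m'm}(beta); arguments are 2j, 2m', 2m (integers)."""
    jpm, jmm = (two_j + two_m) // 2, (two_j - two_m) // 2        # j+m, j-m
    jpmp, jmmp = (two_j + two_mp) // 2, (two_j - two_mp) // 2    # j+m', j-m'
    dm = (two_mp - two_m) // 2                                   # m'-m
    norm = sqrt(factorial(jpm) * factorial(jmm) * factorial(jpmp) * factorial(jmmp))
    c, s = cos(beta / 2), sin(beta / 2)
    total = 0.0
    for k in range(max(0, -dm), min(jpm, jmmp) + 1):
        denom = factorial(jpm - k) * factorial(k) * factorial(dm + k) * factorial(jmmp - k)
        total += (-1) ** (dm + k) / denom * c ** (two_j - dm - 2 * k) * s ** (dm + 2 * k)
    return norm * total

def legendre(n, x):
    """Legendre polynomial P_n(x) from the three-term recurrence."""
    p0, p1 = 1.0, x
    if n == 0:
        return p0
    for k in range(1, n):
        p0, p1 = p1, ((2 * k + 1) * x * p1 - k * p0) / (k + 1)
    return p1

def overlap(two_j1, two_j2, two_m, two_mp, n=2000):
    """Midpoint-rule estimate of the integral over cos(theta) of d^{J1} d^{J2}."""
    h = 2.0 / n
    total = 0.0
    for i in range(n):
        theta = acos(-1.0 + (i + 0.5) * h)
        total += wigner_d(two_j1, two_m, two_mp, theta) * wigner_d(two_j2, two_m, two_mp, theta) * h
    return total

theta = 0.7
legendre_err = max(abs(wigner_d(2 * j, 0, 0, theta) - legendre(j, cos(theta))) for j in range(5))
same_j = overlap(2, 2, 2, 0)     # J = J' = 1, expected 2/3
diff_j = overlap(2, 4, 2, 0)     # J = 1, J' = 2, expected 0
print(legendre_err, same_j, diff_j)
```

Passing doubled quantum numbers avoids floating-point half-integers inside the factorials; the midpoint rule is a deliberately simple choice and could be replaced by any quadrature.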

Unitarity of S-matrix 

From the unitarity of the scattering matrix we have

$$SS^\dagger = S^\dagger S = 1 \qquad (9.49)$$

Sandwiching it between an initial and final state gives

$$\sum_n S^*_{nf}S_{ni} = \delta_{fi} \qquad (9.50)$$

In terms of the T matrix defined in Eq. (9.46)

$$i(T^*_{if} - T_{fi}) = \sum_n T^*_{nf}T_{ni} \qquad (9.51)$$

2) $T^J = (S^J - 1)/2i$ is also often used in the literature.

If it is a helicity amplitude, time reversal invariance requires $S_{fi} = S_{if}$
[Eq. (9.121)],

$$2\,{\rm Im}(T_{fi}) = \sum_n T^*_{nf}T_{ni} \qquad (9.52)$$

For forward scattering ($i = f$),

$$2\,{\rm Im}(T_{ii}) = \sum_n |T_{ni}|^2 \qquad (9.53)$$

The right hand side (rhs) is proportional to the total cross section and the left hand 
side (lhs) to the imaginary part of the forward scattering amplitude. The relation is 
general in the sense that it is valid for inelastic as well as elastic scattering. Namely, 

$$\sigma_{\rm TOT} = k\,{\rm Im}[f(\theta = 0)] \qquad (9.54)$$

To obtain the proportionality constant, we use Eq. (9.47) and

$$T_{fi} = -(2\pi)^4\delta^4(P_i - P_f)\,M_{fi} \qquad (9.55)$$

Referring to the cross section formula in terms of $M_{fi}$ [see (6.86)]

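$$\sigma_{\rm TOT} = \sum_f (2\pi)^4\delta^4(P_i - P_f)\,\frac{|M_{fi}|^2}{4p_{\rm cm}\sqrt{s}} = -\frac{2\,{\rm Im}(M_{ii})}{4p_{\rm cm}\sqrt{s}} \qquad (9.56)$$

where we have used the relation $p_{\rm cm} = \lambda(s, m_1^2, m_2^2)/(2\sqrt{s}) = \sqrt{s}\,\lambda(1, x_1, x_2)/2$
[Eq. (6.56)].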

Problem 9.3 

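Using Eq. (9.47), calculate the total cross section for the elastic scattering

$$\sigma_{\rm TOT} = \int d\Omega\sum_{\lambda_3\lambda_4}|f_{\lambda_3\lambda_4, \lambda_1\lambda_2}|^2 \qquad (9.57)$$

and then prove that

$$\sigma_{\rm TOT}(\lambda_1, \lambda_2) = \frac{4\pi}{p}\,{\rm Im}\,f_{\lambda_1\lambda_2, \lambda_1\lambda_2}(\theta = 0) \qquad (9.58)$$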

assuming there is no inelastic scattering. 
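For spinless particles the relation between the total cross section and the forward amplitude can be checked numerically. The sketch below assumes the partial-wave form $f(\theta) = (1/2ip)\sum_J (2J+1)(S_J - 1)P_J(\cos\theta)$ with purely elastic $S_J = e^{2i\delta_J}$; the momentum and the phase shifts are hypothetical numbers chosen only for illustration. It integrates $|f|^2$ over the solid angle and compares the result with $(4\pi/p)\,{\rm Im}\,f(0)$.

```python
import cmath
import math

def legendre(n, x):
    """Legendre polynomial P_n(x) from the three-term recurrence."""
    p0, p1 = 1.0, x
    if n == 0:
        return p0
    for k in range(1, n):
        p0, p1 = p1, ((2 * k + 1) * x * p1 - k * p0) / (k + 1)
    return p1

def amplitude(cos_theta, p, phase_shifts):
    """f = (1/2ip) sum_J (2J+1) (S_J - 1) P_J(cos theta), with S_J = exp(2i delta_J)."""
    total = 0j
    for ell, delta in enumerate(phase_shifts):
        s_ell = cmath.exp(2j * delta)
        total += (2 * ell + 1) * (s_ell - 1.0) * legendre(ell, cos_theta)
    return total / (2j * p)

def total_cross_section(p, phase_shifts, n=4000):
    """Integrate |f|^2 over the solid angle with a midpoint rule in cos(theta)."""
    h = 2.0 / n
    integral = sum(abs(amplitude(-1.0 + (i + 0.5) * h, p, phase_shifts)) ** 2
                   for i in range(n)) * h
    return 2.0 * math.pi * integral

p = 0.5                      # CM momentum in arbitrary units (hypothetical)
deltas = [0.8, 0.3, -0.1]    # hypothetical phase shifts for J = 0, 1, 2
sigma = total_cross_section(p, deltas)
optical = 4.0 * math.pi / p * amplitude(1.0, p, deltas).imag
print(sigma, optical)
```

The two numbers should agree to the accuracy of the numerical integration as long as the phase shifts are real, i.e. there is no inelastic scattering.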
9.2

Discrete Symmetries

Discrete symmetries are not functions of a continuous parameter and no infinitesi- 
mal variation or differentiation is possible. Parity transformation, time reversal and 
charge conjugation to exchange particles and antiparticles are such examples. 

9.2.1 

Parity Transformation 

General Properties Parity, or P operation for short, is often referred to as
"mirroring" (reflection) of an object, but the exact definition is reversal of signs
on all space coordinates:

$$\boldsymbol x \to -\boldsymbol x \qquad (9.59)$$

Performing the parity transformation again brings the state back to the original
one, that is $P^2 = 1$. Therefore the corresponding unitary transformation $U_P$
satisfies

$$U_PU_P = 1 \quad\therefore\quad U_P = U_P^{-1} = U_P^\dagger \qquad (9.60)$$

The parity transformation is at the same time unitary and hermitian, hence an ob- 
servable by itself. Its eigenvalue is ±1. It is known that parity is violated maximally 
in the weak interaction, but that it is conserved in the strong and electromagnetic 
interactions, at least within experimental limits. 

Momentum and Angular Momentum There are many observables that have their
own parity. Classically, the momentum $\boldsymbol p$ is $m\boldsymbol v = m\,d\boldsymbol x/dt$ and quantum
mechanically $-i\nabla$; it changes its sign under parity transformation. Orbital angular
momentum, on the other hand, has positive parity since it is given by $\boldsymbol x\times\boldsymbol p$. The
parity of the spin angular momentum $\boldsymbol S$ is not obvious, but we can assume it has the
same property as its orbital counterpart. The above examples show that there are
two kinds of (three-dimensional space) vectors. Those that change their sign under
the parity transformation are called polar vectors and the others, which do not,
axial vectors. Similarly, a quantity that is rotationally invariant but changes its sign
under the P operation is called a pseudoscalar. One example is the helicity operator
$\boldsymbol\sigma\cdot\boldsymbol p/|\boldsymbol p|$, the spin component along the momentum direction.
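These transformation properties are easy to see with explicit numbers. In the sketch below the vectors are arbitrary illustrative values: the polar vectors $\boldsymbol x$ and $\boldsymbol p$ are reversed by hand, the spin is treated as an axial vector and left unchanged, and the orbital angular momentum and the helicity are then recomputed.

```python
import math

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def parity(v):
    """Parity acting on a polar vector: every component changes sign."""
    return tuple(-c for c in v)

def helicity(spin, momentum):
    """Spin component along the momentum direction."""
    return dot(spin, momentum) / math.sqrt(dot(momentum, momentum))

x = (1.0, 2.0, -0.5)     # position (polar vector)
p = (0.3, -1.0, 2.0)     # momentum (polar vector)
s = (0.0, 0.0, 0.5)      # spin (axial vector, unchanged by parity)

L = cross(x, p)
L_mirror = cross(parity(x), parity(p))   # orbital angular momentum after parity
h = helicity(s, p)
h_mirror = helicity(s, parity(p))        # helicity after parity
print(L, L_mirror)
print(h, h_mirror)
```

The orbital angular momentum comes out unchanged (axial vector) while the helicity flips sign (pseudoscalar).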

Intrinsic Parity of a Particle When the Hamiltonian is a function of the distance
$r = |\boldsymbol x|$ only,

$$H = \frac{p^2}{2m} + V(r) \qquad (9.61)$$

it is invariant under the parity operation, and if $\psi(\boldsymbol x)$ is a solution then $\psi(-\boldsymbol x)$ is also a
solution:

$$\psi'(-\boldsymbol x) = U_P\psi(\boldsymbol x) = \pm\psi(\boldsymbol x) \qquad (9.62)$$

Generally, a particle has its own intrinsic parity. When the parity operator acts on a
one-particle state,

$$P|q\rangle = \pm\eta_P|Pq\rangle \qquad (9.63)$$

Here, $q$ is a quantum number that specifies the state, and $\eta_P$ is a phase that
accompanies the parity transformation. The phase degree of freedom is allowed because
the expectation value of an observable is always sandwiched by a state and its
conjugate. Usually, the phase is taken to be 1 for simplicity.

Parity of Fields The parity of the Dirac bilinears was already given in Sect. 4.3.3. 
The parity of the scalar, vector and tensor fields are the same as the corresponding 
Dirac bilinears. We reproduce in Table 9.2 the parity transformation properties of 
the fields. 

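Parity of the Photon Now let us consider the parity of the photon. Since the time
component of a charged current ($\rho = j^0$) is a scalar with positive parity and the
space component is a polar vector, the Maxwell equations

$$\nabla\cdot\boldsymbol E = q\rho, \quad \frac{\partial\boldsymbol B}{\partial t} = -\nabla\times\boldsymbol E \qquad (9.64)$$

tell us that

$$\boldsymbol E(\boldsymbol x) \to \boldsymbol E'(-\boldsymbol x) = -\boldsymbol E(\boldsymbol x), \quad \boldsymbol B(\boldsymbol x) \to \boldsymbol B'(-\boldsymbol x) = \boldsymbol B(\boldsymbol x) \qquad (9.65)$$

Since the electromagnetic potential is connected to the field by

$$\boldsymbol E = -\nabla\phi - \frac{\partial\boldsymbol A}{\partial t}, \quad \boldsymbol B = \nabla\times\boldsymbol A \qquad (9.66)$$

the transformation property of the potential is given by

$$\phi(\boldsymbol x) \to \phi'(-\boldsymbol x) = \phi(\boldsymbol x), \quad \boldsymbol A(\boldsymbol x) \to \boldsymbol A'(-\boldsymbol x) = -\boldsymbol A(\boldsymbol x) \qquad (9.67)$$

This means the photon is a polar vector and has negative parity.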

Table 9.2 Property of Dirac bilinears under parity transformation.

      $S(t, x)$     $P(t, x)$      $V^\mu(t, x)$     $A^\mu(t, x)$      $T^{\mu\nu}(t, x)$

P     $S(t, -x)$    $-P(t, -x)$    $V_\mu(t, -x)$    $-A_\mu(t, -x)$    $T_{\mu\nu}(t, -x)$
Parity of Many-Particle Systems

The quantum number of a many-particle system is additive for those corresponding
to continuous symmetry and multiplicative for those corresponding to discrete
symmetry, provided they are independent. The wave function of a many-particle
system when there are no interactions between the particles is described by

$$\Psi(x_1, x_2, \cdots, x_n) = \psi_1(x_1)\psi_2(x_2)\cdots\psi_n(x_n) \qquad (9.68a)$$

$$\Psi' = U\Psi = \psi_1'\psi_2'\cdots\psi_n' = U\psi_1\,U\psi_2\cdots U\psi_n(x_n) \qquad (9.68b)$$

When $U = e^{iQ\alpha}$,

$$Q = Q_1 + Q_2 + \cdots + Q_n \qquad (9.69)$$

When the transformation is discrete, U itself is converted to its quantum number,
the rhs becomes $\prod_i\eta_i\prod_i\psi_i$ and

$$\eta = \eta_1\eta_2\cdots\eta_n \qquad (9.70)$$

Parity of a Two-Body System 

Consider the case where the intrinsic parity of spinless particles 1, 2 is $P_1$, $P_2$ and
the wave function of the relative motion is $\Phi(\boldsymbol x) = f(r)Y_{LM}(\theta, \phi)$, where $Y_{LM}$ is
a spherical harmonic function. Under the parity operation $\theta \to \pi - \theta$ and
$\phi \to \phi + \pi$. Using the property of the spherical harmonic function
$Y_{LM}(\pi - \theta, \phi + \pi) = (-1)^L Y_{LM}(\theta, \phi)$,

$$U\Psi(x_1, x_2) = U[\varphi_1(x_1)\Phi(x_1 - x_2)\varphi_2(x_2)] = P_1P_2(-1)^L\Psi \qquad (9.71)$$

The parity of a three-body system can be determined similarly as $P_1P_2P_3(-1)^{l+L}$,
where $l$ is the relative angular momentum between the particles 1, 2 and $L$ is that
of the particle 3 relative to the CM of the 1 + 2 system. Now consider a reaction
$a + b \to c + d$, which has relative angular momentum $l$, $l'$ before and after the
reaction, respectively:

$$\begin{aligned}
\langle cd|S|ab\rangle &= \langle cd|P^{-1}PSP^{-1}P|ab\rangle = P_aP_b(-1)^l P_cP_d(-1)^{l'}\langle cd|S|ab\rangle \\
\therefore\; &\{1 - P_aP_b(-1)^l P_cP_d(-1)^{l'}\}\langle cd|S|ab\rangle = 0
\end{aligned} \qquad (9.72)$$

If $\langle cd|S|ab\rangle \neq 0$ and $[P, S] = 0$, the parity before and after the reaction is
conserved:

$$P_aP_b(-1)^l = P_cP_d(-1)^{l'} \qquad (9.73)$$

When the parity of the particles a, b, c is known, that of d can be determined using
the above equation. We shall determine the parity of the pion in the next section
this way. Equation (9.73) does not hold if $\langle cd|S|ab\rangle = 0$ by some other selection
rules. Therefore, there are as many ambiguities in the determination of the parity
assignment as the number of selection rules. For instance, to determine the parity
of $\pi^0$, one can use a process $p \to p + \pi^0$ or $n \to n + \pi^0$, but to determine that of
$\pi^+$ one cannot use $p \to p + \pi^+$, because it is forbidden by charge conservation.
The process $p \to n + \pi^+$ is allowed, but one needs to know the relative parity of
n to p. Similarly one cannot determine the parity of $K^+$ using the process
$p \to n + K^+$ as it is forbidden by strangeness conservation. $p \to \Lambda + K^+$ is allowed,
but then again the parity of $K^+$ and $\Lambda$ cannot be determined independently. The usual
assumption is that all the quarks u, d, s, etc. have relatively positive parities. Since
all the hadrons are made of quarks, the parities of the hadrons can be determined in
principle once the relative angular momentum among the quarks is specified. The
parity of the familiar baryons ($p, n, \Lambda, \cdots$), therefore, is assumed to be positive.
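The bookkeeping of the two-body parity rule is simple enough to put into a few lines. The sketch below uses hypothetical helper names; the negative intrinsic parity of the pion is taken as an input here (the text determines it later), and the ρ meson decaying to two pions in a relative P-wave serves as the example.

```python
def two_body_parity(p1, p2, orbital_l):
    """Parity of two particles with intrinsic parities p1, p2 and relative orbital angular momentum l."""
    return p1 * p2 * (-1) ** orbital_l

def parity_conserved(p_a, p_b, l_in, p_c, p_d, l_out):
    """Check P_a P_b (-1)^l = P_c P_d (-1)^l' for a + b -> c + d."""
    return two_body_parity(p_a, p_b, l_in) == two_body_parity(p_c, p_d, l_out)

def unknown_parity(p_a, p_b, l_in, p_c, l_out):
    """Solve the same relation for the intrinsic parity of particle d."""
    return p_a * p_b * (-1) ** l_in * p_c * (-1) ** l_out

PION = -1   # intrinsic parity of the pion, assumed as input

# Two spinless pions can only carry total spin 1 through L = 1,
# so a spin-1 state decaying to pi pi must have negative parity.
rho_parity = two_body_parity(PION, PION, 1)
print(rho_parity)   # -1
```

Because parities are ±1, dividing by a parity is the same as multiplying by it, which is why `unknown_parity` contains only products.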

Parity Transformation of Helicity States* 

The parity transformation property of the helicity amplitude is a bit complicated, 
because it does not use the orbital angular momentum explicitly. Those who are 
not interested in the complication of the derivation may skip this part and use only 
the relevant formula Eq. (9.87). The following arguments follow closely those of 
Jacob and Wick [217]. 

To determine the parity of a partial wave state $|JM\lambda\rangle$ in the helicity formalism,
we must go back to the beginning and evaluate the effect on the helicity states
Eq. (9.31)

$$|p00\lambda_1\lambda_2\rangle = \psi_1(p\lambda_1)\,\psi_2(p\lambda_2) \qquad (9.74)$$

where particles 1 and 2 are moving in the $+z$ and $-z$ directions, respectively. $\psi_1$ is
obtained by a Lorentz boost from its rest frame. Since the eigenvalues of the parity
operation are only phases, we need to fix the relative phases of various states
beforehand. They can be determined from a requirement that $\psi(0\lambda)$ satisfies the
usual angular momentum formula

$$J_\pm|s\lambda\rangle = (J_x \pm iJ_y)|s\lambda\rangle = [(s \mp \lambda)(s \pm \lambda + 1)]^{1/2}\,|s, \lambda \pm 1\rangle \qquad (9.75)$$

The parity operation does not change the spin direction and at rest $J_z = \lambda$, so

$$P\psi(0\lambda) = \eta\,\psi(0\lambda) \qquad (9.76)$$

where $\eta$ is the particle's intrinsic parity. It is convenient to define a mirror operation
with respect to the xz-plane:

$$Y = e^{-i\pi J_y}P \qquad (9.77)$$

Using the relations

$$e^{-i\theta J_y}|JM\lambda\rangle = \sum_{M'}d^J_{M'M}(\theta)\,|JM'\lambda\rangle \qquad (9.78a)$$

$$d^J_{M'M}(\pi) = (-1)^{J+M'}\delta_{M',-M} \qquad (9.78b)$$
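we obtain

$$Y\psi_1(p\lambda_1) = \eta_1(-1)^{s_1-\lambda_1}\,\psi_1(p, -\lambda_1) \qquad (9.79)$$

Similarly

$$Y\psi_2(p\lambda_2) = \eta_2(-1)^{s_2+\lambda_2}\,\psi_2(p, -\lambda_2) \qquad (9.80)$$

The reason for the opposite sign of $\lambda_2$ in the exponent is that the particle's momentum
is $-\boldsymbol p$, hence $M = J_z = -\lambda_2$. Combining the two equations

$$Y|p00\lambda_1\lambda_2\rangle = \eta_1\eta_2(-1)^{s_1+s_2-\lambda_1+\lambda_2}\,|p00, -\lambda_1, -\lambda_2\rangle \qquad (9.81a)$$

$$\therefore\; P|p00\lambda_1\lambda_2\rangle = \eta_1\eta_2(-1)^{s_1+s_2-\lambda_1+\lambda_2}\,e^{i\pi J_y}\,|p00, -\lambda_1, -\lambda_2\rangle \qquad (9.81b)$$

Now, what we want is the phase that appears in

$$P|JM\lambda_1\lambda_2\rangle = e^{i\alpha}\,|JM, -\lambda_1, -\lambda_2\rangle \qquad (9.82)$$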



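Since the parity and the angular momentum operators commute, the phase $\alpha$ does
not depend on $M$. Therefore, without loss of generality, we can set
$M = \lambda = \lambda_1 - \lambda_2$. For the same reason, we can confine our argument to the case
$\theta = \phi = 0$. Expanding the rhs in Eq. (9.81b) in partial waves

$$\begin{aligned}
e^{i\pi J_y}|p00, -\lambda_1, -\lambda_2\rangle &= \sum_{JM\mu_1\mu_2}\sum_{J'M'\nu_1\nu_2}|JM\mu_1\mu_2\rangle\langle JM\mu_1\mu_2|e^{i\pi J_y}|J'M'\nu_1\nu_2\rangle\langle J'M'\nu_1\nu_2|p00, -\lambda_1, -\lambda_2\rangle \\
&= \sum_{JM}N_J\,d^J_{M,-\lambda}(-\pi)\,|JM, -\lambda_1, -\lambda_2\rangle \\
&= \sum_J N_J(-1)^{\lambda-J}\,|J\lambda, -\lambda_1, -\lambda_2\rangle
\end{aligned} \qquad (9.83)$$

Inserting this into Eq. (9.81b) gives

$$P|p00\lambda_1\lambda_2\rangle = \sum_J N_J\,\eta_1\eta_2(-1)^{s_1+s_2-J}\,|J\lambda, -\lambda_1, -\lambda_2\rangle \qquad (9.84)$$

On the other hand, expanding the lhs of Eq. (9.81b) directly using Eq. (9.42) gives

$$P|p00\lambda_1\lambda_2\rangle = \sum_{J,M}P|JM\lambda_1\lambda_2\rangle\langle JM\lambda_1\lambda_2|p00\lambda_1\lambda_2\rangle = \sum_J N_J\,P|J\lambda\lambda_1\lambda_2\rangle \qquad (9.85)$$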
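Comparing Eq. (9.84) and Eq. (9.85)

$$P|J\lambda\lambda_1\lambda_2\rangle = \eta_1\eta_2(-1)^{s_1+s_2-J}\,|J\lambda, -\lambda_1, -\lambda_2\rangle \qquad (9.86)$$

Noting $s_1 + s_2 - J$ is an integer, and that the phase does not depend on $M$, we finally obtain

$$P|JM\lambda_1\lambda_2\rangle = \eta_1\eta_2(-1)^{J-s_1-s_2}\,|JM, -\lambda_1, -\lambda_2\rangle \qquad (9.87)$$

When the parity is conserved, applying Eq. (9.87) to the scattering matrix,

$$\langle -\lambda_3, -\lambda_4|S^J|-\lambda_1, -\lambda_2\rangle = \eta_S\langle\lambda_3\lambda_4|S^J|\lambda_1\lambda_2\rangle, \quad \eta_S = \eta_1\eta_2\eta_3\eta_4(-1)^{s_3+s_4-s_1-s_2} \qquad (9.88)$$

Using Eq. (9.47) and $d^J_{\lambda\mu}(\pi - \theta) = (-1)^{J+\lambda}d^J_{\lambda,-\mu}(\theta)$, a similar formula for the
scattering amplitude can be obtained:

$$\langle -\lambda_3, -\lambda_4|f(\theta, \phi)|-\lambda_1, -\lambda_2\rangle = \eta_S\langle\lambda_3\lambda_4|f(\theta, \pi - \phi)|\lambda_1\lambda_2\rangle \qquad (9.89)$$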

Parity Violation in the Weak Interaction 

We will describe the detailed dynamics of the weak interaction in Chap. 15. Here, 
we describe only the essence of parity violation. The momentum $p$ is an observable
with negative parity, but this does not mean that parity is violated, since particles with
$-p$ exist equally. However, if the S-matrix contains an observable that has negative
parity, parity is violated in the scattering. For instance,

$$ J\cdot p = J\,p\cos\theta \qquad (9.90) $$

is such an observable. In a strong magnetic field at ultra-low temperatures, a
nucleus can be polarized along the magnetic field. An asymmetry in the angular
distribution of the decay particles from the nuclei means that parity is violated in
the decay. Here, the angle $\theta$ of the particle is defined relative to the magnetic field, i.e.
the spin orientation of the parent nuclei, and the asymmetry is relative to the plane
perpendicular to the polarization axis, i.e. whether $\theta$ is smaller or larger than $\pi/2$. The experiment of
Wu et al. [392], which proved parity violation in the weak interaction, was carried out
this way.

In π-p scattering where the initial proton is unpolarized, the final proton can
be polarized perpendicular to the plane of scattering, namely $\sigma\cdot(k_1\times k_2) \neq 0$,
where $k_1$, $k_2$ are the pion momenta before and after scattering, respectively. This
is because the parity of $\sigma\cdot(k_1\times k_2)$ is positive. However, if the recoiled proton
is longitudinally polarized, i.e. polarized along its momentum, which means
$\sigma\cdot p \neq 0$, parity is violated.

The origin of the parity-violating transition can be traced back to the Hamiltonian.
If parity is conserved in an interaction, the parity operator commutes with
the Hamiltonian. Since the scattering matrix is made from the Hamiltonian, the parity
operator commutes with the S-matrix, too. In order for the S-matrix to contain a
parity-violating term, the Hamiltonian must contain a parity-violating component, too:

$$ H = H_0 + H_{PV} \qquad (9.91a) $$

$$ P H_0 P^{-1} = H_0, \qquad P H_{PV} P^{-1} = -H_{PV} \qquad (9.91b) $$

where $H_0$ and $H_{PV}$ are a scalar and a pseudoscalar, respectively. $H_{PV}$ is the origin
of the parity-violating transition. We shall see how the parity-violating observable
is related to $H_{PV}$.



Let us pause to consider the meaning of the extra parity-violating term in the
Lagrangian. There are observables that change sign under the parity transformation
P. The momentum $p$ and helicity $\sigma\cdot p$ are examples. Their existence
alone does not mean parity violation. We claim that parity is violated if a phenomenon
exhibits a different behavior in the mirror world than in ours. This
means the dynamical motion in the mirror world is different for a given initial
state. Namely, it happens if the transition amplitude includes a term that
behaves differently in the mirror world, in other words, if it includes odd-parity
terms such as $\sigma\cdot p$. As the transition amplitude is a matrix element of
the Hamiltonian for given initial and final states, the Hamiltonian itself has
to include an odd-parity term to induce the parity-violating phenomenon. In
mathematical language, it simply means $[P, H] \neq 0$. Calculations of the transition
amplitude will show that if $C'$ is the coefficient of the odd-parity term
in the Hamiltonian, the parity-violating observables always appear multiplied
by $C'$.



The weak interaction is known to violate parity. Beta decay, $(A, Z) \to (A, Z+1) + e^- + \bar\nu_e$,
or in the quark model $d \to u + e^- + \bar\nu_e$, is mediated by a charged
force carrier, the $W^\pm$ vector boson, but in the low-energy limit, the interaction
Hamiltonian is well described by the four-Fermi interaction, which can be written

$$ H_{INT} = (\bar u\gamma_\mu d)\{\bar e\,(C_V\gamma^\mu + C'_V\gamma^\mu\gamma^5)\,\nu_e\} + (\bar u\gamma_\mu\gamma^5 d)\{\bar e\,(C_A\gamma^\mu\gamma^5 + C'_A\gamma^\mu)\,\nu_e\} + \mathrm{h.c.} \qquad (9.92) $$

where $u$, $d$, $\nu_e$, $e$ stand for the Dirac fields and h.c. means the hermitian conjugate
of the preceding term. Referring to Table 9.2, we see that terms with $C_V$,
$C_A$, which contain $V_\mu V^\mu$, $A_\mu A^\mu$, are parity conserving, while those with $C'_V$, $C'_A$,
which contain $V_\mu A^\mu$, $A_\mu V^\mu$, are parity violating. Namely, $C_V$, $C_A$ and $C'_V$, $C'_A$ are
the strengths of the parity-conserving and parity-violating terms, respectively. Since
the observed rate is proportional to the square of the amplitude, the effect of the
parity violation appears as the interference term. The total rate $\Gamma$ is proportional to
$|H_0|^2 + |H_{PV}|^2$. By preparing suitable initial and final states of the nucleus, it can
be arranged to pick up only the $C_A$, $C'_A$ part.3) Then the angular distribution of the
electron is shown to have the form

$$ \frac{d\Gamma}{d\Omega} \sim 1 + \alpha\,\frac{\langle J\rangle\cdot p_e}{J\,E_e} = 1 + \alpha P v\cos\theta \qquad (9.93a) $$

$$ \alpha = \frac{2\,\mathrm{Re}(C_A^* C'_A)}{|C_A|^2 + |C'_A|^2} \qquad (9.93b) $$

$P$ is the polarization of the parent nucleus and $v$ is the electron velocity. This shows clearly that the origin of
the parity-violating observable $J\cdot p_e$ is the parity-violating Hamiltonian $H_{PV}$. The
effect is maximal when $|C_A| = |C'_A|$, and experimental data have shown this is the
case ($C'_A \simeq -C_A$). Experimental data have fixed the strengths as well as the phases
of the coupling constants ($C'_V = -C_V$, $C'_A = -C_A$, $C_A = C_V$) and excluded other
types (i.e. S, P, T) of quadrilinear forms. The Hamiltonian for beta decay has been
shown to be

$$ H_{INT} = \frac{G_F}{\sqrt{2}}\{\bar u\gamma_\mu(1-\gamma^5)d\}\{\bar e\gamma^\mu(1-\gamma^5)\nu_e\} \qquad (9.94) $$

Here, $C_V$ was replaced by the universal Fermi coupling constant $G_F/\sqrt{2}$, $G_F \simeq
10^{-5}\,m_p^{-2}$, where $m_p$ is the mass of the proton. This is called the V-A interaction
and was the standard phenomenological Hamiltonian before the advent of the
Standard Model.
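As a numerical illustration of how the interference of parity-conserving and parity-violating couplings produces an angular asymmetry, the following minimal sketch evaluates a distribution of the form 1 + α P v cos θ. It assumes the commonly used normalization α = 2 Re(C_A* C'_A)/(|C_A|² + |C'_A|²) for a pure Gamow-Teller transition; the function names and the sample values of P and v are illustrative choices, not taken from the text.

```python
def asymmetry_coefficient(c_a, c_a_prime):
    """alpha = 2 Re(C_A^* C'_A) / (|C_A|^2 + |C'_A|^2) (assumed normalization)."""
    c_a, c_a_prime = complex(c_a), complex(c_a_prime)
    numerator = 2.0 * (c_a.conjugate() * c_a_prime).real
    return numerator / (abs(c_a) ** 2 + abs(c_a_prime) ** 2)


def angular_distribution(cos_theta, alpha, polarization, velocity):
    """Unnormalized electron distribution 1 + alpha P v cos(theta), v in units of c."""
    return 1.0 + alpha * polarization * velocity * cos_theta


def forward_backward_asymmetry(alpha, polarization, velocity, steps=2000):
    """(N_forward - N_backward) / (N_forward + N_backward), midpoint rule in cos(theta)."""
    width = 1.0 / steps
    forward = backward = 0.0
    for i in range(steps):
        c = (i + 0.5) * width
        forward += angular_distribution(c, alpha, polarization, velocity) * width
        backward += angular_distribution(-c, alpha, polarization, velocity) * width
    return (forward - backward) / (forward + backward)


if __name__ == "__main__":
    alpha = asymmetry_coefficient(1.0, -1.0)   # C'_A = -C_A: maximal violation
    print(alpha, forward_backward_asymmetry(alpha, polarization=0.6, velocity=0.6))
```

With C'_A = 0 the coefficient vanishes and the distribution is symmetric about θ = π/2, which is the parity-conserving limit.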

9.2.2 

Time Reversal 

Time Reversal in Quantum Mechanics 

Time reversal of a process means to make it proceed backward in time. In the macro-world,
when a glass is dropped on the floor, the reverse process is that the broken
pieces fly back to where the glass was broken and amalgamate to shape the original
glass. Everybody knows this does not happen. But in the micro-world, Newton's
equation of motion under the influence of a force $F(x)$ that does not depend on time,

$$ m\frac{d^2x}{dt^2} = F(x) \qquad (9.95) $$

is invariant under the time reversal operation $t \to -t$, and this is indeed realized.
Physical objects that are dependent on time, for example the velocity $v = dx/dt$,
the momentum $p = mv$, the angular momentum $L = x \times p$, change their sign
under time reversal and are suitable observables to test the hypothesis. Take the
typical example of π-p scattering:

$$ \pi(p_1) + p(p_2) \to \pi(p_3) + p(p_4) \qquad (9.96) $$

3) If the so-called Gamow-Teller nuclear transitions with $|\Delta J| = 1$ and no parity change are
selected, the vector terms do not contribute ($C_V = C'_V = 0$).
The time-reversed process is a reaction where the outgoing particles reverse their
momenta, go back to where they were scattered, undergo the reverse interaction
and recover the initial states with their momenta reversed (Fig. 9.1):

$$ \pi(-p_3) + p(-p_4) \to \pi(-p_1) + p(-p_2) \qquad (9.97) $$

Namely, their positions are unchanged, but their momenta are reversed and the initial
and final states are interchanged. Expressing this in terms of the transition amplitude,

$$ \langle T\phi(p_f)|T\psi(p_i)\rangle = \langle \psi'(p'_i)|\phi'(p'_f)\rangle = \langle \phi(-p_f)|\psi(-p_i)\rangle^* \qquad (9.98) $$

The time-reversal operation seems to include the complex conjugate operation in
addition to an ordinary unitary transformation. Let us see if the notion is valid in
quantum mechanics. Consider a Schrödinger equation

$$ i\frac{\partial\psi(t)}{\partial t} = H\psi(t) \qquad (9.99) $$

and reverse the time direction of the wave function ($H$ is independent of time):

$$ t \to t' = -t, \qquad \psi(t) \to \psi'(t') = T\psi(t) \qquad (9.100) $$

$$ i\frac{\partial\psi'(t')}{\partial t'} = H'\psi'(t'), \qquad H' = THT^{-1} \qquad (9.101) $$

If $H' = H$,

$$ -i\frac{\partial(T\psi)}{\partial t} = H(T\psi) \qquad (9.102) $$

This shows that in order for the transformed wave function $T\psi$ to satisfy the same
equation as Eq. (9.99), requiring $THT^{-1} = H$ is not enough; it is also necessary
to take the complex conjugate of both sides. The process is known as Wigner's
time reversal. In other words, Eq. (9.98) is a necessary condition for the time reversal
operation in quantum mechanics. Under time reversal, the wave function is
transformed to

$$ \psi(t) \to \psi'(t') = T\psi(t) = \psi^*(-t) \qquad (9.103) $$
When $\psi$ represents a plane wave with momentum $p$,

$$ \psi(t,x;p) \sim e^{i(p\cdot x - Et)} \to \left[e^{i(p\cdot x - Et')}\right]^*\Big|_{t'=-t} = e^{i(-p\cdot x - Et)} \qquad (9.104) $$

time reversal reverses the momentum of the state to $-p$ as expected. In general, a
transformation of the form

$$ \psi(t) = a\phi_1(t) + b\phi_2(t) \to \psi' = a^*\phi'_1(t') + b^*\phi'_2(t') \qquad (9.105) $$

is called an antiunitary transformation. This means the T operation is expressed
as a product of a unitary transformation $U$ and the complex conjugation operation $K$.
Time-reversal invariance means the observables do not change under the operation
Eq. (9.98). As the observables are expressed as the square of the transition
amplitude, the antiunitarity of T does not produce any contradiction.
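The role of complex conjugation can be checked numerically on a plane wave. The sketch below (one space dimension, units with ħ = 1, hypothetical helper names) applies the rule (Tψ)(t, x) = ψ*(−t, x) and reads off momentum and energy by finite differences; it also shows that flipping the time alone, without conjugation, would give a negative energy.

```python
import cmath


def plane_wave(p, energy):
    """psi(t, x) = exp(i (p x - E t)), with hbar = 1."""
    return lambda t, x: cmath.exp(1j * (p * x - energy * t))


def time_reversed(psi):
    """Wigner time reversal of a spinless wave function: psi(t, x) -> psi(-t, x)^*."""
    return lambda t, x: psi(-t, x).conjugate()


def time_flipped_only(psi):
    """Naive reversal t -> -t without complex conjugation (for comparison)."""
    return lambda t, x: psi(-t, x)


def momentum_of(psi, t=0.3, x=0.7, h=1e-6):
    """Eigenvalue of -i d/dx, estimated by a central finite difference."""
    derivative = (psi(t, x + h) - psi(t, x - h)) / (2 * h)
    return (-1j * derivative / psi(t, x)).real


def energy_of(psi, t=0.3, x=0.7, h=1e-6):
    """Eigenvalue of i d/dt, estimated by a central finite difference."""
    derivative = (psi(t + h, x) - psi(t - h, x)) / (2 * h)
    return (1j * derivative / psi(t, x)).real


if __name__ == "__main__":
    psi = plane_wave(p=2.0, energy=3.0)
    reversed_psi = time_reversed(psi)
    print(momentum_of(reversed_psi), energy_of(reversed_psi))   # momentum flips, energy does not
    print(energy_of(time_flipped_only(psi)))                    # negative without conjugation
```

The antilinearity is visible in the same helpers: multiplying ψ by a constant a multiplies the reversed function by a*.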

In fact, the necessity as well as consistency of complex conjugation can be shown
in many examples. For instance, in classical mechanics $p = m\,dx/dt \to -p$, but
in quantum mechanics $p$ is a space differential operator and has no $t$ dependence.
By complex conjugation it obtains the right sign. Besides, the quantum condition

$$ [x_i, p_j] = i\delta_{ij} \qquad (9.106) $$

and the commutator of the angular momentum

$$ [L_i, L_j] = i\epsilon_{ijk}L_k \qquad (9.107) $$

are not invariant for the change $p \to -p$, $L \to -L$ but are invariant after an
additional complex conjugate operation. In conclusion, the time-reversal operation
requires, in addition to flipping the time, the complex conjugate to be taken of all
c-numbers in the equation.

Problem 9.4 

Show the transformation properties under the time-reversal operation:

Electric field:  $E(t) \to E'(-t) = E(t)$
Magnetic field:  $B(t) \to B'(-t) = -B(t)$
Potential:       $A^\mu = (\phi, A) \to A'^\mu(t') = (\phi'(-t), A'(-t)) = (\phi(t), -A(t)) = A_\mu(t)$     (9.108)



Time Reversal of the S-Matrix 

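The S-matrix is defined as $\lim_{t\to\infty,\,t'\to-\infty} U(t, t')$ [Eqs. (6.11), (6.12)]. Since $T\,U(t)\,T^{-1} =
U(-t)$, it is obvious that $TST^{-1} = S^\dagger$. Another way of seeing this is to use

$$ \langle \mathrm{out}: q_f|q_i: \mathrm{in}\rangle = \langle \mathrm{out}: q_f|S|q_i: \mathrm{out}\rangle = \langle \mathrm{in}: q_f|S|q_i: \mathrm{in}\rangle \qquad (9.109) $$

From the first expression,

$$ \langle \mathrm{out}: q_f|q_i: \mathrm{in}\rangle = \langle \mathrm{out}: q_f|T^{-1}T|q_i: \mathrm{in}\rangle = \langle \mathrm{in}: T(q_f)|T(q_i): \mathrm{out}\rangle^* $$
$$ = \langle \mathrm{out}: T(q_i)|T(q_f): \mathrm{in}\rangle = \langle \mathrm{out}: T(q_i)|S|T(q_f): \mathrm{out}\rangle \qquad (9.110a) $$

where we have used $T|\mathrm{in}\rangle = |\mathrm{out}\rangle$, $T|\mathrm{out}\rangle = |\mathrm{in}\rangle$ to go to the third expression. From
the last expression in Eq. (9.109), we obtain

$$ \langle \mathrm{in}: q_f|S|q_i: \mathrm{in}\rangle = \langle \mathrm{in}: q_f|T^{-1}\,TST^{-1}\,T|q_i: \mathrm{in}\rangle $$
$$ = \langle \mathrm{out}: T(q_f)|(TST^{-1})|T(q_i): \mathrm{out}\rangle^* $$
$$ = \langle \mathrm{out}: T(q_i)|(TST^{-1})^\dagger|T(q_f): \mathrm{out}\rangle \qquad (9.110b) $$

Comparing the last expressions of Eq. (9.110a) and Eq. (9.110b), we conclude

$$ TST^{-1} = S^\dagger \qquad (9.111) $$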

Time Reversal of Partial Waves 

As the spin operator has no correspondence in classical mechanics, we are not sure
how it transforms under the T operation. Let us assume its transformation property
is the same as that of the orbital angular momentum; then $s \sim x \times p$ changes its
sign, too. Therefore

$$ TJ_iT^{-1} = -J_i, \qquad TJ_\pm T^{-1} = -J_\mp \qquad (9.112a) $$

$$ J_z\,T|JM\rangle = -T J_z|JM\rangle = -M\,T|JM\rangle $$
$$ \therefore\quad T|JM\rangle = \eta(J,M)\,|J,-M\rangle \qquad (9.112b) $$

The second equality of the first line follows from complex conjugation of $i$. To determine
the phase $\eta$, we use the conventional relation among the angular momentum
eigenstates:

$$ TJ_-|JM\rangle = [(J+M)(J-M+1)]^{1/2}\,T|J,M-1\rangle $$
$$ = [(J+M)(J-M+1)]^{1/2}\,\eta(J,M-1)\,|J,-M+1\rangle $$
$$ = -J_+T|J,M\rangle = -\eta(J,M)\,J_+|J,-M\rangle $$
$$ = -\eta(J,M)\,[(J+M)(J-M+1)]^{1/2}\,|J,-M+1\rangle $$
$$ \therefore\quad \eta(J,M) = -\eta(J,M-1) \qquad (9.113a) $$

This equation means

$$ \eta(J,M) = \eta(J)\,(-1)^{-M} \qquad (9.114) $$

$\eta(J)$ is a factor independent of $M$, and can be chosen considering the rotational
invariance of the S-matrix and the antiunitarity of the T reversal. By choosing

$$ \eta(J) = (-1)^J \;{}^{4)} \qquad (9.115) $$

we can fix the phase as

$$ T|JM\rangle = (-1)^{J-M}\,|J,-M\rangle \qquad (9.116) $$

The phase of the helicity state can be determined similarly:

$$ T|JM\lambda_1\lambda_2\rangle = (-1)^{J-M}\,|J,-M,\lambda_1\lambda_2\rangle \qquad (9.117) $$

To prove the equality, we use the fact that the helicity does not change its sign under
T and that the T-transformed one-particle state $|p\lambda\rangle$ is $\eta\,|-p\,\lambda\rangle$, which is equivalent
to a 180° rotation about the y axis. Namely



where $\epsilon$ is a phase factor. The rest of the argument uses the same logic as when we
derived the parity of the state in Eq. (9.87).



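Problem 9.5

Prove that the T-operated helicity eigenfunctions for a spin 1/2 particle given by
Eq. (4.14)

$$ \chi_+ = \begin{pmatrix} \cos\frac{\theta}{2}\,e^{-i\phi/2} \\ \sin\frac{\theta}{2}\,e^{i\phi/2} \end{pmatrix}, \qquad \chi_- = \begin{pmatrix} -\sin\frac{\theta}{2}\,e^{-i\phi/2} \\ \cos\frac{\theta}{2}\,e^{i\phi/2} \end{pmatrix} \qquad (9.119) $$

are given by

$$ T\chi_\pm = -i\sigma_2\chi_\pm^* = (-1)^{S-M}\chi_\mp \qquad (9.120) $$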
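A quick numerical check of this statement can be made with two-component spinors, taking the time-reversal rule as Tχ = −iσ₂χ* and the spinors χ± in the (θ, φ) parametrization above. The helper names are illustrative; the sample angles are arbitrary.

```python
import cmath
import math


def chi_plus(theta, phi):
    """Helicity +1/2 spinor for the direction (theta, phi)."""
    return [math.cos(theta / 2) * cmath.exp(-1j * phi / 2),
            math.sin(theta / 2) * cmath.exp(1j * phi / 2)]


def chi_minus(theta, phi):
    """Helicity -1/2 spinor for the direction (theta, phi)."""
    return [-math.sin(theta / 2) * cmath.exp(-1j * phi / 2),
            math.cos(theta / 2) * cmath.exp(1j * phi / 2)]


def time_reverse(chi):
    """T chi = -i sigma_2 chi^*, where -i sigma_2 = [[0, -1], [1, 0]]."""
    up, down = chi[0].conjugate(), chi[1].conjugate()
    return [-down, up]


def close(u, v, tol=1e-12):
    return all(abs(a - b) < tol for a, b in zip(u, v))


if __name__ == "__main__":
    theta, phi = 0.7, 1.9
    plus, minus = chi_plus(theta, phi), chi_minus(theta, phi)
    print(close(time_reverse(plus), minus))                   # T chi_+ = +chi_-
    print(close(time_reverse(minus), [-c for c in plus]))     # T chi_- = -chi_+
    print(close(time_reverse(time_reverse(plus)), [-c for c in plus]))  # T^2 = -1 for spin 1/2
```

The two signs reproduce the factor (−1)^{S−M} with S = 1/2 and M = ±1/2, and applying T twice returns −χ, as expected for a half-integer spin.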

Finally, as to the helicity scattering amplitude, using the antiunitarity of the T
transformation and Eq. (9.117) we find

$$ \langle \lambda_3\lambda_4|S^J|\lambda_1\lambda_2\rangle = \langle \lambda_1\lambda_2|S^J|\lambda_3\lambda_4\rangle \qquad (9.121) $$

Time Reversal of Fields 

Scalar and Vector Fields We define the T reversal of a scalar field that obeys the
Klein-Gordon equation by

$$ \phi(t,x) \xrightarrow{T} \phi'(x') = T\phi T^{-1} = \phi'(-t,x) \qquad (9.122) $$

Since the free-field Lagrangian is bilinear in form in $\phi$ and $\phi^\dagger$, both choices

$$ \phi' = \phi \quad\text{and}\quad \phi' = \phi^\dagger \qquad (9.123) $$

4) The conventional spherical harmonic function $Y_{LM}$ is defined with $\eta(J) = 1$. To be consistent
with our definition, it has to be changed to $Y_{LM} \to i^L Y_{LM}$, but this is not a problem in this book.
are possible. But here we impose a constraint by requiring that the wave function
(denoted as $\psi$ = c-number) should respect the rule we saw in quantum mechanics.
As the field $\phi$ can be expanded in terms of plane waves as

$$ \phi = \sum_k \frac{1}{\sqrt{2\omega V}}\left[a_k e^{-ik\cdot x} + b_k^\dagger e^{ik\cdot x}\right] \qquad (9.124) $$

the wave function $\psi(t,x)$ describing a particle with momentum $p$ can be extracted
by

$$ \psi(t,x) = \langle 0|\phi(t,x)|p\rangle \qquad (9.125) $$

Similarly, the T-reversed state should be able to be extracted by

$$ \psi'(-t,x) = \langle T0|\phi(t,x)|Tp\rangle \qquad (9.126) $$

But a c-number wave function obeys the rule of Eqs. (9.98) and (9.103):

$$ \psi'(-t,x) = \psi(-t,x)^* = \langle 0|\phi(-t,x)|p\rangle^* $$
$$ = \langle T0|T\{\phi(-t,x)\,p\}\rangle = \langle T0|T\phi(-t,x)T^{-1}|Tp\rangle \qquad (9.127) $$

Comparing Eqs. (9.126) and (9.127), we can choose the T-transformed field as

$$ T\phi(t,x)T^{-1} = \phi(-t,x) \qquad (9.128) $$

subject to the additional constraint of taking the complex conjugate of all the
c-numbers.5) Referring to the expansion formula of $\phi$, the transformation properties
of the annihilation and creation operators are

$$ a_k \to Ta_kT^{-1} = a_{-k}, \qquad b_k^\dagger \to Tb_k^\dagger T^{-1} = b_{-k}^\dagger \qquad (9.129) $$

The transformation is consistent with the expression

$$ |k\rangle = \sqrt{2\omega}\,a_k^\dagger|0\rangle \to \sqrt{2\omega}\,a_{-k}^\dagger|0\rangle = |-k\rangle \qquad (9.130) $$

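Under T reversal, the current operator of the scalar field

$$ j^\mu = iq(\phi^\dagger\partial^\mu\phi - \partial^\mu\phi^\dagger\,\phi) \qquad (9.131) $$

changes its argument $t \to -t$ and $i \to -i$. Therefore the transformation is

$$ Tj^0(t,x)T^{-1} = j^0(-t,x), \qquad Tj(t,x)T^{-1} = -j(-t,x) $$
$$ \text{i.e.}\quad j^\mu(t,x) \xrightarrow{T} j_\mu(-t,x) \qquad (9.132) $$

As the vector field obeys Proca's equation

$$ \partial_\mu f^{\mu\nu} + m^2V^\nu = j^\nu, \qquad f^{\mu\nu} = \partial^\mu V^\nu - \partial^\nu V^\mu \qquad (9.133) $$

the consistency argument tells us that

$$ V^\mu(t,x) \xrightarrow{T} V_\mu(-t,x) \qquad (9.134) $$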
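Dirac Field As the component of angular momentum changes its sign under time
reversal, rearrangement of the spin component is necessary for the Dirac field.
What we have to do is to find a transformation such that

$$ \psi(x) \xrightarrow{T} T\psi(x)T^{-1} = \psi'(t',x) = B\psi(-t,x) \qquad (9.135) $$

satisfies the same Dirac equation. Here, $B$ is a c-number 4×4 matrix that operates
on spin indices. Consider the Dirac equation for the wave field

$$ (\gamma^\mu i\partial_\mu - m)\psi(t,x) = 0 \;\xrightarrow{T}\; (\gamma^\mu i\partial'_\mu - m)\psi'(t',x) = 0 \qquad (9.136) $$

where $\partial'_\mu = (-\partial_0, \nabla)$. Substituting Eq. (9.135) in Eq. (9.136), taking the complex
conjugate of the c-numbers and multiplying by $B^{-1}$ from the left, we obtain

$$ B^{-1}\left[\gamma^{0*}(-i)(-\partial_0) + \gamma^*\cdot(-i\nabla) - m\right]B\psi(-t,x) $$
$$ = \left[B^{-1}\gamma^{0*}B\,(i\partial_0) + B^{-1}\gamma^*B\cdot(-i\nabla) - m\right]\psi(-t,x) = 0 \qquad (9.137) $$

Therefore, if $B$ which satisfies the following relations exists

$$ B^{-1}\gamma^{0*}B = \gamma^0, \qquad B^{-1}\gamma^{k*}B = -\gamma^k \quad (k = 1, 2, 3) \qquad (9.138) $$

the time reversed field $\psi'$ satisfies the same equation and T transformation invariance
holds. Considering $\gamma^{2*} = -\gamma^2$, $\gamma^{\mu*} = \gamma^\mu$ ($\mu \neq 2$), we can adopt as $B$

$$ B = i\gamma^1\gamma^3 = -i\gamma^5C = -\Sigma_2 = \begin{pmatrix} -\sigma_2 & 0 \\ 0 & -\sigma_2 \end{pmatrix} \qquad (9.139) $$

$$ \psi'(t') = T\psi(t)T^{-1} = B\psi(-t) = i\gamma^1\gamma^3\psi(-t) \qquad (9.140a) $$

$$ \bar\psi'(t') = T\bar\psi(t)T^{-1} = \bar\psi(-t)B^{-1} = \bar\psi(-t)\,i\gamma^1\gamma^3 \qquad (9.140b) $$

The matrix $B$ has the property that

$$ B = B^\dagger = B^{-1} = -B^*, \qquad B^{-1}\gamma^{\mu*}B = \gamma_\mu \qquad (9.141) $$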
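The algebraic properties of the matrix B = iγ¹γ³ can be verified by brute force. The sketch below assumes the Dirac (standard) representation of the gamma matrices, which is an assumption on my part, and uses small hand-written 4×4 helpers (all names are illustrative) so that it runs with the standard library alone.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def scale(c, a):
    return [[c * z for z in row] for row in a]


def conj(a):
    return [[complex(z).conjugate() for z in row] for row in a]


def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def block(ul, ur, ll, lr):
    """Assemble a 4x4 matrix from four 2x2 blocks."""
    return [ul[0] + ur[0], ul[1] + ur[1], ll[0] + lr[0], ll[1] + lr[1]]


I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = ([[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]])

# Dirac representation (assumed): gamma^0 = diag(1, 1, -1, -1), gamma^k = [[0, s_k], [-s_k, 0]]
GAMMA = [block(I2, Z2, Z2, scale(-1, I2))] + [block(Z2, s, scale(-1, s), Z2) for s in SIGMA]
METRIC = (1, -1, -1, -1)                      # gamma_mu = g_{mu mu} gamma^mu
IDENTITY4 = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
B = scale(1j, matmul(GAMMA[1], GAMMA[3]))     # B = i gamma^1 gamma^3


def check_b_matrix():
    """Check B = B^dagger = B^{-1} = -B^* and B^{-1} gamma^{mu*} B = gamma_mu."""
    results = {
        "hermitian": close(B, dagger(B)),
        "own_inverse": close(matmul(B, B), IDENTITY4),
        "minus_conjugate": close(B, scale(-1, conj(B))),
    }
    for mu in range(4):
        lhs = matmul(matmul(B, conj(GAMMA[mu])), B)   # uses B^{-1} = B
        results["gamma_%d" % mu] = close(lhs, scale(METRIC[mu], GAMMA[mu]))
    return results


if __name__ == "__main__":
    print(check_b_matrix())
```

In this representation B is block diagonal with −σ₂ in each block, which makes the spin-flip interpretation explicit.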

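Therefore, the transformation property of the Dirac bilinear vector is given by

$$ \bar\psi_1(t)\gamma^\mu\psi_2(t) \xrightarrow{T} \bar\psi'_1(-t)\gamma^{\mu*}\psi'_2(-t) = \bar\psi_1B^{-1}\gamma^{\mu*}B\psi_2 = \bar\psi_1(-t)\gamma_\mu\psi_2(-t) \qquad (9.142a) $$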



The transformation properties of other Dirac bilinears are given similarly and we 
list them in Table 9.3. 
Table 9.3 Properties of Dirac bilinears under T transformation.

       $S(t,x)$     $P(t,x)$      $V^\mu(t,x)$     $A^\mu(t,x)$     $T^{\mu\nu}(t,x)$

T      $S(-t,x)$    $-P(-t,x)$    $V_\mu(-t,x)$    $A_\mu(-t,x)$    $-T_{\mu\nu}(-t,x)$

Note 1: $S = \bar\psi\psi$, $P = i\bar\psi\gamma^5\psi$, $V^\mu = \bar\psi\gamma^\mu\psi$, $A^\mu = \bar\psi\gamma^\mu\gamma^5\psi$, $T^{\mu\nu} = \bar\psi\sigma^{\mu\nu}\psi$
Note 2: $V^\mu = (V^0, V)$, $V_\mu = (V^0, -V)$



Experimental Tests 

Principle of Detailed Balance: The amplitude of a scattering process from an initial
state $|i\rangle$ to a final state $|f\rangle$ and that of its inverse $f \to i$ are related by Eq. (9.121). The
cross section of a process $A + B \to C + D$, assuming both A and B are unpolarized,
is expressed as

$$ \frac{d\sigma}{d\Omega}(AB\to CD) = \frac{1}{(2S_A+1)(2S_B+1)}\sum_\lambda|f_{\lambda_C\lambda_D,\lambda_A\lambda_B}|^2 \qquad (9.143) $$

On the other hand, the cross section of the inverse process is, again assuming both
C and D are unpolarized,

$$ \frac{d\sigma}{d\Omega}(CD\to AB) = \frac{1}{(2S_C+1)(2S_D+1)}\sum_\lambda|f_{\lambda_A\lambda_B,\lambda_C\lambda_D}|^2 \qquad (9.144) $$

As Eqs. (9.121) and (9.47) mean

$$ p_{AB}^2\sum_\lambda|f_{\lambda_C\lambda_D,\lambda_A\lambda_B}|^2 = p_{CD}^2\sum_\lambda|f_{\lambda_A\lambda_B,\lambda_C\lambda_D}|^2 \qquad (9.145) $$

where $p_{AB}$ and $p_{CD}$ are the momenta of the particles in the CM of AB and CD, respectively,
we obtain

$$ \frac{d\sigma(AB\to CD)}{d\sigma(CD\to AB)} = \frac{p_{CD}^2\,(2S_C+1)(2S_D+1)}{p_{AB}^2\,(2S_A+1)(2S_B+1)} \qquad (9.146) $$

The equation is referred to as the "principle of detailed balance".
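The detailed-balance ratio is simple enough to evaluate directly. The sketch below implements the ratio p_CD²(2S_C+1)(2S_D+1) / [p_AB²(2S_A+1)(2S_B+1)] for unpolarized cross sections; the function name is illustrative, and the momenta in the example are placeholder values rather than measured kinematics. The ground-state spins used in the example (proton 1/2, ²⁷Al 5/2, α 0, ²⁴Mg 0) are standard nuclear data.

```python
def detailed_balance_ratio(p_ab, p_cd, spins_ab, spins_cd):
    """Return dsigma(AB -> CD) / dsigma(CD -> AB) for unpolarized particles.

    p_ab, p_cd : CM momenta in the AB and CD channels (same units).
    spins_ab   : (S_A, S_B); spins_cd : (S_C, S_D).
    """
    s_a, s_b = spins_ab
    s_c, s_d = spins_cd
    weight_cd = (2 * s_c + 1) * (2 * s_d + 1)
    weight_ab = (2 * s_a + 1) * (2 * s_b + 1)
    return (p_cd ** 2 * weight_cd) / (p_ab ** 2 * weight_ab)


if __name__ == "__main__":
    # p + 27Al -> alpha + 24Mg, with equal (placeholder) CM momenta in both channels
    print(detailed_balance_ratio(1.0, 1.0, (0.5, 2.5), (0, 0)))   # spin weights alone give 1/12
```

Comparing this prediction with the measured ratio of forward and reverse cross sections at the same CM energy is what constitutes the test of T invariance.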

Strong Interaction As an example of the application of the principle of detailed balance
to test time-reversal invariance, we show both cross sections of the reactions
p + $^{27}$Al ⇄ α + $^{24}$Mg in Fig. 9.2. The data show T-reversal invariance is preserved
in the strong interaction to better than $10^{-3}$.

Weak Interaction 

There is evidence that small CP violation exists in the kaon and B-meson decays
(see Chap. 16), hence the same amount of T violation is expected to exist as long
as CPT invariance holds, which is discussed in Sect. 9.3.3. In fact, a T-violation
effect consistent with CPT invariance has been observed in the neutral K meson
decays [107], which will be described in detail in Chap. 16. With this one exception,
there is no other evidence of T violation. As it offers some of the most sensitive tests
for models of new physics, we will discuss some of the experimental work below.

Observables that violate T invariance can be constructed as a triple product of
T-odd variables like $\sigma\cdot(p_1\times p_2)$, $p_1\cdot(p_2\times p_3)$. In the first example, $\sigma$ can be prepared
by a polarized beam, polarized targets or the spin of scattered/decayed particles.

$K^+ \to \pi^0 + \mu^+ + \nu_\mu$: In the decay $K^+ \to \pi^0 + \mu^+ + \nu_\mu$, the transverse
component of the muon polarization relative to the decay plane determined by $p_\mu\times p_\pi$
[which is $\sigma_\mu\cdot(p_\mu\times p_\pi)$] was determined to be $(1.7 \pm 2.5)\times10^{-3}$ [225, 226]. The
imaginary (i.e. T-violating) part of the decay amplitude was also determined to be
Im($\xi$) = $-0.006 \pm 0.008$ (see Sect. 15.6.3 for the definition of $\xi$).

$n \to p + e^- + \bar\nu_e$: Another example is the triple correlation $D$ of the neutron
polarization and the momenta of the electron and antineutrino in the beta decay
$n \to p\,e^-\bar\nu_e$. It is defined as

$$ dW \propto 1 + D\,\frac{P_n\cdot(p_e\times p_{\bar\nu})}{E_e E_{\bar\nu}} \qquad (9.147a) $$

where $P_n$ is the polarization of the neutron. The experimental value is given as
$D = (-4 \pm 6)\times10^{-4}$ [350]. The data can be used to determine the imaginary part of the
coupling constant. The relative phase $\phi_{AV}$ is related to $D$ by

$$ \lambda = \left|\frac{g_A}{g_V}\right|, \qquad \phi_{AV} = \mathrm{Arg}\left[\frac{g_A}{g_V}\right] \qquad (9.147b) $$

$$ D = \frac{2\lambda\sin\phi_{AV}}{1 + 3\lambda^2} \qquad (9.147c) $$

where $g_A$, $g_V$ are the axial and vector coupling constants of the weak interaction [$C_V$
and $C_A$ of Eq. (9.92)]. $\phi_{AV}$ is given as $180.06^\circ \pm 0.07^\circ$.

Figure 9.2 Test of detailed balance and time reversibility in the reaction $^{27}$Al + p ⇄ $^{24}$Mg + α.
The intensity of the T-violating effect, $|f_{T\text{-violating}}/f_{T\text{-inv}}|^2$, is smaller than $5\times10^{-4}$ [66].
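To see how a measured triple correlation translates into a relative phase, the sketch below uses the commonly quoted relation D = 2λ sin φ_AV / (1 + 3λ²). The value λ = |g_A/g_V| ≈ 1.27 is an assumed input that does not appear in the text, and sign conventions for D differ between references, so this is an illustration rather than a definitive conversion.

```python
import math


def triple_correlation(lam, phi_av_deg):
    """D = 2 lam sin(phi_AV) / (1 + 3 lam^2), with lam = |g_A / g_V| (assumed relation)."""
    return 2.0 * lam * math.sin(math.radians(phi_av_deg)) / (1.0 + 3.0 * lam ** 2)


def phase_from_d(lam, d):
    """Invert the relation on the branch near 180 degrees."""
    sine = d * (1.0 + 3.0 * lam ** 2) / (2.0 * lam)
    return 180.0 - math.degrees(math.asin(sine))


if __name__ == "__main__":
    lam = 1.27                                   # assumed |g_A / g_V|
    print(triple_correlation(lam, 180.06))       # of order -5e-4
    print(phase_from_d(lam, -4e-4))              # slightly above 180 degrees
```

A phase of exactly 180° (or 0°) gives D = 0, i.e. a real ratio g_A/g_V and no T violation in this observable.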
The Electric Dipole Moment of the Neutron

The definition of the electric dipole moment (EDM) is classically

$$ d = \int \rho(x)\,x\,d^3x \qquad (9.148) $$

which is nonzero if the charge distribution within an object is polarized, in other
words, not distributed evenly. When the object is a particle like the neutron,
regardless of whether it is of finite or point size, its only attribute that has directionality
is the spin $\sigma$. If the particle has a finite $d$, it has to be proportional to $\sigma$. While
the transformation property of $\sigma$ under P, T is $+$, $-$, respectively, that of $d$ is $-$, $+$,
as can be seen from Eq. (9.148). Therefore the existence of the EDM of a particle
violates both P and T. As the neutron is a neutral composite of quarks, its EDM can
be a sensitive test of T-reversal invariance in the strong interaction sector as well as
the weak interaction. The interaction of the magnetic and electric dipole moments
with the electromagnetic field is given by

$$ H_{INT} = -\mu\cdot B - d\cdot E \qquad (9.149) $$

Its relativistic version can be expressed as



Problem 9.6
Show that in the nonrelativistic limit, Eq. (9.150) reduces to Eq. (9.149).

For an $s = 1/2$ particle, depending on the magnetic quantum number $m =
\pm1/2$, the energy level splits in the electromagnetic field. When an oscillating field
is injected, a resonance occurs corresponding to the Larmor frequency

$$ \Delta E = 2\mu B \pm 2dE = h(\nu \pm \Delta\nu) \qquad (9.151) $$

Here $\pm$ corresponds to the directions of the magnetic and electric fields being
either parallel or antiparallel to each other. Therefore from the deviation $\Delta\nu$ of
the resonance frequency when the electric field is applied, one can determine the
strength of the EDM. What has to be carefully arranged experimentally is the parallel
alignment of the electric field relative to the magnetic field, because if there
exists a perpendicular component $E_\perp$, a magnetic field $\gamma\,v\times E_\perp$ is induced for a
system moving with velocity $v$ and gives a false signal.
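The size of the effect can be estimated from ΔE = 2μB ± 2dE = h(ν ± Δν). The sketch below evaluates the Larmor frequency ν = 2μB/h and the shift Δν = 2dE/h; the numerical constants (Planck constant, neutron magnetic moment of about 1.913 nuclear magnetons) are standard values supplied by me, and the field strengths in the example are illustrative inputs.

```python
PLANCK_EV_S = 4.135667696e-15          # Planck constant h in eV s
NEUTRON_MOMENT_EV_PER_T = 6.0308e-8    # |mu_n| ~ 1.913 nuclear magnetons, in eV/T


def larmor_frequency_hz(b_tesla):
    """nu = 2 mu B / h for a spin-1/2 neutron."""
    return 2.0 * NEUTRON_MOMENT_EV_PER_T * b_tesla / PLANCK_EV_S


def edm_frequency_shift_hz(d_e_cm, e_volt_per_cm):
    """Delta nu = 2 d E / h, with d in units of e cm and E in V/cm (so d E is in eV)."""
    return 2.0 * d_e_cm * e_volt_per_cm / PLANCK_EV_S


if __name__ == "__main__":
    print(larmor_frequency_hz(1e-6))               # about 29 Hz in a 1 microtesla field
    print(edm_frequency_shift_hz(2.9e-26, 1e4))    # about 1.4e-7 Hz at 10 kV/cm
```

The ratio of the two numbers, roughly 5 × 10⁻⁹, indicates how stable the magnetic field must be between measurements with opposite electric field directions.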

The experiment uses ultracold neutrons (UCN) of
$T \sim 0.002$ K, which have energy $\sim 2\times10^{-7}$ eV and velocity $v \sim 6$ m s$^{-1}$. This is
the energy the neutron obtains in the earth's gravitational field when it drops 2 m.
The de Broglie wavelength is very long ($\lambda \sim 670$ Å), and consequently the neutron
interferes coherently with material, allowing the index of refraction $n$ to be defined:

$$ n = \left[1 - \frac{\lambda^2 N a_{coh}}{\pi} \pm \frac{\mu B}{Mv^2/2}\right]^{1/2} \qquad (9.152) $$
Figure 9.3 (a) The neutron EDM experimental apparatus of the RAL/Sussex experiment
at ILL [37, 198]. The bottle is a 20 liter storage cell composed of a hollow upright quartz
cylinder closed at each end by aluminum electrodes that are coated with a thin layer of
carbon. A highly uniform 1 µT magnetic field $B_0$ parallel to the axis of the bottle is generated by
a coil and the electric field ($E_0$) is generated by applying high voltage between the
electrodes. The storage volume is situated within four layers of mu metal, giving a shielding
factor of about 10 000 against external magnetic fluctuations. (b) A magnetic resonance plot
(horizontal axis: applied frequency in Hz) showing the UCNs with spin-up count after
the spin precessing magnetic field has been applied. The peak (valley) corresponds to the
[...] the analyzing foil. The measured points to detect the EDM effect are marked as four
crosses. The corresponding pattern for spin down is inverted but otherwise identical.

where $N$ is the number of nuclei per unit volume, $a_{coh}$ is the coherent forward
scattering amplitude and $Mv^2/2$ is the kinetic energy of the neutron. The $\pm$ sign
depends on whether the magnetic moment and the field are parallel or antiparallel.
Inserting the actual values, we obtain a total reflection angle $\sim 5°$ for $v \sim 80$ m s$^{-1}$,
but for a velocity $v < 6$ m s$^{-1}$ the angle exceeds 90° and total reflection is obtained at
any angle. Namely, the neutron, if it is slow enough, can be transported through a
bent tube just like light through an optical fiber and can also be confined in a bottle
for a long duration of time (referred to as a magnetic bottle or a neutron bottle).
The fact that the refractive index differs depending on the polarization is used to separate
neutrons of different polarization.
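The quoted UCN numbers hang together and can be reproduced from a 2 m drop in the earth's gravitational field. The sketch below does this with standard constants supplied by me (neutron mass, g, Boltzmann constant, hc); the function and key names are illustrative.

```python
import math

NEUTRON_MASS_EV = 939.565e6      # m_n c^2 in eV
SPEED_OF_LIGHT = 299792458.0     # m/s
GRAVITY = 9.80665                # m/s^2
BOLTZMANN_EV_PER_K = 8.617333e-5 # k_B in eV/K
HC_EV_M = 1.23984198e-6          # h c in eV m


def ucn_after_drop(height_m):
    """Energy, velocity, temperature and de Broglie wavelength gained in a free fall."""
    velocity = math.sqrt(2.0 * GRAVITY * height_m)                         # m/s
    energy = NEUTRON_MASS_EV * GRAVITY * height_m / SPEED_OF_LIGHT ** 2    # eV, = m g h
    temperature = energy / BOLTZMANN_EV_PER_K                              # K, from E = k_B T
    momentum_ev = NEUTRON_MASS_EV * velocity / SPEED_OF_LIGHT              # p c in eV
    wavelength_angstrom = HC_EV_M / momentum_ev * 1e10                     # lambda = h / p
    return {"energy_eV": energy, "velocity_m_s": velocity,
            "temperature_K": temperature, "wavelength_A": wavelength_angstrom}


if __name__ == "__main__":
    for name, value in ucn_after_drop(2.0).items():
        print(name, value)
```

The result is an energy of about 2 × 10⁻⁷ eV, a velocity just above 6 m/s, a temperature of roughly 2 mK and a wavelength of several hundred ångström, in line with the values quoted in the text.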

Thermal neutrons emerging from the deuterium moderator of a reactor are 
transported through a curved nickel pipe and further decelerated by a totally re- 
flecting turbine before arriving at the entrance of the apparatus. Figure 9.3 shows 
the apparatus for the experiment [37, 198]. 

A magnetized iron-cobalt foil of 1 µm thickness is used to block neutrons of one
spin orientation using the principle of total reflection stated above. The neutron
bottle, a cylindrical 20-liter trap within a 1 µT uniform magnetic field $B_0$, is able to
store neutrons for more than 100 s. The electric field, of approximately 10 kV/cm,
was generated by applying high voltage (HV) to the electrode. After application of a
resonant oscillating magnetic field (~ 30 Hz) perpendicular to $B_0$, which turns the
neutron spin perpendicular to the magnetic field and makes it precess freely, the
shutter is opened, and the neutrons leave the bottle, dropping and passing through
the magnetized polarizing foil again, which this time acts as an analyzer of the polarization.
Only those that remain in the initial spin state can reach the detector, which is
located below the UCN polarizing foil. The ratio of the two spin states is determined
by the exact value of the applied frequency, as shown in Eq. (9.151).

Four points, marked with crosses, slightly off resonance, are measured where
the slope is steepest. The existence of the EDM should appear as a frequency shift
when the electric field is applied. The measurement did not detect any shift, giving
the upper limit of the dipole moment of the neutron as

$$ d_n < 2.9\times10^{-26}\ e\,\mathrm{cm} \qquad (9.153) $$

Let us pause and think of the sensitivity that the measurement represents. Crudely
speaking, the neutron is spatially spread to the size of the pion Compton wavelength
($\sim 10^{-13}$ cm). If the T violation effect is due to the weak interaction, it is
probably reasonable to assume that the relative strength of emission and reabsorption
of the W boson is $\sim (g_w^2/m_W^2)/(g_s^2/m_\pi^2)$. Here, $g_w$, $g_s$ are the strengths of the
weak and strong interactions. If we assume $g_w \sim g_s$, following the spirit of the
unified theory, then the strength of the EDM induced by the weak interaction is
expected to be roughly of the order $10^{-13}\times(m_\pi/m_W)^2 \sim 10^{-19}$ e cm. Since the
experimental value is 6 orders of magnitude smaller than the expectation, we can
consider the EDM of the neutron a good test bench for T-reversal invariance. At
present, the expected value of the Standard Model using the Kobayashi-Maskawa
model is $10^{-33}$ to $10^{-34}$ e cm. The probability of detecting a finite value of the EDM
is small as long as the Standard Model is correct. However, there are models that
predict a value just below the present experimental upper limit [2, 129], and an
improved experimental measurement is desired.
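The order-of-magnitude argument above amounts to one line of arithmetic, reproduced in the sketch below. The pion and W masses are approximate values that I supply as assumptions, and the function names are illustrative; only the order of magnitude is meaningful.

```python
import math


def weak_edm_estimate(size_cm=1e-13, m_pi_gev=0.1396, m_w_gev=80.4):
    """Crude EDM scale in e cm: (neutron size) x (m_pi / m_W)^2, assuming g_w ~ g_s."""
    return size_cm * (m_pi_gev / m_w_gev) ** 2


def orders_of_magnitude_below(estimate, limit):
    """How many powers of ten the experimental limit lies below the estimate."""
    return math.log10(estimate / limit)


if __name__ == "__main__":
    estimate = weak_edm_estimate()
    print(estimate)                                      # a few times 1e-19 e cm
    print(orders_of_magnitude_below(estimate, 2.9e-26))  # between six and seven
```

Changing the assumed size or mass ratio by factors of a few shifts the estimate accordingly, which is why the text quotes only the power of ten.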



9.3

Internal Symmetries

9.3.1 

U(l) Gauge Symmetry 

Conserved Charge 

We have already introduced the most fundamental result of the internal symmetry,
the conserved charge current Eq. (5.38), in Chap. 5:

$$ J^\mu = iq\left[\phi^\dagger(\partial^\mu\phi) - (\partial^\mu\phi^\dagger)\phi\right] \qquad (9.154) $$

This was the result of the Lagrangian's symmetry, namely invariance under the
phase transformation

$$ \phi \to e^{-iq\alpha}\phi, \qquad \phi^\dagger \to \phi^\dagger e^{iq\alpha} \qquad (9.155) $$

which generally holds when the Lagrangian is bilinear in complex fields, including
Dirac fields. The transformation Eq. (9.155) has nothing to do with the space-time
coordinates and is an example of internal symmetry, referred to as U(1) gauge symmetry
of the first kind.6) The space integral of the time component of Eq. (9.154) is
a conserved quantity and generically called a charge operator:

$$ Q = -iq\int d^3x \sum_r\left[\pi_r(x)\phi_r(x) - \pi_r^\dagger(x)\phi_r^\dagger(x)\right] \qquad (9.156a) $$

$$ = -iq\int d^3x\left[\dot\phi^\dagger\phi - \dot\phi\,\phi^\dagger\right] \qquad (9.156b) $$

The first expression is a more general form, while the second one is specific to the
complex Klein-Gordon field. It is easy to show that $Q$ satisfies the commutation
relations

$$ [Q, \phi] = -q\phi, \qquad [Q, \phi^\dagger] = q\phi^\dagger \qquad (9.157) $$

This means that $\phi$ and $\phi^\dagger$ are operators to decrease and increase the charge of the
state by $q$. Then using the BCH formulae Eq. (9.15),

$$ e^ABe^{-A} = B + [A,B] + \frac{1}{2!}[A,[A,B]] + \cdots + \frac{1}{n!}[A,[A,\cdots[A,B]]] + \cdots \qquad (9.158) $$

we obtain

$$ e^{i\alpha Q}\,\phi\,e^{-i\alpha Q} = e^{-iq\alpha}\phi \qquad (9.159) $$

Namely, the charge operator is the generator of the gauge transformation. In summary,
we rephrase Noether's theorem in quantum field theory. If there is a continuous
symmetry, i.e. a unitary transformation that keeps the Lagrangian invariant,
a corresponding conserved (Noether) current exists. The space integral of its 0th
component is a constant of time and is also the generator of the symmetry transformation.
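The commutation relation [Q, φ] = −qφ and the resulting transformation e^{iαQ} φ e^{−iαQ} = e^{−iqα} φ can be illustrated in a toy model. The sketch below keeps a single momentum mode, with φ replaced by a + b† and Q by q(N_a − N_b); this one-mode caricature and all function names are my own simplification, not the field-theoretic construction of the text. States are stored as dictionaries mapping occupation numbers (n_a, n_b) to amplitudes.

```python
import cmath
import math


def apply_a(state):
    """Annihilate a particle: a|n_a, n_b> = sqrt(n_a) |n_a - 1, n_b>."""
    out = {}
    for (n_a, n_b), amp in state.items():
        if n_a > 0:
            key = (n_a - 1, n_b)
            out[key] = out.get(key, 0) + math.sqrt(n_a) * amp
    return out


def apply_b_dagger(state):
    """Create an antiparticle: b^dagger|n_a, n_b> = sqrt(n_b + 1) |n_a, n_b + 1>."""
    out = {}
    for (n_a, n_b), amp in state.items():
        key = (n_a, n_b + 1)
        out[key] = out.get(key, 0) + math.sqrt(n_b + 1) * amp
    return out


def combine(s1, s2, c2=1):
    """Return the linear combination s1 + c2 * s2."""
    out = dict(s1)
    for key, amp in s2.items():
        out[key] = out.get(key, 0) + c2 * amp
    return out


def scale(c, state):
    return {key: c * amp for key, amp in state.items()}


def apply_phi(state):
    """One-mode stand-in for the field operator: phi = a + b^dagger."""
    return combine(apply_a(state), apply_b_dagger(state))


def apply_charge(state, q):
    """Q = q (N_a - N_b)."""
    return {key: q * (key[0] - key[1]) * amp for key, amp in state.items()}


def apply_gauge(state, alpha, q):
    """exp(i alpha Q), diagonal in the occupation-number basis."""
    return {key: cmath.exp(1j * alpha * q * (key[0] - key[1])) * amp
            for key, amp in state.items()}


def distance(s1, s2):
    keys = set(s1) | set(s2)
    return max((abs(s1.get(k, 0) - s2.get(k, 0)) for k in keys), default=0.0)


if __name__ == "__main__":
    q, alpha = 1.0, 0.37
    state = {(2, 1): 1.0, (0, 3): 0.5j}
    commutator = combine(apply_charge(apply_phi(state), q),
                         apply_phi(apply_charge(state, q)), -1)
    print(distance(commutator, scale(-q, apply_phi(state))))      # ~0: [Q, phi] = -q phi
    rotated = apply_gauge(apply_phi(apply_gauge(state, -alpha, q)), alpha, q)
    print(distance(rotated, scale(cmath.exp(-1j * q * alpha), apply_phi(state))))  # ~0
```

Both a and b† lower the charge by q, which is why φ as a whole acts as a charge-lowering operator and picks up the single phase e^{−iqα}.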

9.3.2 

Charge Conjugation 

Charge conjugation (CC for short or C transformation) is an operation to exchange
a particle and its antiparticle, or equivalently an operation to change the sign of
the charge $Q$.7) Examples of the C transformation are exchanges of an electron
and positron $e^- \leftrightarrow e^+$, proton and antiproton $p \leftrightarrow \bar p$, and $\pi^+ \leftrightarrow \pi^-$. The
photon and the neutral pion are not changed since they are charge neutral and
the particles are their own antiparticles. The charge conjugation operation deals
with variables in internal space, but because of the CPT theorem, it is inseparably
connected with PT, space-time symmetry. This is because the antiparticle can be
considered mathematically as a particle with negative energy-momentum traveling
backward in time.

6) It is also called a global symmetry. When the phase is a function of space-time, i.e. $\alpha = \alpha(x)$, it is
called a gauge transformation of the second kind or a local gauge transformation, which plays a
key role in generating the known fundamental forces (see Chapter 18).

7) [...] strangeness, hypercharge, etc.

Charge Conjugation of the Field Operators 

We learned in Chap. 4 that the charge conjugation operation generally involves
taking the complex (hermitian for the operator) conjugate of the wave function or
the field. This resulted from the fact that the interaction with the electromagnetic
field is obtained by the gauge principle, which means replacement of the derivative
by its covariant derivative

$$ \partial_\mu \to D_\mu = \partial_\mu + iqA_\mu \qquad (9.160) $$

Charge conjugation means essentially changing the sign of the charge, and the
above form suggests it involves complex conjugation. Let us see if this is true.
We start by considering the charge eigenstate

$$ Q|q,p,s_z\rangle = q\,|q,p,s_z\rangle \qquad (9.161) $$

By definition, charge conjugation means

$$ C|q,p,s_z\rangle = \eta_C\,|-q,p,s_z\rangle \qquad (9.162) $$

$\eta_C$ is a phase factor. Then

$$ QC|q,p,s_z\rangle = \eta_C\,Q|-q,p,s_z\rangle = -\eta_C\,q\,|-q,p,s_z\rangle $$
$$ CQ|q,p,s_z\rangle = q\,C|q,p,s_z\rangle = q\,\eta_C\,|-q,p,s_z\rangle $$

Therefore

$$ CQ = -QC \quad\text{or}\quad CQC^{-1} = -Q \qquad (9.163) $$

As a particle state is constructed from the creation operator $a^\dagger$ and its antiparticle
from $b^\dagger$, we define the charge conjugation operator $C$ with the properties

$$ Ca_qC^{-1} = \eta_C\,b_q, \qquad Cb_q^\dagger C^{-1} = \eta_C\,a_q^\dagger \qquad (9.164a) $$

$$ Cb_qC^{-1} = \eta_C^*\,a_q, \qquad Ca_q^\dagger C^{-1} = \eta_C^*\,b_q^\dagger \qquad (9.164b) $$

$$ CC^\dagger = C^\dagger C = 1, \qquad \eta_C\eta_C^* = 1 \qquad (9.164c) $$

Applying the charge conjugation twice brings the field back to its original form,
hence its eigenvalue is ±1 and is referred to as C parity. The definition Eq. (9.164)
of charge conjugation means the exchange $\phi \leftrightarrow \phi^\dagger$ for the Klein-Gordon field, as
we guessed from a simple argument. Namely

$$ C\phi C^{-1} = \eta_C\,\phi^\dagger, \qquad C\phi^\dagger C^{-1} = \eta_C^*\,\phi \qquad (9.165) $$

The expression for the charge operator Eq. (9.156) shows clearly that the interchange
$\phi \leftrightarrow \phi^\dagger$ changes the sign of the charge operator. With the above property,
it can easily be shown that the charge operator defined by Eq. (9.156) satisfies
Eq. (9.163).

Problem 9.7 

Prove that

$$ C = \exp\left[i\frac{\pi}{2}\sum_k\,(a_k^\dagger - b_k^\dagger)(a_k - b_k)\right] \qquad (9.166) $$

has the required properties for the charge conjugation operator.



Charge Conjugation of the Dirac Field 

For a field with spin, an additional operation is required to flip the spin component.
Namely, for the Dirac field,

$$ C\psi(x)C^{-1} = \eta_C\,C'\bar\psi^T(x) = \eta_C\,C'\gamma^0\psi^*(x) $$
$$ C\bar\psi(x)C^{-1} = -\eta_C^*\,\psi^T(x)\,C'^{-1} \qquad (9.167) $$

where $C'$ on the rhs is a 4 × 4 matrix that acts on the spin components as given in
Eq. (4.101). Using Eq. (9.167) and the property of the $C'$ matrix [see Eq. (4.98)],

$$ C'^{-1}\gamma^\mu C' = -(\gamma^\mu)^T \qquad (9.168) $$

the bilinear form of the Dirac operators is changed to

$$ \bar\psi_1\gamma^\mu\psi_2 \xrightarrow{C} \{-\eta_C^*\,\psi_1^TC'^{-1}\}\,\gamma^\mu\,\{\eta_C\,C'\bar\psi_2^T\} $$
$$ = \psi_1^T(\gamma^\mu)^T\bar\psi_2^T = -(\bar\psi_2\gamma^\mu\psi_1)^T $$
$$ = -\bar\psi_2\gamma^\mu\psi_1 = -(\bar\psi_1\gamma^\mu\psi_2)^\dagger \qquad (9.169) $$

The minus sign in the last equality of the second line comes from the anticommutativity
of the field and the removal of the transpose in the third line is allowed
because it is a 1 × 1 matrix. By setting $\psi_2 = \psi_1 = \psi$, we have a C-transformed
current

$$ j_C^\mu = C(q\bar\psi\gamma^\mu\psi)C^{-1} = -q\bar\psi\gamma^\mu\psi = -j^\mu \qquad (9.170) $$

In quantum mechanics, the change of the sign is a part of the definition and had
to be put in by hand, but it is automatic in field theory. Transformation properties
of other types of bilinear Dirac fields can also be obtained by using Eq. (9.168), see
Table 9.4.



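Table 9.4 Property of Dirac bilinears under C transformation.

      $S(t,x)$           $P(t,x)$           $V^\mu(t,x)$             $A^\mu(t,x)$            $T^{\mu\nu}(t,x)$
C     $S^\dagger(t,x)$   $P^\dagger(t,x)$   $-V^{\mu\dagger}(t,x)$   $A^{\mu\dagger}(t,x)$   $-T^{\mu\nu\dagger}(t,x)$

Note 1: $S = \bar{\psi}\psi$, $P = i\bar{\psi}\gamma^5\psi$, $V^\mu = \bar{\psi}\gamma^\mu\psi$, $A^\mu = \bar{\psi}\gamma^\mu\gamma^5\psi$, $T^{\mu\nu} = \bar{\psi}\sigma^{\mu\nu}\psi$
Note 2: $V^\mu = (V^0, \boldsymbol{V})$, $V_\mu = (V^0, -\boldsymbol{V})$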
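
The sign pattern of Table 9.4 can be checked numerically. The following is a minimal sketch, not the book's own construction: it assumes the Dirac representation of the gamma matrices and the common choice $C' = i\gamma^2\gamma^0$ for the 4 × 4 matrix (Eq. (4.101) is not reproduced in this section), and all helper names are hypothetical. A relation $C'^{-1}\Gamma C' = s\,\Gamma^T$ with $s = \pm 1$ corresponds to the bilinear going to $s\,\bar{\psi}_2\Gamma\psi_1$.

```python
# Numerical check of C'^{-1} Gamma C' = +/- Gamma^T for Dirac matrices.
# Assumptions (not from the text): Dirac representation, C' = i gamma^2 gamma^0.

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def transpose(a):
    return [list(row) for row in zip(*a)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def block(a, b, c, d):
    """Assemble a 4x4 matrix from the 2x2 blocks [[a, b], [c, d]]."""
    return [a[0] + b[0], a[1] + b[1], c[0] + d[0], c[1] + d[1]]

ZERO = [[0, 0], [0, 0]]
ONE = [[1, 0], [0, 1]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

# gamma^0 = diag(1, -1), gamma^k = [[0, sigma_k], [-sigma_k, 0]]
gammas = [block(ONE, ZERO, ZERO, scale(-1, ONE))]
gammas += [block(ZERO, s, scale(-1, s), ZERO) for s in SIGMA]
gamma5 = scale(1j, matmul(matmul(gammas[0], gammas[1]),
                          matmul(gammas[2], gammas[3])))

C = scale(1j, matmul(gammas[2], gammas[0]))
C_INV = scale(-1, C)  # valid because C * C = -1 for this choice
IDENTITY = block(ONE, ZERO, ZERO, ONE)

def c_conjugate(m):
    """Return C'^{-1} m C'."""
    return matmul(matmul(C_INV, m), C)

inverse_ok = close(matmul(C, C_INV), IDENTITY)
# Vector: C'^{-1} gamma^mu C' = -(gamma^mu)^T, Eq. (9.168)
vector_ok = all(close(c_conjugate(g), scale(-1, transpose(g))) for g in gammas)
# Pseudoscalar and axial vector: the sign is +1
pseudoscalar_ok = close(c_conjugate(gamma5), transpose(gamma5))
axial_ok = all(
    close(c_conjugate(matmul(g, gamma5)), transpose(matmul(g, gamma5)))
    for g in gammas
)

print(inverse_ok, vector_ok, pseudoscalar_ok, axial_ok)
```

With these assumptions the vector bilinear is C-odd while the pseudoscalar and axial-vector bilinears are C-even, in agreement with the corresponding columns of the table.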



Charge Conjugation of Vector Fields 

As the vector and the scalar field satisfy the same Klein-Gordon equation, the
charge conjugation operation transforms them as follows:



The phase factor $\eta_C$ cannot be determined for the free field. But if the field interacts,
for instance, with the Dirac field, a Lorentz-invariant interaction Lagrangian
has the form

$$\mathcal{L}_{\rm int} = f\bar{\psi}_1\psi_2\varphi + g\bar{\psi}_1\gamma^\mu\psi_2 V_\mu + (\text{h.c.}) \tag{9.172}$$

The last term (h.c.) is the hermitian conjugate of the preceding terms, and by its presence
the Lagrangian becomes hermitian. The requirement of C invariance for the
Lagrangian fixes the phase. Using Table 9.4 for the transformation of the Dirac
bilinears, we obtain

$$\varphi \to \varphi^\dagger, \quad V^\mu \to -V^{\mu\dagger} \tag{9.173a}$$

$$f = f^*, \quad g = g^* \tag{9.173b}$$

As we noted earlier, by C operation, in addition to the electric charge, the strange- 
ness, baryon number, lepton number and all other additive quantum numbers 
change their sign, and therefore the eigenstate of the charge conjugation has to 
be truly neutral in the sense that all the quantum numbers have to be zero. 

C Parity of the Photon The photon is a massless vector boson and is described by 
the Maxwell equation. 

$$\partial_\mu\partial^\mu A^\nu = qj^\nu \tag{9.174}$$

The interaction Lagrangian of the electromagnetic interaction is given by $\mathcal{L}_{\rm int} = -qj^\mu A_\mu$.
Since the electric current changes its sign by C transformation, $A_\mu$ must
also change sign to keep the Lagrangian invariant. Therefore, the C parity of the
photon is $-1$. This can also be derived from Eq. (9.173) when applied to a real
vector field. The C parity of an n-photon system is $(-1)^n$. This is independent of
both the orbital and the spin angular momentum.

Experimental Tests

C in the Strong Interaction: Let us consider a process 

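$$p + \bar{p} \to \pi^+(\pi^0) + X \quad \overset{C}{\Longleftrightarrow} \quad \bar{p} + p \to \pi^-(\pi^0) + X \tag{9.175}$$

If the strong interaction conserves the charge conjugation symmetry, the angular
distribution measured relative to the incoming antiprotons should exhibit a symmetry
between $\theta$ and $\pi - \theta$, as shown in Fig. 9.4 and in the following equations.
The energy spectrum should be identical in both cases, too:

$$\frac{d\sigma}{d\Omega}(\theta; \pi^\pm) = \frac{d\sigma}{d\Omega}(\pi - \theta; \pi^\mp)$$

$$\frac{d\sigma}{dE}(\pi^+) = \frac{d\sigma}{dE}(\pi^-) \tag{9.176c}$$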

These relations have been confirmed experimentally within the errors. 

C Parity of the Neutral Two-Particle System: A two-particle system consisting of a
particle and its antiparticle is neutral and can be a C eigenstate. We consider them
as identical particles in different states. If the orbital angular momentum and spin
of the system are $L$ and $S$, the C parity of the system is expressed as

$$C = (-1)^{L+S} \tag{9.177}$$

Proof: If they are fermions ($f\bar{f} = e^-e^+$, $p\bar{p}$, etc.) the wave function changes sign under
particle exchange because of Fermi statistics. The exchange consists of that of the
charge (C), space coordinates and spin coordinates. Therefore

$$-1 = C(-1)^L(-1)^{S+1} \tag{9.178}$$

Here, we have used the fact that the spin wave function is symmetric when S = 1 
and antisymmetric when S = 0. The case for the boson system can be proved 
similarly. □ 
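
Combining Eq. (9.177) with the $(-1)^n$ rule for an n-photon system gives a quick selection rule for annihilation into photons. The sketch below is only an illustration of that bookkeeping; the function names are hypothetical, and the restriction to at least two photons (a single photon cannot conserve energy and momentum) is an added assumption rather than something stated above.

```python
def c_parity_pair(L, S):
    """C parity of a particle-antiparticle pair, Eq. (9.177)."""
    return (-1) ** (L + S)

def c_parity_photons(n):
    """C parity of an n-photon system."""
    return (-1) ** n

def allowed_photon_counts(L, S, max_photons=4):
    """Photon multiplicities compatible with C conservation.

    Assumption: n starts at 2, since one-photon annihilation is excluded
    by energy-momentum conservation.
    """
    c = c_parity_pair(L, S)
    return [n for n in range(2, max_photons + 1) if c_parity_photons(n) == c]

# An L = 0 fermion-antifermion pair with S = 0 or S = 1
print(allowed_photon_counts(0, 0))
print(allowed_photon_counts(0, 1))
```

For $L = 0$ the spin-singlet state has $C = +1$ and the spin-triplet state has $C = -1$, so C conservation ties them to even and odd photon multiplicities, respectively.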



Figure 9.4 (a) $p + \bar{p} \to \pi^+ + X$, (b) $\bar{p} + p \to \pi^- + X$. If C is conserved,
$d\sigma(p + \bar{p} \to \pi^+) = d\sigma(\bar{p} + p \to \pi^-)$, where $d\sigma$ stands for either $d\sigma/d\Omega$ or $d\sigma/dE$. If the pion angle is
measured relative to the incident antiprotons, $d\sigma/d\Omega\,(\theta) = d\sigma/d\Omega\,(\pi - \theta)$.



Problem 9.8

The $\rho^0$ meson is a vector particle with $J^P = 1^-$ and decays into $\pi^+ + \pi^-$. Prove
it cannot decay into $2\pi^0$.

C in Electromagnetic Interactions: The most stringent test of C symmetry in electromagnetic
interactions comes from the nonexistence of the $\pi^0 \to 3\gamma$ decay [311]:

$$\frac{\Gamma(\pi^0 \to 3\gamma)}{\Gamma_{\rm total}} < 3.1 \times 10^{-8} \quad 90\%\ \text{CL (CL = confidence level)} \tag{9.179}$$

The following decays are known to occur via the electromagnetic interaction 
through their strength (the decay rate). Asymmetry tests are not as stringent as 
that of decay branching ratios, but we list them here as another test. As $\eta(548)$
is a neutral pseudoscalar meson having $J^P = 0^-$ and mass of 548 MeV, it changes to itself by
C operation and so do $\pi^0$ and $\gamma$. Since $\pi^+$ and $\pi^-$ are interchanged, the energy
spectra of $\pi^+$ and $\pi^-$ in the decays

$$\eta \to \pi^+ + \pi^- + \pi^0 \tag{9.180a}$$

$$\eta \to \pi^+ + \pi^- + \gamma \tag{9.180b}$$

should be identical. Experimentally, the asymmetry is less than 0.1%.

As $\pi^0$, $\eta$ decay into $2\gamma$, their C parity is positive and they cannot decay into $3\gamma$.
Conversely, if a particle decays into $3\gamma$ or $\pi^0\gamma$, $\eta\gamma$ its C parity is negative.

Problem 9.9 

Positronium is a bound state of an electron and a positron bound by the
Coulomb force. For the $L = 0$ ground state, there are two states, with $S = 0, 1$.
Show that the $S = 0$ state decays into $2\gamma$ and the $S = 1$ state into $3\gamma$, but not vice
versa.

C Violation in Weak Interactions: Next we consider weak interactions, for example 

$$\mu^\mp \to e^\mp + \nu + \bar{\nu} \tag{9.181}$$

If C symmetry is respected, the helicity of the electron and that of the positron 
should be the same provided other conditions are equal. The helicity can be mea- 
sured first by making the electron emit photons by bremsstrahlung and then let- 
ting the photons pass through magnetized iron. The transmissivity of the photon 
depends on its polarization, which in turn depends on the electron helicity. The 
result has shown that $h(e^-) \approx -1$, $h(e^+) \approx +1$ [111, 267]. Therefore charge conjugation
symmetry is almost 100% broken in weak interactions. The origin lies in
the Hamiltonian. The parity-violating weak interaction Hamiltonian was given in 
Eq. (9.92). If we take into account the C transformation property of the axial vector 
(see Table 9.4) we have 

$$\bar{\psi}_1\gamma^\mu\gamma^5\psi_2 \xrightarrow{\ C\ } \bar{\psi}_2\gamma^\mu\gamma^5\psi_1 \tag{9.182}$$

This and Eq. (9.173) mean under C transformation

$$C_V \to C_V^*, \quad C_V' \to -C_V'^*$$

$$C_A \to C_A^*, \quad C_A' \to -C_A'^* \tag{9.183}$$

Namely, if C invariance holds, $C_V$, $C_A$ have to be real and $C_V'$, $C_A'$ have to be purely
imaginary. But the decay asymmetry of a polarized nucleus that was discussed
following Eq. (9.93) has shown $C_A' = -C_A$ = real. Therefore, C is also maximally
violated. But the combined CP symmetry does not change the V-A Hamiltonian of
Eq. (9.94) and is conserved.

The chirality operator $(1 - \gamma^5)$ in front of the lepton field operator in Eq. (9.94)
is the origin of $h(e^-) \approx h(\nu_e) = -1$, $h(e^+) \approx h(\bar{\nu}_e) = +1$ as we discussed in
Sect. 4.3.5. The latter is typical of the weak interaction where the weak boson W
couples only to the left-handed field (i.e. $(1 - \gamma^5)\psi$). This will be discussed in detail
in Chap. 15.

The V-A interaction Eq. (9.94) respects T invariance as well as CP invariance. In 
principle, CP invariance and T invariance are independent. However, the combined 
CPT invariance is rooted deeply in the structure of quantum field theory and is the 
subject of the next section. 

As long as CPT invariance holds, CP violation means T violation. There is ev- 
idence that small CP violation exists in the kaon and B-meson decays, hence the 
same amount of T violation is expected to exist. In fact, a T-violation effect consistent
with CPT invariance has been observed in the neutral K meson decays [107],
which will be described in detail in Chap. 16. 

9.3.3 

CPT Theorem 

CPT is a combined transformation of C, P and T. All the coordinates are inverted 
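$(t, \boldsymbol{x}) \to (-t, -\boldsymbol{x})$ and the operation is antiunitary because of the T transformation.
All c-numbers are changed to their complex conjugates. Writing the combined operator
as $\Theta$, its action is summarized as

$$\begin{aligned}
(t, \boldsymbol{x}) &\to (-t, -\boldsymbol{x}) \\
\text{c-numbers} &\to (\text{c-numbers})^* \\
\langle \text{out}|H|\text{in}\rangle &\to \langle \Theta\,\text{in}|H|\Theta\,\text{out}\rangle \\
\Theta|q, \boldsymbol{p}, \sigma\rangle &\to |\bar{q}, \boldsymbol{p}, -\sigma\rangle
\end{aligned} \tag{9.184}$$

The CPT combined transformation property of the Dirac bilinears is given by

$$\bar{\psi}_1\Gamma\psi_2 \xrightarrow{\ CPT\ } \begin{cases} \bar{\psi}_2\Gamma\psi_1 & \Gamma = S, P, T \\ -\bar{\psi}_2\Gamma\psi_1 & \Gamma = V, A \end{cases} \tag{9.185}$$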

independent of the order of the C, P, T operations. If we take the hermitian conjugate
of the bilinears, the original form is recovered except for the sign of the coordinates.
Transformation properties of the scalar, pseudoscalar, vector fields, etc.
are shown to be the same as the corresponding Dirac bilinears. It follows that any
Lorentz scalar or second-rank tensor made of any number of fields is brought
back to its original form by a combined operation of (CPT + hermitian conjugate).
Therefore the Lagrangian density (a scalar) and the Hamiltonian density (a
second-rank tensor) transform under CPT as

$$\mathcal{L}(t, \boldsymbol{x}) \xrightarrow{\ CPT\ } \mathcal{L}^\dagger(-t, -\boldsymbol{x}) \tag{9.186a}$$

$$\mathcal{H}(t, \boldsymbol{x}) \xrightarrow{\ CPT\ } \mathcal{H}^\dagger(-t, -\boldsymbol{x}) \tag{9.186b}$$

Since $\mathcal{L}$ and $\mathcal{H}$ are hermitian operators and the action is their integral over all
space-time, we conclude that the equations of motion are invariant under CPT
transformation. The Hamiltonian $H = \int d^3x\,\mathcal{H}$ changes the sign of time t, but as
H is a conserved quantity and does not depend on time, the transformed Hamilto- 
nian is the same as before it is transformed. As the scattering matrix is composed 
of the Hamiltonian, it is invariant, too. Unlike individual symmetry of C, P or T, 
the combined CPT invariance theorem can be derived with a few fundamental as- 
sumptions, such as the validity of Lorentz invariance, the local quantum field theo- 
ry and the hermitian Hamiltonian. The violation of CPT means violation of either 
Lorentz invariance or quantum mechanics. 8) Theoretically it has firm foundations.
Experimental evidence of CPT invariance is very strong, too. 

We list a few examples of the CPT predictions: 

(1) The mass of a particle and its antiparticle is the same. 

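Proof: Let H be the CPT-invariant Hamiltonian. As the mass is an eigenvalue of
the Hamiltonian in the particle's rest frame

$$\begin{aligned}
m &= \langle q, \sigma|H|q, \sigma\rangle = \langle q, \sigma|\Theta^{-1}\Theta H\Theta^{-1}\Theta|q, \sigma\rangle = \langle \Theta\,q, \sigma|H|\Theta\,q, \sigma\rangle^* \\
&= \langle \bar{q}, -\sigma|H|\bar{q}, -\sigma\rangle
\end{aligned} \tag{9.187}$$

The spin orientation of $|\bar{q}, -\sigma\rangle$ is different from $|q, \sigma\rangle$, but the mass does not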
depend on the spin orientation (see Poincare group specification for a particle state 
in Sect. 3.6). Therefore the masses of the particle and antiparticle are the same. 



(2) The magnetic moment of a particle and that of its antiparticle are the same in
magnitude, but the sign is reversed.

(3) The lifetime of a particle and its antiparticle is the same. 

Proof: Writing the S-matrix as 

$$S_{\beta\alpha} = \delta_{\beta\alpha} + i2\pi\delta(M_\alpha - E_f)\langle \beta; \text{out}|T|\alpha; \text{out}\rangle \tag{9.188}$$
8) It is pointed out that CPT might be violated in quantum gravity [130, 210].

Table 9.5 Summary of transformation properties under C, P and T.

       $S(t,x)$             $P(t,x)$              $V^\mu(t,x)$               $A^\mu(t,x)$               $T^{\mu\nu}(t,x)$
P      $S(t,-x)$            $-P(t,-x)$            $V_\mu(t,-x)$              $-A_\mu(t,-x)$             $T_{\mu\nu}(t,-x)$
C      $S^\dagger(t,x)$     $P^\dagger(t,x)$      $-V^{\mu\dagger}(t,x)$     $A^{\mu\dagger}(t,x)$      $-T^{\mu\nu\dagger}(t,x)$
T      $S(-t,x)$            $-P(-t,x)$            $V_\mu(-t,x)$              $A_\mu(-t,x)$              $-T_{\mu\nu}(-t,x)$
CP     $S^\dagger(t,-x)$    $-P^\dagger(t,-x)$    $-V_\mu^\dagger(t,-x)$     $-A_\mu^\dagger(t,-x)$     $-T_{\mu\nu}^\dagger(t,-x)$
CPT    $S^\dagger(-t,-x)$   $P^\dagger(-t,-x)$    $-V^{\mu\dagger}(-t,-x)$   $-A^{\mu\dagger}(-t,-x)$   $T^{\mu\nu\dagger}(-t,-x)$

Note 1: $S = \bar{\psi}\psi$, $P = i\bar{\psi}\gamma^5\psi$, $V^\mu = \bar{\psi}\gamma^\mu\psi$, $A^\mu = \bar{\psi}\gamma^\mu\gamma^5\psi$, $T^{\mu\nu} = \bar{\psi}\sigma^{\mu\nu}\psi$

Note 2: The differential operator $\partial_\mu$ transforms exactly the same as the vector field $V_\mu$ except it does
not change sign under C-operation.

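the total decay rate of $\alpha$ is given by

$$\begin{aligned}
\Gamma(\alpha, q_\alpha) &= 2\pi\sum_{\beta, q_\beta}\delta(M_\alpha - E_f)\,|\langle \beta, q_\beta; \text{out}|T|\alpha, q_\alpha; \text{out}\rangle|^2 \\
&= 2\pi\sum_{\beta, q_\beta}\delta(M_\alpha - E_f)\,|\langle \beta, q_\beta; \text{out}|\Theta^{-1}\Theta T\Theta^{-1}\Theta|\alpha, q_\alpha; \text{out}\rangle|^2 \\
&= 2\pi\sum_{\beta, q_\beta}\delta(M_{\bar{\alpha}} - E_f)\,|\langle \bar{\beta}, \bar{q}_\beta; \text{in}|T|\bar{\alpha}, \bar{q}_\alpha; \text{in}\rangle^*|^2 \\
&= \Gamma(\bar{\alpha}, \bar{q}_\alpha)
\end{aligned} \tag{9.189}$$

where $q_\alpha$, $q_\beta$ denote quantum numbers other than particle species. We have used
the fact that for a one-particle state, $|\bar{\alpha}, \bar{q}_\alpha; \text{in}\rangle = |\bar{\alpha}, \bar{q}_\alpha; \text{out}\rangle$, $M_\alpha = M_{\bar{\alpha}}$ and that
both "in" and "out" states form complete sets of states, i.e.

$$\sum_{\beta, q_\beta}|\beta, q_\beta; \text{in}\rangle\langle \beta, q_\beta; \text{in}| = \sum_{\beta, q_\beta}|\beta, q_\beta; \text{out}\rangle\langle \beta, q_\beta; \text{out}| = 1 \tag{9.190}$$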



Finally, in Table 9.5 we give a summary list of the transformation properties of C, 
P and T. 

9.3.4 

SU(2) (Isospin) Symmetry 

Isospin Multiplets 

Among hadrons there are many small groups in which members having different
electric charge share some common properties. For instance, in the following
example members have almost the same mass values (given in units of MeV) within
each group:

               $p$   938.272     $\pi^+$  139.570     $K^+$  493.68     $\Sigma^+$  1189.37
                                 $\pi^0$  134.977                       $\Sigma^0$  1192.64
               $n$   939.565     $\pi^-$  139.570     $K^0$  497.65     $\Sigma^-$  1197.45

$\delta m/m$   0.14%             3.3%                 0.8%              0.33%

(9.191)

In addition, it is known that the strength of the interaction is almost the same with- 
in the accuracy of a few percent. Because they have different electric charges, the 
difference can be ascribed to the electromagnetic interaction, which is weaker by 
$\sim O(\alpha) \approx 1/137$. Therefore, if we neglect the small mass difference, there is no
distinction among the members of the groups as far as the strong interaction is 
concerned. They constitute multiplets but are degenerate. In analogy to spin multi- 
plets, which are degenerate under a central force, we call them "isospin multiplets". 
Just as the degeneracy of the spin multiplets is resolved by a magnetic field (the 
Zeeman effect), that of the isospin multiplets is resolved by turning on the electro- 
magnetic force. We conceive of an abstract space (referred to as internal space in 
contrast to external or real space) and consider the strong interaction as a central 
force with the electromagnetic force violating the rotational invariance just like the 
magnetic field in external space. If we identify the isospin as the equivalent of spin 
in real space, the mathematical structure of isospin is exactly the same as that of 
spin, which is the origin of the name. 

Charge Independence 

Historically, the concept of isospin was first proposed by Heisenberg in 1932 to 
consider the proton and the neutron as two different states of the same particle 
(nucleon) when he recognized the fact that the energy levels of mirror nuclei are 
very similar. Consider the potentials between the nucleons V. When 

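$$V_{pp} = V_{nn} \tag{9.192}$$

we call it charge symmetry and when

$$V_{pp} = V_{nn} = V_{pn} \tag{9.193}$$

we call it charge independence. To formulate the mathematics of the charge independence,
we consider transformation of the proton wave function $\psi_p$ and that of
the neutron $\psi_n$ and define the doublet function $\psi$ by

$$\psi \equiv \begin{bmatrix} \psi_p \\ \psi_n \end{bmatrix}, \quad \psi \to \psi' = U\psi \tag{9.194}$$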

Charge independence means the expectation value of the potential does not change
under the transformation U. Conservation of probability requires U to be a unitary
matrix:

$$UU^\dagger = U^\dagger U = 1 \tag{9.195}$$

Under this restriction, U should have the form

$$U = e^{i\phi}\begin{bmatrix} \alpha & \beta \\ -\beta^* & \alpha^* \end{bmatrix}, \quad |\alpha|^2 + |\beta|^2 = 1 \tag{9.196}$$

If we further require U to be unimodular (det U = 1), we have $\phi = 0$. Generally,
any transformation that keeps the norm of N complex numbers invariant
makes a group that is called a unitary group U(N), and when det U = 1
it is called a special unitary group SU(N). An SU(N) matrix can be expressed as



$$U = \exp\left(-\frac{i}{2}\sum_i \lambda_i\alpha_i\right) \tag{9.198}$$



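where $\lambda_i$ are traceless hermitian matrices (Appendix G) called generators of the
SU(N) transformation. For N = 2, there are three traceless hermitian matrices
and we can adopt the Pauli matrices for them:

$$\tau_1 = \begin{bmatrix} 0 & 1 \\ 1 & 0 \end{bmatrix}, \quad \tau_2 = \begin{bmatrix} 0 & -i \\ i & 0 \end{bmatrix}, \quad \tau_3 = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix} \tag{9.199}$$

Since $\boldsymbol{I} = \boldsymbol{\tau}/2$ satisfies the same commutation relations

$$[I_i, I_j] = i\epsilon_{ijk}I_k \tag{9.200}$$

as the angular momentum, the SU(2) transformation is mathematically equivalent
(homomorphic) to the rotational operation. The difference is that the operand is not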
a spin state in real space, but a multiplet consisting of particles of roughly the 
same mass as those in Eq. (9.191). Namely, the act of rotation is performed not in 
a real space but in a kind of abstract space (referred to as isospin space) and the 
state vector $\psi$ denotes $p$ when its direction is upward and $n$ when downward. Such
symmetry in the internal space is called an internal symmetry. External symmetries
refer to those in a real space such as spin and parity. If we denote the difference
of the states like $p$ or $n$ by a subscript $r$ as $\varphi_r(x)$, the internal symmetry operation
acts on $r$, while the external symmetry acts on the space-time coordinate $x$ and legs
(components) of Lorentz tensors. They are generally independent operations, but
sometimes connected like CPT.

As the mathematics of SU(2) is the same as that of angular momentum, it is
convenient to express various physical quantities using the same terminology as
used in space rotation. For instance, $p$ and $n$ make a doublet; we say the nucleon
has isospin $I = 1/2$. The three kinds of pions $\pi^\pm$, $\pi^0$ constitute a triplet, or a vector
in isospace, and have $I = 1$. The term "charge independence" of the nuclear force
means the interaction between the nucleon and the pion is rotationally invariant in
isospin space, namely the interaction Hamiltonian is an isoscalar.
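
The algebraic statements of this subsection are easy to verify numerically. The following minimal sketch checks that $\boldsymbol{I} = \boldsymbol{\tau}/2$ satisfies Eq. (9.200) and that a matrix of the form of Eq. (9.196) with $\phi = 0$ is unitary with unit determinant. The numerical values chosen for $\alpha$ and $\beta$ are arbitrary, and the helper names are hypothetical.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def dagger(a):
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def commutator(a, b):
    return add(matmul(a, b), scale(-1, matmul(b, a)))

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

TAU = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]
ISO = [scale(0.5, t) for t in TAU]  # I_i = tau_i / 2

def algebra_holds(gen):
    """Check [G_i, G_j] = i eps_ijk G_k for three generators."""
    n = len(gen[0])
    for i in range(3):
        for j in range(3):
            rhs = [[0] * n for _ in range(n)]
            for k in range(3):
                rhs = add(rhs, scale(1j * levi_civita(i, j, k), gen[k]))
            if not close(commutator(gen[i], gen[j]), rhs):
                return False
    return True

def su2_matrix(alpha, beta):
    """Matrix of Eq. (9.196) with phi = 0; requires |alpha|^2 + |beta|^2 = 1."""
    return [[alpha, beta], [-beta.conjugate(), alpha.conjugate()]]

alpha = math.cos(0.3) * cmath.exp(0.7j)   # arbitrary example values
beta = math.sin(0.3) * cmath.exp(-1.1j)
U = su2_matrix(alpha, beta)
det_U = U[0][0] * U[1][1] - U[0][1] * U[1][0]
is_unitary = close(matmul(U, dagger(U)), [[1, 0], [0, 1]])

print(algebra_holds(ISO), is_unitary, abs(det_U - 1) < 1e-12)
```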



Conserved Isospin Current

We consider the proton and the neutron as the same particle, but they have different
projections in the abstract isospin space and constitute a doublet, called an
isospinor:

$$\psi = \begin{bmatrix} p \\ n \end{bmatrix} \tag{9.201}$$

Here we have adopted the notation $p$, $n$ to denote $\psi_p$, $\psi_n$. The Lagrangian density
is written as

$$\begin{aligned}
\mathcal{L} &= \bar{\psi}(i\gamma^\mu\partial_\mu - m)\psi \\
&= \bar{p}(i\gamma^\mu\partial_\mu - m)p + \bar{n}(i\gamma^\mu\partial_\mu - m)n
\end{aligned} \tag{9.202}$$

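They should have the same mass because of the isospin symmetry. Note, $p$ is a
spinor with four components in real space, and $\psi$ a two-component spinor in
isospin space. Therefore, $\psi$ has 8 independent components. In exactly the same
manner as a rotation in real space, a rotation in isospin space can be carried out by
using a unitary operator

$$U = e^{-i\sum_j \alpha_j I_j} = e^{-i\boldsymbol{\alpha}\cdot\boldsymbol{\tau}/2} \tag{9.203}$$

where $\boldsymbol{I} = (I_1, I_2, I_3)$ satisfies Eq. (9.200). Let us consider transformations defined by

$$\psi \to \psi' = e^{-\frac{i}{2}\boldsymbol{\tau}\cdot\boldsymbol{\alpha}}\psi \tag{9.204a}$$

$$\bar{\psi} \to \bar{\psi}' = \bar{\psi}e^{\frac{i}{2}\boldsymbol{\tau}\cdot\boldsymbol{\alpha}} \quad \text{where} \quad \bar{\psi} = (\bar{p}, \bar{n}) \tag{9.204b}$$

where $\boldsymbol{\alpha} = (\alpha_1, \alpha_2, \alpha_3)$ is a set of three independent but constant variables. Since
$\tau_i$, $\alpha_i$ are independent of space-time, the transformation does not change any
space-time structure of the spinors $p$, $n$, and the Lagrangian density is kept invariant
under the continuous isospin rotation. Equations (9.204) have exactly the
same form as the U(1) phase transformation of Eq. (9.155) and are called the (global)
SU(2) gauge transformation. Now consider an infinitesimal rotation

$$\psi \to \psi + \delta\psi, \quad \bar{\psi} \to \bar{\psi} + \delta\bar{\psi}$$

$$\delta\psi = -\frac{i}{2}\tau_i\epsilon_i\psi = -\frac{i}{2}[\tau_i]_{ab}\psi_b\epsilon_i, \quad \delta\bar{\psi} = \frac{i}{2}\bar{\psi}\tau_i\epsilon_i = \frac{i}{2}\bar{\psi}_a[\tau_i]_{ab}\epsilon_i \tag{9.205}$$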

Applying Noether's formula Eq. (5.32) to the transformation, we obtain 

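$$\begin{aligned}
J^\mu &= \frac{\delta\mathcal{L}}{\delta(\partial_\mu\phi_r)}\delta\phi_r \\
&= \frac{\delta\mathcal{L}}{\delta(\partial_\mu\psi)}\delta\psi + \delta\bar{\psi}\frac{\delta\mathcal{L}}{\delta(\partial_\mu\bar{\psi})}
\end{aligned} \tag{9.206}$$

The second term in the second line vanishes because there is no $\partial_\mu\bar{\psi}$ in the Lagrangian.
Since $\epsilon_i$ is an arbitrary constant, Eq. (9.206) defines a conserved current
referred to as isospin current:

$$(I^\mu)_i = \bar{\psi}\gamma^\mu\frac{\tau_i}{2}\psi \tag{9.207}$$

It is easy to check that an isospin operator defined by

$$I_i = \int d^3x\,\psi^\dagger\frac{\tau_i}{2}\psi \tag{9.208}$$

satisfies the relations

$$[I_i, \psi] = -\frac{\tau_i}{2}\psi, \quad [I_i, \psi^\dagger] = \psi^\dagger\frac{\tau_i}{2}, \quad e^{i\boldsymbol{\alpha}\cdot\boldsymbol{I}}\psi e^{-i\boldsymbol{\alpha}\cdot\boldsymbol{I}} = e^{-i\boldsymbol{\alpha}\cdot\boldsymbol{\tau}/2}\psi \tag{9.209}$$

Therefore the operator $\boldsymbol{I}$ is the generator of the isospin rotation which appeared in
Eq. (9.203). The isospin current differs from the Dirac current only in the presence
of the isospin operator $\boldsymbol{\tau}/2$ sandwiched between the isospinors $\bar{\psi}$ and $\psi$.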

In a similar vein, when we have three identical real Klein-Gordon fields $\boldsymbol{\phi} = (\phi_1, \phi_2, \phi_3)$,
we can also establish the rotational invariance of $\boldsymbol{\phi}\cdot\boldsymbol{\phi} = \phi_1^2 + \phi_2^2 + \phi_3^2$
and that of the Lagrangian, which can be expressed as

$$\mathcal{L} = \partial_\mu\boldsymbol{\phi}^\dagger\cdot\partial^\mu\boldsymbol{\phi} - m^2\boldsymbol{\phi}^\dagger\cdot\boldsymbol{\phi} \tag{9.210}$$

leading to the existence of conserved isospin angular momentum, for which $\boldsymbol{\phi}$ is
an $I = 1$ triplet. Since the pions have spin 0 and constitute a triplet of almost the
same mass, we can identify them with the field $\boldsymbol{\phi}$. 9) In analogy to Eq. (9.208), it
can be expressed as

(9.211a) 



'/"•"" 



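$$t_1 = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & -i \\ 0 & i & 0 \end{bmatrix}, \quad t_2 = \begin{bmatrix} 0 & 0 & i \\ 0 & 0 & 0 \\ -i & 0 & 0 \end{bmatrix}, \quad t_3 = \begin{bmatrix} 0 & -i & 0 \\ i & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix} \tag{9.211b}$$

However, to identify $(|\pi^+\rangle, |\pi^0\rangle, |\pi^-\rangle)$ as the $I_3 = (+1, 0, -1)$ state, it is more
convenient to use a slightly different representation:

$$t_1' = \frac{1}{\sqrt{2}}\begin{bmatrix} 0 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{bmatrix}, \quad t_2' = \frac{1}{\sqrt{2}}\begin{bmatrix} 0 & -i & 0 \\ i & 0 & -i \\ 0 & i & 0 \end{bmatrix}, \quad t_3' = \begin{bmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{bmatrix} \tag{9.212}$$
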
9) Note, the pion field is an $I = 1$ triplet representation of the SU(2) group. If the fundamental
representation with $I = 1/2$ does not exist, we may have to consider the pion as the fundamental
representation of the SO(3) group, as the $\pi_i$'s are real fields.



Problem 9.10 

Prove that $I$, $I'$ satisfy

$$[I_i, I_j] = i\epsilon_{ijk}I_k, \quad [I_i', I_j'] = i\epsilon_{ijk}I_k' \tag{9.213a}$$

$$[I_i, \phi_j] = i\epsilon_{ijk}\phi_k, \quad [I_i', \phi_j] = i\epsilon_{ijk}\phi_k \tag{9.213b}$$
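
Before attempting an analytic proof, the matrix part of these commutation relations can be checked numerically. The sketch below assumes the two standard 3 × 3 representations, the Cartesian one with $(t_i)_{jk} = -i\epsilon_{ijk}$ and the spherical one in which the third generator is diagonal with entries $(+1, 0, -1)$; it tests only the matrices, not the field operators of Eq. (9.213b), and the names are hypothetical.

```python
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(c, a):
    return [[c * x for x in row] for row in a]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices in {0, 1, 2}."""
    return (i - j) * (j - k) * (k - i) // 2

# Cartesian (adjoint) representation: (t_i)_{jk} = -i eps_{ijk}
T_CARTESIAN = [
    [[-1j * levi_civita(i, j, k) for k in range(3)] for j in range(3)]
    for i in range(3)
]

# Spherical representation with a diagonal third generator
R = 1 / math.sqrt(2)
T_SPHERICAL = [
    [[0, R, 0], [R, 0, R], [0, R, 0]],
    [[0, -1j * R, 0], [1j * R, 0, -1j * R], [0, 1j * R, 0]],
    [[1, 0, 0], [0, 0, 0], [0, 0, -1]],
]

def satisfies_su2(gen):
    """Check [G_i, G_j] = i eps_ijk G_k."""
    for i in range(3):
        for j in range(3):
            lhs = add(matmul(gen[i], gen[j]), scale(-1, matmul(gen[j], gen[i])))
            rhs = [[0, 0, 0] for _ in range(3)]
            for k in range(3):
                rhs = add(rhs, scale(1j * levi_civita(i, j, k), gen[k]))
            if not close(lhs, rhs):
                return False
    return True

print(satisfies_su2(T_CARTESIAN), satisfies_su2(T_SPHERICAL))
```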



Problem 9.11 
Prove 



V2 



This equation can be used to define 
0i ± i 



03 = |«°) (9.215) 



The phase a p was chosen to produce 

$$I_\pm|I = 1, I_3 = 0\rangle = \sqrt{2}\,|1, \pm 1\rangle \tag{9.216}$$

which is a special case of the usual convention adopted from the angular momentum
operator:

$$I_\pm|I, I_3\rangle = [(I \mp I_3)(I \pm I_3 + 1)]^{1/2}\,|I, I_3 \pm 1\rangle \tag{9.217}$$

The real nucleon and the pion do not have exactly the same mass within the multiplet.
The isospin rotation is not an exact symmetry, but as long as the difference is
small we can treat the isospin as a conserved quantity.

Isospin of the Antiparticle 

In order to find the isospin of the antiparticle, we apply the charge conjugation op- 
eration to Eq. (9.194). Then the field represents the antiparticle. As the C operation 
contains complex conjugation, the transformation matrix becomes 

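$$\bar{p}' = \alpha^*\bar{p} + \beta^*\bar{n} \tag{9.218a}$$

$$\bar{n}' = -\beta\bar{p} + \alpha\bar{n} \tag{9.218b}$$

Here, $\bar{p}$, $\bar{n}$ denote the fields of the antiparticles of $p$, $n$. In general, if the symmetry
transformation matrix is $U$, that of the antiparticle is given by its complex conjugate
$U^*$. Rearranging Eq. (9.218), we obtain

$$(-\bar{n})' = \alpha(-\bar{n}) + \beta\bar{p} \tag{9.219a}$$

$$\bar{p}' = -\beta^*(-\bar{n}) + \alpha^*\bar{p} \tag{9.219b}$$

or, in matrix form,

$$\begin{bmatrix} -\bar{n} \\ \bar{p} \end{bmatrix}' = \begin{bmatrix} \alpha & \beta \\ -\beta^* & \alpha^* \end{bmatrix}\begin{bmatrix} -\bar{n} \\ \bar{p} \end{bmatrix}$$
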



The equation means that if $(p, n)$ is an isodoublet, $(-\bar{n}, \bar{p})$ is also an isodoublet that
transforms in the same way. 
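
The rearrangement above amounts to the matrix identity $M U^* M^{-1} = U$ with $M = \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix}$, where $M$ maps $(\bar{p}, \bar{n})$ to $(-\bar{n}, \bar{p})$. The following minimal sketch checks it for one arbitrarily chosen SU(2) matrix; the numerical values and helper names are assumptions made for illustration.

```python
import cmath
import math

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def conj(a):
    return [[complex(x).conjugate() for x in row] for row in a]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

alpha = math.cos(0.4) * cmath.exp(0.2j)   # arbitrary, |alpha|^2 + |beta|^2 = 1
beta = math.sin(0.4) * cmath.exp(1.5j)
U = [[alpha, beta], [-beta.conjugate(), alpha.conjugate()]]

# (pbar, nbar) transforms with U*; M reorders it into (-nbar, pbar).
M = [[0, -1], [1, 0]]
M_INV = [[0, 1], [-1, 0]]

U_reordered = matmul(matmul(M, conj(U)), M_INV)
same_rule = close(U_reordered, U)
print(same_rule)
```

The check succeeds for any $\alpha$, $\beta$ on the unit sphere, which is the matrix statement that the doublet and its conjugate are equivalent representations of SU(2).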

Charge Symmetry We mentioned charge symmetry in Eq. (9.192). It can be defined
more explicitly as the rotation in isospin space by $\pi$ about the y-axis. For a doublet
its operation is

$$e^{-i\pi I_2} = e^{-i(\pi/2)\tau_2} = -i\tau_2 = \begin{bmatrix} 0 & -1 \\ 1 & 0 \end{bmatrix}$$



10 

Path Integral: Basics 



The path integral is now widely used in many fields wherever quantum mechanical 
treatments are necessary, including quantum field theory and statistical mechan- 
ics. In solving the simple problems that are treated in this book, it is hardly a more 
powerful tool than the ordinary canonical formalism. In fact, when ambiguities 
arise in the path integral formulation, they are usually resolved by comparing it 
with the canonical quantization formalism or Schrodinger equation. But because 
of its powerful concept and widespread use, acquaintance with the method is in- 
dispensable. Feynman's path integral expresses a particle's dynamical propagation 
as a coherent sum of an infinite number of amplitudes over all possible paths. 
The path that is dictated by the classical equation of motion is simply the most 
probable, thus the method yields important insight into the connection between 
the classical and quantum mechanical approaches. It has the remarkable feature of 
not using operators in Hilbert space. Only ordinary c-numbers appear in the for- 
mula, although at the expense of an infinite number of integrations. What we deal 
with primarily is the propagator or the transition amplitude. 

The purpose of this chapter is to give a basic idea of the path integral. Applica- 
tions to field theory are given in the next chapter. 



Since the underlying ideas and mathematics are very different from those of the 
standard canonical quantization, we go back to the basics of quantum mechanics 
and review some of their features to pave the way to the path integral. 1)

10.1.1 

Bra and Ket 

We define a state or Dirac's ket $|q\rangle$ as the eigenstate of a position operator $\hat{q}$, namely

$$\hat{q}|q\rangle = q|q\rangle \tag{10.1}$$

1) In this chapter we write $\hbar$ and $c$ explicitly.

Elementary Particle Physics, Volume 1: Quantum Field Theory and Particles. Yorikiyo Nagashima
Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
ISBN: 978-3-57-

where $q$ is the coordinate of the position; it is a c-number. In this book we use a
"hat", e.g. $\hat{A}$, to denote an operator with eigenvalue $A$. For any ket vector, we can
define its conjugate, bra vector and a scalar (inner) product of a bra $\langle\phi|$ and a ket
$|\psi\rangle$ as a pure number with the property

$$\langle\phi|\psi\rangle = \langle\psi|\phi\rangle^* \tag{10.2}$$

The position eigenstates define a complete orthonormal basis. Namely they satisfy

$$\langle q|q'\rangle = \delta(q - q') \quad \text{: orthonormality} \tag{10.3a}$$

$$\int dq\,|q\rangle\langle q| = 1 \quad \text{: completeness} \tag{10.3b}$$

Similarly, eigenstates of the momentum operator $\hat{p}$ also satisfy the orthonormality
and completeness conditions

$$\hat{p}|p\rangle = p|p\rangle, \quad \langle p|p'\rangle = 2\pi\hbar\,\delta(p - p'), \quad \int\frac{dp}{2\pi\hbar}\,|p\rangle\langle p| = 1 \tag{10.4}$$

The position and momentum operators satisfy the commutation relation

$$[\hat{q}, \hat{p}] = i\hbar \tag{10.5}$$



10.1.2 

Translational Operator 

If we define a translational operator by 

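$$U(a)|q\rangle = |q + a\rangle \tag{10.6}$$

it satisfies the relation

$$\hat{q}\,U(a)|q\rangle = \hat{q}|q + a\rangle = (q + a)|q + a\rangle = \{U(a)\hat{q} + aU(a)\}|q\rangle \tag{10.7}$$

Since this holds for any $|q\rangle$, and these form a complete orthonormal basis,

$$[\hat{q}, U(a)] = aU(a) \tag{10.8}$$

We learned in the previous chapter that $U(a)$ can be expressed as

$$U(a) = e^{-\frac{i}{\hbar}a\hat{p}} \tag{10.9}$$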




It is straightforward to show 

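$$U(a)^{-1}\hat{q}\,U(a) = e^{\frac{i}{\hbar}a\hat{p}}\,\hat{q}\,e^{-\frac{i}{\hbar}a\hat{p}} = \hat{q} + a \tag{10.10a}$$

$$U(a)U(b) = U(a + b) \tag{10.10b}$$

It follows that an operator $\hat{O}(\hat{q}, \hat{p})$ that can be Taylor expanded in powers of $\hat{p}$
and $\hat{q}$ is transformed by $U(a)$ as

$$U(a)^{-1}\hat{O}(\hat{q}, \hat{p})U(a) = \hat{O}(\hat{q} + a, \hat{p}) \tag{10.11}$$

Using the translation operator, any eigenvector of $\hat{q}$ is expressed as

$$|q\rangle = U(q - q_0)|q_0\rangle \tag{10.12}$$

where $q_0$ is the origin of the coordinate. When translational invariance holds, the
origin can be placed anywhere. Without loss of generality we may put $q_0 = 0$ and
the position eigenvector is expressed as

$$|q\rangle = U(q)|0\rangle_q \tag{10.13}$$

The subscript "q" is attached to remind us it is a position eigenvector. Let us see
the effect of $\hat{p}$ acting on the position vector:

$$|q + dq\rangle = \left(1 - \frac{i}{\hbar}dq\,\hat{p}\right)|q\rangle + O(dq^2)$$

$$\therefore\ \hat{p}|q\rangle = i\hbar\lim_{dq \to 0}\frac{|q + dq\rangle - |q\rangle}{dq} = i\hbar\frac{\partial}{\partial q}|q\rangle \tag{10.14}$$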

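Multiplying by $\langle p|$ on both sides, we obtain

$$\langle p|\hat{p}|q\rangle = p\langle p|q\rangle = i\hbar\frac{\partial}{\partial q}\langle p|q\rangle \tag{10.15}$$

This has a solution

$$\langle p|q\rangle = N\exp\left(-\frac{i}{\hbar}pq\right) \tag{10.16}$$

To determine the constant N, we use Eq. (10.4). Inserting a complete set in the
intermediate states

$$\begin{aligned}
\langle p'|p\rangle &= \int dq\,\langle p'|q\rangle\langle q|p\rangle = \int dq\,N e^{-\frac{i}{\hbar}p'q}\,N^* e^{\frac{i}{\hbar}pq} \\
&= 2\pi\hbar\,\delta(p' - p)\,|N|^2
\end{aligned} \tag{10.17}$$

$$\therefore\ |N| = 1$$

Therefore, we have obtained the transformation amplitude:

$$\langle q|p\rangle = \langle p|q\rangle^* = \exp\left(\frac{i}{\hbar}qp\right) \tag{10.18}$$

Then it can easily be shown that

$$|q\rangle = \int\frac{dp}{2\pi\hbar}\,|p\rangle\langle p|q\rangle = \int\frac{dp}{2\pi\hbar}\,e^{-\frac{i}{\hbar}pq}|p\rangle \tag{10.19a}$$

$$|p\rangle = \int dq\,|q\rangle\langle q|p\rangle = \int dq\,e^{\frac{i}{\hbar}qp}|q\rangle \tag{10.19b}$$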

Namely, the position basis vector and the momentum basis vector are connected to 
each other by Fourier transformation. 

To find representations of $\hat{p}$ in the position basis, we use Eq. (10.14). Multiplying
by $\langle q|$ on both sides, we obtain

$$\begin{aligned}
\langle q|\hat{p}|q'\rangle &= \langle q|i\hbar\frac{\partial}{\partial q'}|q'\rangle = i\hbar\frac{\partial}{\partial q'}\langle q|q'\rangle = i\hbar\frac{\partial}{\partial q'}\delta(q - q') \\
&= -i\hbar\frac{\partial}{\partial q}\delta(q - q')
\end{aligned} \tag{10.20}$$

Note: The operator $\hat{p}$ acts on the ket vector $|q\rangle$ and does nothing to ordinary c-numbers.
Equation (10.15) is different from the one used in the Schrödinger equation,
$\hat{p} = -i\hbar\,\partial/\partial q$. This is because "bra" and "ket" are elements of abstract Hilbert space
and the Schrödinger wave function is a representation of the ket vector $|\psi\rangle$ in the
coordinate basis. 2) Defining the wave function as

$$\psi(q) = \langle q|\psi\rangle \tag{10.21}$$

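The relations between the ket vectors

$$\hat{q}|q\rangle = q|q\rangle \tag{10.22a}$$

$$\hat{p}|q\rangle = i\hbar\frac{\partial}{\partial q}|q\rangle \tag{10.22b}$$

$$\langle q|q'\rangle = \delta(q - q') \quad \text{and} \quad 1 = \int dq\,|q\rangle\langle q| \tag{10.22c}$$

are converted to operations acting on the wave function:

$$\langle q|\hat{q} = q\langle q| \ \to\ \langle q|\hat{q}|\psi\rangle = q\langle q|\psi\rangle = q\psi(q) \tag{10.23a}$$

$$\langle q|\hat{p} = -i\hbar\frac{\partial}{\partial q}\langle q| \ \to\ \langle q|\hat{p}|\psi\rangle = -i\hbar\frac{\partial}{\partial q}\langle q|\psi\rangle = -i\hbar\frac{\partial}{\partial q}\psi(q) \tag{10.23b}$$

$$\langle\phi|\psi\rangle = \int dq\,\langle\phi|q\rangle\langle q|\psi\rangle = \int dq\,\phi(q)^*\psi(q) \tag{10.23c}$$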


The situation is reminiscent of the vector A \A) = E ki)(« 
in Newton mechanics. Its length, direction on the basis ve 

it is an abstract entity. A representation expression for c 
( A i , A 2 , A 3 ) can be obtained by expanding mathematically 

e,. In the ket notation, it is written as 


i\A). The effect of an action 

tation. For instance, the 

n active rotation of a vector is 



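$$\langle q|p\rangle = e^{\frac{i}{\hbar}pq} \tag{10.24}$$

$$\psi(q) = \langle q|\psi\rangle = \int\frac{dp}{2\pi\hbar}\,\langle q|p\rangle\langle p|\psi\rangle = \int\frac{dp}{2\pi\hbar}\,e^{\frac{i}{\hbar}pq}\psi(p) \tag{10.25a}$$

Similarly

$$\psi(p) = \int dq\,e^{-\frac{i}{\hbar}pq}\psi(q) \tag{10.25b}$$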

Notice the sign of the exponent is different from that of the \q) and \p) Fourier 
transformation of Eq. (10.19). 

10.2 

Quantum Mechanical Equations 

10.2.1 

Schrodinger Equation 

In terms of bras and kets, the Schrodinger equation is expressed as 

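$$i\hbar\frac{\partial}{\partial t}|\psi(t)\rangle = \hat{H}(\hat{p}, \hat{q})|\psi(t)\rangle \tag{10.26}$$

where the Hamiltonian $\hat{H}$ is time independent. Let us calculate the matrix element
of the Hamiltonian operator in the position basis:

$$\hat{H} = \frac{\hat{p}^2}{2m} + V(\hat{q})$$

$$\begin{aligned}
\langle q|\frac{\hat{p}^2}{2m}|q'\rangle &= \frac{1}{2m}\int dy\,\langle q|\hat{p}|y\rangle\langle y|\hat{p}|q'\rangle \\
&= \frac{1}{2m}\int dy\left[-i\hbar\frac{\partial}{\partial q}\delta(q - y)\right]\left[-i\hbar\frac{\partial}{\partial y}\delta(y - q')\right] \\
&= -\frac{\hbar^2}{2m}\frac{\partial^2}{\partial q^2}\delta(q - q')
\end{aligned}$$

$$\langle q|V(\hat{q})|q'\rangle = V(q)\delta(q - q')$$

$$\therefore\ \langle q|\hat{H}|q'\rangle = \left[-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial q^2} + V(q)\right]\delta(q - q') \tag{10.28}$$

In the q representation

$$i\hbar\frac{\partial}{\partial t}\langle q|\psi(t)\rangle = \int dq'\,\langle q|\hat{H}(\hat{p}, \hat{q})|q'\rangle\langle q'|\psi(t)\rangle \tag{10.29a}$$

Inserting Eq. (10.28)

$$i\hbar\frac{\partial}{\partial t}\psi(tq) = H\left(q, -i\hbar\frac{\partial}{\partial q}\right)\psi(tq) \tag{10.29b}$$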

This is the familiar Schrodinger equation one encounters in textbooks of quantum 
mechanics. 

10.2.2 
Propagators 

The Schrodinger equation can be cast in an integral form. To do this we define a 
time evolution operator by 



If we insert Eq. (10.30) into the Schrodinger equation (10.26) and integrate, we 
obtain 

$$K(t_1, t_2) = \theta(t_1 - t_2)\,e^{-\frac{i}{\hbar}\hat{H}t_1}e^{\frac{i}{\hbar}\hat{H}t_2} = \theta(t_1 - t_2)\,e^{-\frac{i}{\hbar}\hat{H}(t_1 - t_2)} \tag{10.31}$$

The last equation holds when the Hamiltonian is time independent. The time evolution
operator satisfies

$$\left(i\hbar\frac{\partial}{\partial t_1} - \hat{H}\right)K(t_1, t_2) = i\hbar\,\delta(t_1 - t_2) \tag{10.32}$$

The propagator is given by taking matrix elements in a given basis. In the coordinate
basis defined by $\hat{q}|q\rangle = q|q\rangle$, it is written as

$$\langle q_1|K(t_1, t_2)|q_2\rangle = K(t_1q_1; t_2q_2) \tag{10.33}$$

Because $K(t_1, t_2) \to 1$ as $t_1 \to t_2$, it satisfies the initial condition

$$\lim_{t_1 \to t_2}K(t_1q_1; t_2q_2) = \delta(q_1 - q_2) \tag{10.34}$$

Equation (10.32) shows that it obeys the equation

$$\left(i\hbar\frac{\partial}{\partial t_1} - H\right)K(t_1q_1; t_2q_2) = i\hbar\,\delta(t_1 - t_2)\,\delta(q_1 - q_2) \tag{10.35}$$

which shows that it is the Green's function (or the propagator) for the time-dependent
Schrödinger equation. We remind the reader that $K(t_1, t_2)$ is an operator while
$K(t_1q_1; t_2q_2)$ is a number.

The Schrödinger equation assumes all the time dependence is contained in the
wave function and the operator is time independent. We can also take the attitude
that all the time dependence is contained in the operator and the wave function is
time independent, which is called the Heisenberg picture. They are mathematically
equivalent. The difference lies in the choice of base vectors. In the field theoretical
treatment, the Heisenberg picture is generally more convenient and intuitively
appealing, because equations of quantized fields are straightforward extensions of
classical field equations.

So we will investigate the propagator in the Heisenberg picture. Attaching the
subscripts S or H to differentiate objects in the two pictures

$$\hat{O}_H(t) = e^{\frac{i}{\hbar}\hat{H}t}\,\hat{O}_S\,e^{-\frac{i}{\hbar}\hat{H}t} \tag{10.36a}$$

$$|tq\rangle_H = e^{\frac{i}{\hbar}\hat{H}t}|q\rangle_S \tag{10.36b}$$

Note, in the Heisenberg picture

$$\hat{q}_H|tq\rangle_H = q|tq\rangle_H \tag{10.37}$$

Despite its apparent dependence on the time coordinate in the ket, it represents
a stationary state. Therefore, it should not be interpreted as a ket that changes
with time; rather, the dynamical development of the operator up to time t has
been transferred to the state. The transition amplitude in the
Heisenberg picture is given by

$$\begin{aligned}
{}_H\langle t_1q_1|t_2q_2\rangle_H &= {}_S\langle q_1|e^{-\frac{i}{\hbar}\hat{H}t_1}e^{\frac{i}{\hbar}\hat{H}t_2}|q_2\rangle_S = {}_S\langle q_1|K(t_1, t_2)|q_2\rangle_S \\
&= K(t_1q_1; t_2q_2)
\end{aligned} \tag{10.38}$$

Equation (10.38) states that the propagator is a probability amplitude for finding the 
system at some point q x in the configuration space at time t\ when it was originally 
at q 2 at time t 2 . Or stated differently, it is a matrix element of the time-ordered time 
evolution operator between the coordinate basis states in the Heisenberg picture. 

As stated earlier, the state vector $|\psi\rangle_H = e^{\frac{i}{\hbar}\hat Ht}|\psi(t)\rangle_S$ in the Heisenberg
picture is stationary, hence the wave function $\langle q|\psi\rangle_H$ is stationary. But as Eq. (10.36b)
shows, the coordinate basis is moving and $_H\langle tq|\psi\rangle_H$ is the same wave function as
the one that appears in the Schrödinger equation:

$$_H\langle tq|\psi\rangle_H = {}_S\langle q|\psi(t)\rangle_S = \psi(tq) \tag{10.39}$$

From now on, we omit the subscript H. Completeness of states enables us to write

$$\psi(t_fq_f) = \langle t_fq_f|\psi\rangle = \int dq_i\,\langle t_fq_f|t_iq_i\rangle\langle t_iq_i|\psi\rangle = \int dq_i\,K(t_fq_f;t_iq_i)\,\psi(t_iq_i) \tag{10.40}$$

This is Huygens' principle: a wave at a later time $t_f$ is the sum of waves
propagating from $q_i$ at time $t_i$. The equation is very general and satisfies the causality
condition ($K(t_f;t_i) = 0$ for $t_i > t_f$).

10.3

Feynman's Path Integral

10.3.1

Sum over History

To calculate the propagator, we first divide the continuous time interval into N small
but finite and discrete intervals

$$\exp\left[-\frac{i}{\hbar}H(t_f-t_i)\right] = \prod_{j=1}^{N}\exp\left[-\frac{i}{\hbar}H(t_j-t_{j-1})\right],\qquad t_0 = t_i,\quad t_N = t_f,\quad \varepsilon = t_j-t_{j-1}$$

and insert complete sets of coordinate bases at the intermediate time points (see
Fig. 10.1).

Figure 10.1 To perform the path integral, the time coordinate is discretized to N intervals
between $t_i$ and $t_f$. At each $t = t_j$, $q_j$ is integrated from $-\infty$ to $\infty$. $q_i$ and $q_f$ are fixed.
$\int\mathcal{D}q = \lim_{N\to\infty}\int dq_1\cdots dq_{N-1}$.



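Referring to Eq. (10.38), we can rewrite each intermediate inner product as

$$\langle t_jq_j|t_{j-1}q_{j-1}\rangle = \langle q_j|e^{-\frac{i}{\hbar}\hat H(t_j-t_{j-1})}|q_{j-1}\rangle = \langle q_j|e^{-\frac{i}{\hbar}\varepsilon\hat H}|q_{j-1}\rangle$$

If the Hamiltonian is of the form

$$\hat H = \frac{\hat p^2}{2m}+V(\hat q)$$

then

$$\begin{aligned}
\langle q_j|\frac{\hat p^2}{2m}|q_{j-1}\rangle &= \int\frac{dp'}{2\pi\hbar}\frac{dp}{2\pi\hbar}\,\langle q_j|p'\rangle\langle p'|\frac{\hat p^2}{2m}|p\rangle\langle p|q_{j-1}\rangle = \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|q_{j-1}\rangle\,\frac{p^2}{2m}\\
\langle q_j|V(\hat q)|q_{j-1}\rangle &= V(q_{j-1})\langle q_j|q_{j-1}\rangle = \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|q_{j-1}\rangle\,V(q_{j-1})
\end{aligned} \tag{10.45}$$

and therefore

$$\langle q_j|H(\hat p,\hat q)|q_{j-1}\rangle = \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|q_{j-1}\rangle\,H(p,q_{j-1}) \tag{10.46}$$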

The Hamiltonian is no longer an operator but is written in terms of classical num- 
bers. Generally, the Hamiltonian may include a polynomial of the product of op- 
erators q, p, as, for instance, the magnetic force on the charged particle. In such 
cases, a certain subtlety arises. If all p's stand to the left of the coordinate opera- 
tors, we can insert the complete momentum basis to the left of the Hamiltonian 
and obtain 

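$$\begin{aligned}
\langle q_j|e^{-\frac{i}{\hbar}\varepsilon H(\hat p,\hat q)}|q_{j-1}\rangle &\simeq \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|\left[1-\frac{i}{\hbar}\varepsilon H(\hat p,\hat q)\right]|q_{j-1}\rangle\\
&= \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|q_{j-1}\rangle\left[1-\frac{i}{\hbar}\varepsilon H(p,q_{j-1})\right]
\end{aligned} \tag{10.47}$$

which has the same form as Eq. (10.46). On the other hand, if all p's stand to the
right of the coordinates,

$$\begin{aligned}
\langle q_j|e^{-\frac{i}{\hbar}\varepsilon H(\hat p,\hat q)}|q_{j-1}\rangle &\simeq \int\frac{dp}{2\pi\hbar}\,\langle q_j|\left[1-\frac{i}{\hbar}\varepsilon H(\hat p,\hat q)\right]|p\rangle\langle p|q_{j-1}\rangle\\
&\simeq \int\frac{dp}{2\pi\hbar}\,\langle q_j|p\rangle\langle p|q_{j-1}\rangle\,e^{-\frac{i}{\hbar}\varepsilon H(p,q_j)}
\end{aligned} \tag{10.48}$$

As there is no well-defined way to order the product of operators, we take the
midpoint. 3) Putting $\langle q_j|p\rangle = e^{\frac{i}{\hbar}pq_j}$, we obtain

$$\langle q_j|e^{-\frac{i}{\hbar}\varepsilon\hat H}|q_{j-1}\rangle = \int\frac{dp_j}{2\pi\hbar}\exp\left[\frac{i}{\hbar}\left\{p_j(q_j-q_{j-1})-\varepsilon H(p_j,\bar q_j)\right\}\right],\qquad \bar q_j = \frac{q_j+q_{j-1}}{2} \tag{10.49}$$

3) For a special ordering called Weyl ordering, where the ordering of operators is symmetrized,
like $p^2q = \frac{1}{3}(p^2q+pqp+qp^2)$, the midpoint method is exact, and this is the conventional
assumption. See [113, 255] for more detailed treatments.

Writing all the inner products similarly, we obtain

$$\langle t_fq_f|t_iq_i\rangle = \lim_{N\to\infty}\int\cdots\int dq_1\cdots dq_{N-1}\,\frac{dp_1}{2\pi\hbar}\cdots\frac{dp_N}{2\pi\hbar}\,\exp\left[\frac{i}{\hbar}\sum_{j=1}^{N}\left\{p_j(q_j-q_{j-1})-\varepsilon H(p_j,\bar q_j)\right\}\right] \tag{10.50}$$

Note that there are N inner products, hence N integrals over the momenta, but
there are only N − 1 intermediate states, hence N − 1 integrals over the coordinates.
In the continuum limit of $\varepsilon\to 0$, the exponent can be rewritten as

$$\frac{i}{\hbar}\sum_{j=1}^{N}\varepsilon\left[p_j\frac{q_j-q_{j-1}}{\varepsilon}-H(p_j,\bar q_j)\right] \to \frac{i}{\hbar}\int_{t_i}^{t_f}dt\,[p\dot q-H(p,q)] = \frac{i}{\hbar}\int_{t_i}^{t_f}dt\,L = \frac{i}{\hbar}S[q] \tag{10.51}$$

where S[q] is the classical action. Therefore, the propagator may be symbolically
written as

$$\langle t_fq_f|t_iq_i\rangle = \int\mathcal{D}q\,\mathcal{D}\!\left(\frac{p}{2\pi\hbar}\right)\exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f}dt\,\{p\dot q-H(p,q)\}\right] \tag{10.52}$$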

This is the path integral expression for the propagator from $(t_iq_i)$ to $(t_fq_f)$. In
the continuum limit q becomes a function of t, and the integral is a functional 
integral. Each function q(t) and p (t) defines a path in phase space. The integration 
is over all functions and is infinite dimensional. An outstanding characteristic of 
the path integral is that the quantum operators q and p do not appear, but only 
their representations expressed in ordinary numbers. It is not obvious, however, 
that the infinite-dimensional integrals of this type are well-defined mathematically, 
i.e. if they converge. We proceed assuming they do. 

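If the Hamiltonian has a special form, namely quadratic in the momenta, such as

$$H = \frac{p^2}{2m}+V(q) \tag{10.53}$$

we can perform the p-integration:

$$\begin{aligned}
\int\frac{dp_j}{2\pi\hbar}\exp\left[\frac{i}{\hbar}\left\{p_j(q_j-q_{j-1})-\varepsilon\frac{p_j^2}{2m}\right\}\right] &= \frac{1}{2\pi\hbar}\left(\frac{2\pi m\hbar}{i\varepsilon}\right)^{1/2}\exp\left[\frac{im\varepsilon}{2\hbar}\left(\frac{q_j-q_{j-1}}{\varepsilon}\right)^2\right]\\
&= \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\exp\left[\frac{im\varepsilon}{2\hbar}\left(\frac{q_j-q_{j-1}}{\varepsilon}\right)^2\right] \quad 4)
\end{aligned} \tag{10.54}$$

Substituting this expression into Eq. (10.50), we obtain

$$\begin{aligned}
\langle t_fq_f|t_iq_i\rangle &= \lim_{N\to\infty}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\int\cdots\int dq_1\cdots dq_{N-1}\,\exp\left[\frac{i}{\hbar}\sum_{j=1}^{N}\varepsilon\left\{\frac{m}{2}\left(\frac{q_j-q_{j-1}}{\varepsilon}\right)^2-V(\bar q_j)\right\}\right]\\
&= C\int\mathcal{D}q\,\exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f}dt\left\{\frac{1}{2}m\dot q^2-V(q)\right\}\right]\\
&= C\int\mathcal{D}q\,e^{\frac{i}{\hbar}S[q]}
\end{aligned} \tag{10.56}$$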

where C is a constant independent of the dynamics of the system and S [q] is the 
action of the system. This is the Feynman path integral in an alternative form. 
Although it is proved only for a special form of the Hamiltonian, the use of the 
Lagrangian has many desirable features. The action is a Lorentz scalar and the 
symmetry structure of the system is very easy to grasp. Besides it has the classi- 
cal path as the first approximation, so the transition from classical mechanics to 
quantum mechanics is transparent in this formalism. Therefore, we will use this 
as the starting point from which any equation of motion is derived. The propagator 
is expressed as a sum over all the path [q(t)] configurations connecting $t_iq_i$ with $t_fq_f$
with the weight $e^{(i/\hbar)S[q]}$. Classically, the path is determined as the one that minimizes
the action, but, in quantum mechanics, any path connecting the end points is possible,
though the action of a path far from the classical one generally gives a large
value relative to $\hbar$, hence a small weight because of the oscillatory nature of the
factor $e^{(i/\hbar)S[q]}$. If there is more than one path, as in the simple and familiar two-slit
experiment, both paths contribute and the interference is an important aspect of
quantum mechanical treatment. If the Hamiltonian is not of the type Eq. (10.53),
we may still use Eq. (10.56), but in that case we have to be careful in defining the
path integral measure $\mathcal{D}q$.

4) The Gaussian integral can be extended by analytically continuing $|A|$ to $|A|e^{i\theta}$ ($-\pi/2 < \theta < \pi/2$):

$$\int dq\,e^{-|A|q^2} = \sqrt{\frac{\pi}{|A|}} \;\to\; \int dq\,e^{-|A|e^{i\theta}q^2} = \sqrt{\frac{\pi}{|A|e^{i\theta}}} \tag{10.55a}$$

The limiting case $\theta\to\pm\pi/2$ is a variation of the Fresnel integral.
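The analytic continuation of the Gaussian integral quoted in footnote 4 can be checked numerically. The sketch below is only an illustration under stated assumptions: the value |A| = 2, the integration window and the grid size are arbitrary choices, and the helper names `gaussian_integral` and `closed_form` are hypothetical. It compares a trapezoidal estimate of $\int dq\,e^{-Aq^2}$ with $\sqrt{\pi/A}$ for several phases of A inside the allowed range.

```python
import cmath
import math


def gaussian_integral(A, half_width=20.0, n=40001):
    """Trapezoidal estimate of the integral of exp(-A q^2) over the real line.

    Only meaningful for Re(A) > 0, where the integrand decays.
    """
    h = 2.0 * half_width / (n - 1)
    total = 0.0 + 0.0j
    for k in range(n):
        q = -half_width + k * h
        weight = 0.5 if k in (0, n - 1) else 1.0
        total += weight * cmath.exp(-A * q * q)
    return total * h


def closed_form(A):
    """sqrt(pi / A) on the principal branch, i.e. for -pi/2 < arg(A) < pi/2."""
    return cmath.sqrt(math.pi / A)


# |A| = 2 is an arbitrary illustrative choice; theta stays below pi/2.
for theta in (0.0, 0.5, 1.0, 1.4):
    A = 2.0 * cmath.exp(1j * theta)
    print(theta, abs(gaussian_integral(A) - closed_form(A)))
```

As the phase approaches π/2 the damping factor Re(A) shrinks, so a wider window is needed; this is the numerical face of the statement that the purely oscillatory case is reached only as a limit.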

10.3.2 

Equivalence with the Schrodinger Equation 

From the form of the path integral, it is hard to think there is any connection 
with the Schrodinger equation, but we have developed the formalism starting from 
the Schrodinger equation. Both forms are known to reproduce the quantum me- 
chanical phenomena correctly. We show here that the path integral includes the 
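Schrödinger equation. The latter is a differential equation, so we examine an
infinitesimal form of the path integral. For a small time difference $t_f = t_i+\varepsilon$, the
propagator in Eq. (10.56) is expressed as

$$K(t+\varepsilon,q;t,y) = \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\exp\left[\frac{i\varepsilon}{\hbar}\left\{\frac{m}{2}\left(\frac{q-y}{\varepsilon}\right)^2-V\!\left(\frac{q+y}{2}\right)\right\}\right] \tag{10.57}$$

As the propagator describes the propagation of a wave function and satisfies

$$\psi(t+\varepsilon,q) = \int dy\,K(t+\varepsilon,q;t,y)\,\psi(t,y) \tag{10.58}$$

we can insert Eq. (10.57) in Eq. (10.58) and obtain

$$\psi(t+\varepsilon,q) = \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\int dy\,\exp\left[\frac{i\varepsilon}{\hbar}\left\{\frac{m}{2}\left(\frac{q-y}{\varepsilon}\right)^2-V\!\left(\frac{q+y}{2}\right)\right\}\right]\psi(t,y) \tag{10.59}$$

Changing the integration variable to z = y − q, we can rewrite the expression as

$$\psi(t+\varepsilon,q) = \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\int dz\,\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right)\exp\left[-\frac{i\varepsilon}{\hbar}V\!\left(q+\frac{z}{2}\right)\right]\psi(t,q+z) \tag{10.60}$$

As $\varepsilon$ is a small number, the dominant contribution will come from the region

$$\frac{m}{2\hbar\varepsilon}z^2 \lesssim 1 \tag{10.61}$$

In this region the first term is of the order of 1. We can expand the second and the
third terms in $\varepsilon$. Note each power of $z^n$ gives a factor of $\varepsilon^{n/2}$. Then

$$\begin{aligned}
\psi(t+\varepsilon,q) &= \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\int dz\,\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right)\left[1-\frac{i\varepsilon}{\hbar}V(q)+O(\varepsilon z)\right]\\
&\qquad\times\left[\psi(tq)+z\psi'(tq)+\frac{z^2}{2}\psi''(tq)+O(z^3)\right]\\
&= \left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{1/2}\int dz\,\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right)\left[\psi(tq)-\frac{i\varepsilon}{\hbar}V(q)\psi(tq)+z\psi'(tq)+\frac{z^2}{2}\psi''(tq)+O(z^3,\varepsilon z)\right]
\end{aligned} \tag{10.62}$$

Each term can easily be integrated and the results are

$$\int_{-\infty}^{\infty}dz\,\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right) = \left(\frac{2\pi i\hbar\varepsilon}{m}\right)^{1/2} \tag{10.63a}$$

$$\int_{-\infty}^{\infty}dz\,z\,\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right) = 0 \tag{10.63b}$$

$$\int_{-\infty}^{\infty}dz\,z^2\exp\left(\frac{im}{2\hbar\varepsilon}z^2\right) = \frac{i\hbar\varepsilon}{m}\left(\frac{2\pi i\hbar\varepsilon}{m}\right)^{1/2} \tag{10.63c}$$

Substituting the above equations back into Eq. (10.62), we obtain

$$\psi(t+\varepsilon,q) = \psi(tq)+\frac{i\hbar\varepsilon}{2m}\psi''(tq)-\frac{i\varepsilon}{\hbar}V(q)\psi(tq)+O(\varepsilon^2) \tag{10.64}$$

In the limit $\varepsilon\to 0$, we obtain

$$i\hbar\frac{\partial\psi(tq)}{\partial t} = \left(-\frac{\hbar^2}{2m}\frac{\partial^2}{\partial q^2}+V(q)\right)\psi(tq) \tag{10.65}$$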

Therefore the path integral contains the Schrodinger equation. 
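The short-time argument above can be tried numerically. The real-time kernel is oscillatory and awkward to integrate on a grid, so the sketch below makes an assumption that is not part of the derivation in this subsection: it uses the imaginary-time (Euclidean) version of the same kernel, obtained by replacing $i\varepsilon$ with $\varepsilon$, for which one step should give $\psi+\varepsilon\left(\tfrac{1}{2}\psi''-V\psi\right)$ in units $m=\hbar=1$. The test wave function, the potential and all names are illustrative choices.

```python
import math


def psi(q):
    """Trial wave function exp(-q^2/2) (illustrative choice)."""
    return math.exp(-0.5 * q * q)


def potential(q):
    """Harmonic potential q^2/2 with m = hbar = omega = 1 (illustrative choice)."""
    return 0.5 * q * q


def short_time_step(q, eps, n=4001, width=12.0):
    """One step of the imaginary-time kernel applied to psi, by trapezoidal rule.

    Euclidean analogue of Eq. (10.60):
    psi_new(q) = (2 pi eps)^(-1/2) * Int dz exp(-z^2/(2 eps) - eps V(q + z/2)) psi(q + z)
    """
    sigma = math.sqrt(eps)
    h = 2.0 * width * sigma / (n - 1)
    total = 0.0
    for k in range(n):
        z = -width * sigma + k * h
        weight = 0.5 if k in (0, n - 1) else 1.0
        kernel = math.exp(-z * z / (2.0 * eps) - eps * potential(q + 0.5 * z))
        total += weight * kernel * psi(q + z)
    return total * h / math.sqrt(2.0 * math.pi * eps)


def first_order_prediction(q, eps):
    """psi + eps * (psi''/2 - V psi), the Euclidean counterpart of Eq. (10.64)."""
    second_derivative = (q * q - 1.0) * psi(q)
    return psi(q) + eps * (0.5 * second_derivative - potential(q) * psi(q))


q0, eps0 = 0.7, 1.0e-3
print(short_time_step(q0, eps0), first_order_prediction(q0, eps0))
```

The two numbers should agree up to terms of order $\varepsilon^2$, which is the content of the expansion leading to Eq. (10.64).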

10.3.3 

Functional Calculus 

The path integral is a functional integral, which uses a type of mathematics readers 
may not have encountered. Perhaps it is useful to introduce basic operations of the 
functional and to evaluate simple examples for future applications. The simplest 
functional we know is the action functional defined by 

$$S[q] = \int_{t_i}^{t_f}dt\,L(q,\dot q) \tag{10.66}$$

Unlike a function, which gives a value for a particular point q [or $q = (q_1,\cdots,q_N)$],
the functional gives a value for an entire trajectory q = q(t) along which the
integration is carried out. If the trajectory connecting $q_i = q(t_i)$ and $q_f = q(t_f)$
is different, the value of the functional is different. Thus the functional has the
generic form

$$F[f] = \int dq\,F(f(q)) \tag{10.67}$$

Traditionally square brackets are used to denote the functional. Some people call 
F(f(q)) a functional, meaning a function of functions. Note that, unlike our defini- 
tion Eq. (10.67), which maps a continuous set of variables to one number, it maps 
to another continuous set of numbers. The notion of derivatives is also used in the 
functional. We define the functional derivative by 



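$$\frac{\delta F[f(q)]}{\delta f(y)} = \lim_{\varepsilon\to 0}\frac{F[f(q)+\varepsilon\,\delta(q-y)]-F[f(q)]}{\varepsilon} \tag{10.68}$$

It follows from the above equation that

$$\frac{\delta f(q)}{\delta f(y)} = \delta(q-y)$$

Problem 10.1
Prove that if

$$F[f] = \int dy\,F(f(y)) = \int dy\,\{f(y)\}^n$$

then

$$\frac{\delta F[f]}{\delta f(q)} = n\,\{f(q)\}^{n-1}$$

Problem 10.2
Prove that

$$\frac{\delta S}{\delta q(t')} = -\frac{d}{dt}\left(\frac{\partial L}{\partial\dot q}\right)\bigg|_{t=t'}+\frac{\partial L}{\partial q}\bigg|_{t=t'}$$

where S is the action given by Eq. (10.66).

Problem 10.3
Define

$$F_x[f] = \int dy\,G(x,y)\,f(y)$$

Here, x is to be regarded as a parameter. Prove

$$\frac{\delta F_x[f]}{\delta f(z)} = G(x,z)$$

We can make a Taylor expansion of the functional, too:

$$S[q] = S[q_c+\eta] = S[q_c]+\int dt\,\eta(t)\,\frac{\delta S[q]}{\delta q(t)}\bigg|_{q=q_c}+\frac{1}{2!}\int dt_1\,dt_2\,\eta(t_1)\eta(t_2)\,\frac{\delta^2S[q]}{\delta q(t_1)\delta q(t_2)}\bigg|_{q=q_c}+\cdots \tag{10.74}$$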

The form of the Taylor expansion Eq. (10.74) can be understood most easily if we 
divide the time integral into N segments (see Fig. 10.2): 

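$$S[q] = \int_{t_i}^{t_f}dt\,L(q(t),\cdots) = \sum_{j=1}^{N}\varepsilon\,L(q_j,\cdots) = F(q_0,q_1,q_2,\cdots,q_{N-1},q_N),\qquad q_0,\ q_N\ \text{fixed} \tag{10.75}$$

In this form, the functional becomes an ordinary multivariable function:

$$F[q] = F(q_0,q_1,q_2,\cdots,q_j,\cdots,q_N)$$

Then the Taylor expansion of the functional can be carried out using the ordinary
Taylor expansion:

$$F[q+\eta] = F(q_0+\eta_0,q_1+\eta_1,\cdots,q_N+\eta_N)\big|_{\eta_0=\eta_N=0} = \sum_{n=0}^{\infty}\frac{1}{n!}\left(\sum_{k=1}^{N-1}\eta_k\frac{\partial}{\partial q_k}\right)^{n}F(q_0,\cdots,q_N) \tag{10.76}$$

Figure 10.2 Functional differential path.

Going back to continuous variables

$$k\to t_k,\qquad \eta_k\ (k=1,\cdots,N)\to\eta(t_k),\qquad \sum_k\varepsilon\to\int dt_k,\qquad t_i<t_k<t_f \tag{10.78}$$

the second term in Eq. (10.76) with n = 1 becomes

$$\sum_k\eta_k\frac{\partial F}{\partial q_k} = \sum_k\varepsilon\,\eta_k\left[\frac{1}{\varepsilon}\frac{\partial F}{\partial q_k}\right] \to \int dt\,\eta(t)\,\frac{\delta S[q]}{\delta q(t)} \tag{10.79}$$

Note, in the second equality, $(1/\varepsilon)\,\partial F/\partial q_k$ is well behaved because of Eq. (10.75)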
and gives the second term in Eq. (10.74). Other higher order terms can be derived 
similarly. 
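The statement that $(1/\varepsilon)\,\partial F/\partial q_k$ tends to the functional derivative can be illustrated with a small numerical experiment. The sketch below is only an example under stated assumptions: the Lagrangian is taken to be that of a harmonic oscillator with illustrative values of m and ω, the trial path is $q(t)=\sin t$, and the function names are hypothetical. It differentiates the discretized action of Eq. (10.75) with respect to one grid value and compares the result with $-m\ddot q-V'(q)$, the expression that appears in Problem 10.2.

```python
import math

M = 1.0        # mass (illustrative value)
OMEGA = 2.0    # oscillator frequency (illustrative value)


def potential(q):
    return 0.5 * M * OMEGA ** 2 * q * q


def discrete_action(path, eps):
    """Discretized action: sum over j of eps * [m/2 ((q_j - q_{j-1})/eps)^2 - V(q_j)]."""
    total = 0.0
    for j in range(1, len(path)):
        velocity = (path[j] - path[j - 1]) / eps
        total += eps * (0.5 * M * velocity ** 2 - potential(path[j]))
    return total


def functional_derivative(path, eps, k, h=1e-4):
    """(1/eps) dF/dq_k, estimated with a central finite difference in q_k."""
    up = list(path)
    up[k] += h
    down = list(path)
    down[k] -= h
    return (discrete_action(up, eps) - discrete_action(down, eps)) / (2.0 * h * eps)


def euler_lagrange_expression(t):
    """-m q'' - V'(q) evaluated on the trial path q(t) = sin t."""
    return M * math.sin(t) - M * OMEGA ** 2 * math.sin(t)


N = 1000
T = 1.0
EPS = T / N
PATH = [math.sin(j * EPS) for j in range(N + 1)]
K = N // 2
print(functional_derivative(PATH, EPS, K), euler_lagrange_expression(K * EPS))
```

Without the factor $1/\varepsilon$ the partial derivative would vanish in the continuum limit, which is why that factor is grouped with $\partial F/\partial q_k$ in Eq. (10.79).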



10.4 

Propagators: Simple Examples 

Here we calculate the propagator in some simple examples to gain familiarity with 
the path integral method. 

10.4.1 

Free-Particle Propagator 

The free particle is the simplest example. In fact, the propagator can be obtained 
easily without depending on the path integral. For the sake of later applications, we 
derive the result in two ways. 

(1) Canonical Method The definition of the transition amplitude Eq. (10.38) gives 

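$$K_{\rm free}(f,i) \equiv K_0(t_fq_f;t_iq_i) = \langle t_fq_f|t_iq_i\rangle\big|_{H=H_0} = \langle q_f|e^{-\frac{i}{\hbar}\hat H_0(t_f-t_i)}|q_i\rangle \tag{10.80}$$

The Hamiltonian of the free particle is given by

$$H_0 = \frac{\hat p^2}{2m} \tag{10.81}$$

It does not contain coordinate operators, therefore it can be diagonalized by simply
inserting the momentum basis in the intermediate states:

$$\begin{aligned}
K_0(t_fq_f;t_iq_i) &= \theta(t_f-t_i)\int\frac{dp}{2\pi\hbar}\frac{dp'}{2\pi\hbar}\,\langle q_f|p'\rangle\langle p'|e^{-\frac{i}{\hbar}\hat H_0(t_f-t_i)}|p\rangle\langle p|q_i\rangle\\
&= \theta(t_f-t_i)\int\frac{dp}{2\pi\hbar}\frac{dp'}{2\pi\hbar}\,e^{\frac{i}{\hbar}(p'q_f-pq_i)}\,2\pi\hbar\,\delta(p-p')\exp\left\{-\frac{i}{\hbar}\frac{p^2}{2m}(t_f-t_i)\right\}\\
&= \theta(t_f-t_i)\left[\frac{m}{2\pi i\hbar(t_f-t_i)}\right]^{1/2}\exp\left[\frac{im(q_f-q_i)^2}{2\hbar(t_f-t_i)}\right]
\end{aligned} \tag{10.82}$$

where we have used $\int dz\,e^{-Az^2} = \sqrt{\pi/A}$ and its analytical continuation to
$A = |A|e^{i\theta}$ ($-\pi/2 < \theta < \pi/2$). It expresses the well known fact that a localized wave
packet spreads with time.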

It is often more convenient to work in momentum space. The boundary condition
K = 0 for t − t' < 0 can be explicitly taken into account in the momentum
representation. Using the integral representation for the step function

$$\theta(t) = \frac{1}{2\pi i}\int_{-\infty}^{\infty}d\omega\,\frac{e^{i\omega t}}{\omega-i\varepsilon} \tag{10.83}$$

and inserting it into Eq. (10.82), we obtain

$$K_0(tq;t'q') = \int\frac{dp}{2\pi\hbar}\frac{d\omega}{2\pi i}\,\frac{1}{\omega-i\varepsilon}\exp\left[\frac{i}{\hbar}\left\{p(q-q')-\left(\frac{p^2}{2m}-\hbar\omega\right)(t-t')\right\}\right] \tag{10.84}$$

In this form, the energy no longer takes its on-shell value $p^2/2m$, but any value
$E = p^2/2m-\hbar\omega$. So we use the energy variable to express the propagator

$$K_0(tq;t'q') = \int\frac{dp}{2\pi\hbar}\frac{dE}{2\pi\hbar}\,e^{\frac{i}{\hbar}[p(q-q')-E(t-t')]}\,\frac{i\hbar}{E-(p^2/2m)+i\varepsilon} \tag{10.85}$$

This is the momentum representation of the free propagator. In field theory, we 
will encounter its relativistic counterpart. 

Note: Classically, the exponent of the free propagator can be expressed as
$(i/\hbar)(pq/2)$ [$p = m(q_f-q_i)/(t_f-t_i) = m\dot q$, $q = q_f-q_i$], but quantum
mechanically, the momentum and the position cannot be specified simultaneously.
The action has to be written in terms of the initial and final coordinates and the
elapsed time only, as expressed in the last equation of Eq. (10.82). Let us compare
the result with the classical relation. The classical path for the action

$$S[q] = \int_{t_i}^{t_f}dt\,\frac{m}{2}\dot q^2 \tag{10.86}$$

is given by solving the Euler-Lagrange equation, which is written as

$$m\ddot q = 0 \;\to\; \dot q = v = \text{constant} \tag{10.87}$$

Therefore, the classical action gives

$$S[q_{\rm cl}] = \int_{t_i}^{t_f}dt\,\frac{1}{2}mv^2 = \frac{m}{2}\frac{(q_f-q_i)^2}{t_f-t_i} \tag{10.88}$$

Thus the propagator is written as

$$K_{\rm free}(f,i) = \left[\frac{m}{2\pi i\hbar(t_f-t_i)}\right]^{1/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]} \tag{10.89}$$



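This is a special example because the quantum solution is completely determined
by the classical path.

(2) Gaussian Integral The second method is a brute-force way, but provides a good
exercise for path integral calculation. Since the Hamiltonian contains the momentum
in quadratic form, we can use Eq. (10.56) for the path integral calculation. The
action is given by Eq. (10.86). Then the propagator is calculated to give

$$K_{\rm free}(f,i) = \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\int dq_1\cdots dq_{N-1}\,\exp\left[\frac{i\varepsilon}{\hbar}\sum_{j=1}^{N}\frac{m}{2}\left(\frac{q_j-q_{j-1}}{\varepsilon}\right)^2\right] \tag{10.90}$$

Using $\int dz\,e^{-Az^2} = \sqrt{\pi/A}$ with its analytical continuation to $A = |A|e^{i\theta}$
($-\pi/2 < \theta < \pi/2$), we can evaluate the integral. Changing the integration variable to

$$y_j = \left(\frac{m}{2i\hbar\varepsilon}\right)^{1/2}q_j \tag{10.91}$$

we have

$$K_{\rm free}(f,i) = \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\left(\frac{2i\hbar\varepsilon}{m}\right)^{(N-1)/2}\int dy_1\cdots dy_{N-1}\,\exp\left[-\sum_{j=1}^{N}(y_j-y_{j-1})^2\right] \tag{10.92}$$

Let us integrate the jth integral and see what relations exist between different j's:

$$\begin{aligned}
I_j &= \int dy_j\,dy_{j+1}\,\exp\left[-\left\{(y_j-y_{j-1})^2+(y_{j+1}-y_j)^2+(y_{j+2}-y_{j+1})^2\right\}\right]\\
&= \int dy_j\,dy_{j+1}\,\exp\left[-\left\{2\left(y_j-\frac{y_{j+1}+y_{j-1}}{2}\right)^2+\frac{1}{2}(y_{j+1}-y_{j-1})^2+(y_{j+2}-y_{j+1})^2\right\}\right]
\end{aligned}$$

Integration over $y_j$ will take $I_j$ to the form

$$\begin{aligned}
I_j &= \left(\frac{\pi}{2}\right)^{1/2}\int dy_{j+1}\,\exp\left[-\left\{\frac{3}{2}\left(y_{j+1}-\frac{2y_{j+2}+y_{j-1}}{3}\right)^2+\frac{1}{3}(y_{j+2}-y_{j-1})^2\right\}\right]\\
&= \left(\frac{\pi}{2}\right)^{1/2}\left(\frac{2\pi}{3}\right)^{1/2}\exp\left[-\frac{1}{3}(y_{j+2}-y_{j-1})^2\right]
\end{aligned} \tag{10.93}$$

From the above calculation, we see that repeating the integration j times starting
from j = 1 gives

$$\begin{aligned}
\int dy_1\cdots dy_j\,\exp\left[-\sum_{k=1}^{j+1}(y_k-y_{k-1})^2\right] &= \left(\frac{\pi}{2}\right)^{1/2}\left(\frac{2\pi}{3}\right)^{1/2}\cdots\left(\frac{j\pi}{j+1}\right)^{1/2}\exp\left[-\frac{1}{j+1}(y_{j+1}-y_0)^2\right]\\
&= \left[\frac{\pi^j}{j+1}\right]^{1/2}\exp\left[-\frac{1}{j+1}(y_{j+1}-y_0)^2\right]
\end{aligned} \tag{10.94}$$

Inserting Eq. (10.94) into Eq. (10.92), we obtain

$$\begin{aligned}
K_{\rm free}(f,i) &= \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\left(\frac{2i\hbar\varepsilon}{m}\right)^{(N-1)/2}\left[\frac{\pi^{N-1}}{N}\right]^{1/2}\exp\left[-\frac{1}{N}(y_N-y_0)^2\right]\\
&= \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar N\varepsilon}\right)^{1/2}\exp\left[\frac{im}{2\hbar N\varepsilon}(q_N-q_0)^2\right]\\
&= \left[\frac{m}{2\pi i\hbar(t_f-t_i)}\right]^{1/2}\exp\left[\frac{i}{\hbar}\frac{m(q_f-q_i)^2}{2(t_f-t_i)}\right]
\end{aligned} \tag{10.95}$$

where we have used $N\varepsilon = t_f-t_i$, $q_0 = q_i$, $q_N = q_f$ in the last equation. Note all
the potentially singular terms have disappeared from the expression. In the limit $t_f\to t_i$

$$K_{\rm free}(f,i) = \langle t_fq_f|t_iq_i\rangle \xrightarrow{\;t_f\to t_i\;} \delta(q_f-q_i) \tag{10.96}$$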

which is nothing but the orthonormality relation in the coordinate basis. 
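The chain of Gaussian integrations in Eq. (10.94) can be verified by brute force for a short chain. The sketch below is a check under stated assumptions: it takes j = 2 (two intermediate variables), real Gaussian weights as in Eq. (10.92), and an arbitrary grid. The function names are hypothetical. It compares a two-dimensional Riemann sum with the closed form $[\pi^j/(j+1)]^{1/2}\exp[-(y_{j+1}-y_0)^2/(j+1)]$.

```python
import math


def chain_integral(y0, y3, half_width=8.0, n=321):
    """Riemann-sum estimate of
    Int dy1 dy2 exp(-[(y1 - y0)^2 + (y2 - y1)^2 + (y3 - y2)^2]).
    """
    h = 2.0 * half_width / (n - 1)
    grid = [-half_width + k * h for k in range(n)]
    total = 0.0
    for y1 in grid:
        left = (y1 - y0) ** 2
        for y2 in grid:
            total += math.exp(-(left + (y2 - y1) ** 2 + (y3 - y2) ** 2))
    return total * h * h


def closed_form(y0, y_last, j=2):
    """Right-hand side of Eq. (10.94) for j intermediate integrations."""
    return math.sqrt(math.pi ** j / (j + 1)) * math.exp(-(y_last - y0) ** 2 / (j + 1))


print(chain_integral(0.3, -0.5), closed_form(0.3, -0.5))
```

The end points enter only through the difference $y_{j+1}-y_0$, which is what turns into $(q_f-q_i)^2$ in the final free propagator.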

10.4.2 

Harmonic Oscillator 

Next we consider the one-dimensional harmonic oscillator. This is an example that 
can also be solved exactly. In fact, the method and conclusion are valid for any ac- 
tion that contains variables up to quadratic terms, and thus have very general uses. 
In the following, we consider a harmonic oscillator interacting with an external 
source described by the Lagrangian 

$$L = \frac{1}{2}m\dot q^2-\frac{1}{2}m\omega^2q^2+Jq \tag{10.97}$$

The time-dependent external source J(t) can be thought of as an electric field if the
oscillator carries an electric charge. The well-known result for the free harmonic
oscillator can be obtained from this system in the limit $J(t)\to 0$. Furthermore,
if the source is time independent, the problem can also be solved exactly. In this
case the system is nothing but an oscillator with its equilibrium position displaced,
because the Lagrangian can be rewritten to give

$$L = \frac{1}{2}m\dot q^2-\frac{1}{2}m\omega^2\left(q-\frac{J}{m\omega^2}\right)^2+\frac{J^2}{2m\omega^2} \tag{10.98}$$



This is the case, for instance, of a suspended spring under the effect of gravity. 

The Euler-Lagrange equation for the action Eq. (10.97) is determined by the
requirement

$$\frac{\delta S[q]}{\delta q(t)}\bigg|_{q=q_{\rm cl}} = 0 \tag{10.99}$$

which gives the equation of motion to determine the classical path $q_{\rm cl}(t)$:

$$m\ddot q_{\rm cl}+m\omega^2q_{\rm cl} = J \tag{10.100}$$

In quantum mechanics, q(t) can take any path, but it is convenient to expand it
around the classical path. Define the quantum fluctuation path $\eta(t)$ by

$$q(t) = q_{\rm cl}(t)+\eta(t) \tag{10.101}$$

Because of its quadratic nature, the Taylor expansion of the action terminates at the
second power of $\eta(t)$:

$$S[q] = S[q_{\rm cl}+\eta] = S[q_{\rm cl}]+\int dt\,\eta(t)\,\frac{\delta S[q]}{\delta q(t)}\bigg|_{q=q_{\rm cl}}+\frac{1}{2!}\int dt_1\,dt_2\,\eta(t_1)\eta(t_2)\,\frac{\delta^2S[q]}{\delta q(t_1)\delta q(t_2)}\bigg|_{q=q_{\rm cl}} \tag{10.102}$$

The second term in Eq. (10.102) vanishes because the classical path is where the
action is stationary, Eq. (10.99). Hence

$$S[q] = S[q_{\rm cl}]+\frac{1}{2!}\int dt_1\,dt_2\,\eta(t_1)\eta(t_2)\,\frac{\delta^2S[q]}{\delta q(t_1)\delta q(t_2)}\bigg|_{q=q_{\rm cl}} \tag{10.103}$$

Calculating the functional derivatives, this becomes

$$S[q] = S[q_{\rm cl}+\eta] = S[q_{\rm cl}]+\frac{m}{2}\int_{t_i}^{t_f}dt\left[\dot\eta(t)^2-\omega^2\eta^2\right] \tag{10.104}$$

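The variable $\eta(t)$ represents the quantum fluctuations around the classical path.
As the end points of the trajectories are fixed, they satisfy the boundary condition

$$\eta(t_i) = \eta(t_f) = 0 \tag{10.105}$$

As the harmonic oscillator is an important example, we calculate the propagator
again in two ways.

(1) Matrix Method

Summing over all the paths is equivalent to summing over all possible fluctuations
subject to the constraints of Eq. (10.105). The propagator is expressed as

$$K_{\rm osc}(f,i) \equiv \langle t_fq_f|t_iq_i\rangle\big|_{\text{harmonic oscillator}} = C\,e^{\frac{i}{\hbar}S[q_{\rm cl}]}\int\mathcal{D}\eta\,\exp\left[\frac{i}{\hbar}\frac{m}{2}\int_{t_i}^{t_f}dt\left\{\dot\eta(t)^2-\omega^2\eta^2\right\}\right] \tag{10.106a}$$

$$= \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\left(\frac{2i\hbar\varepsilon}{m}\right)^{(N-1)/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]}\int dy_1\cdots dy_{N-1}\,\exp\left[-\sum_{j=1}^{N}\left\{(y_j-y_{j-1})^2-\varepsilon^2\omega^2\left(\frac{y_j+y_{j-1}}{2}\right)^2\right\}\right] \tag{10.106b}$$

where $y_j = (m/2i\hbar\varepsilon)^{1/2}\eta_j$. We have the boundary condition $y_0 = y_N = 0$ because
$\eta(t_i) = \eta(t_f) = 0$. The exponent is quadratic in y and can be rewritten in matrix form:

$$\sum_{j=1}^{N}\left\{(y_j-y_{j-1})^2-\varepsilon^2\omega^2\left(\frac{y_j+y_{j-1}}{2}\right)^2\right\} = \sum_{j,k=1}^{N-1}y_jM_{jk}y_k = y^TMy \tag{10.107}$$

$$M = \begin{bmatrix} a & b & & & \\ b & a & b & & \\ & b & a & \ddots & \\ & & \ddots & \ddots & b\\ & & & b & a\end{bmatrix} \tag{10.108a}$$

$$a = 2\left(1-\frac{\varepsilon^2\omega^2}{4}\right),\qquad b = -\left(1+\frac{\varepsilon^2\omega^2}{4}\right) \tag{10.108b}$$

M is a symmetric or more generally hermitian matrix and therefore can be
diagonalized by a unitary matrix U. Then

$$y^TMy = y^TU^{\dagger}M_DUy = z^TM_Dz = \sum_{j=1}^{N-1}\lambda_jz_j^2,\qquad z = Uy \tag{10.109}$$

The Jacobian for the change of the variables from $y_j$ to $z_j$ is unity, because the
transformation is unitary. The integration over all $z_j$ gives

$$\int\prod_{j=1}^{N-1}dz_j\,\exp\left[-\sum_{j=1}^{N-1}\lambda_jz_j^2\right] = \prod_{j=1}^{N-1}\left(\frac{\pi}{\lambda_j}\right)^{1/2} = \left[\frac{\pi^{N-1}}{\det M}\right]^{1/2} \tag{10.110}$$

Substituting Eq. (10.110) back into Eq. (10.106) gives

$$\begin{aligned}
K_{\rm osc}(f,i) &= \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\left(\frac{2i\hbar\varepsilon}{m}\right)^{(N-1)/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]}\int dy_1\cdots dy_{N-1}\,\exp[-y^TMy]\\
&= \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\,\varepsilon\det M}\right)^{1/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]}
\end{aligned} \tag{10.111}$$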

The action has an interaction term, but the result is still governed by the classical 
path. This is because the Lagrangian contains only up to quadratic terms. In that 
case the path integral over the quantum fluctuation only changes the normalization 
factor, which appears as $\sqrt{1/(\varepsilon\det M)}$. It is clear that the path integral can be defined
only if the matrix M does not have any vanishing eigenvalues. 

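It remains to evaluate the determinant of the matrix. Denoting the determinant
of a j × j submatrix as $D_j$, and inspecting Eq. (10.108a), we see it satisfies the
following recursion relations:

$$D_{j+1} = aD_j-b^2D_{j-1} \tag{10.112a}$$

$$D_{-1} = 0,\qquad D_0 = 1,\qquad D_1 = a \tag{10.112b}$$

We note that we can reformulate the recursion relations also in a simple matrix
form as

$$\begin{bmatrix}D_{j+1}\\ D_j\end{bmatrix} = \begin{bmatrix}a & -b^2\\ 1 & 0\end{bmatrix}\begin{bmatrix}D_j\\ D_{j-1}\end{bmatrix} \tag{10.113}$$

Successive insertion gives

$$\begin{bmatrix}D_{N-1}\\ D_{N-2}\end{bmatrix} = \begin{bmatrix}a & -b^2\\ 1 & 0\end{bmatrix}^{N-2}\begin{bmatrix}D_1\\ D_0\end{bmatrix} \tag{10.114}$$

We can determine eigenvalues of the basic 2 × 2 matrix in Eq. (10.114) easily. From

$$\det\begin{bmatrix}a-\lambda & -b^2\\ 1 & -\lambda\end{bmatrix} = 0 \tag{10.115}$$

we obtain

$$\lambda_{\pm} = \frac{1}{2}\left(a\pm\sqrt{a^2-4b^2}\right) \tag{10.116a}$$

$$a = \lambda_++\lambda_- \tag{10.116b}$$

A similarity matrix to diagonalize the basic matrix is also easily obtained:

$$S = \begin{bmatrix}c\lambda_+ & d\lambda_-\\ c & d\end{bmatrix},\qquad S^{-1} = \frac{1}{\lambda_+-\lambda_-}\begin{bmatrix}1/c & -\lambda_-/c\\ -1/d & \lambda_+/d\end{bmatrix} \tag{10.117}$$

where c and d are arbitrary nonzero constants, so that

$$S^{-1}\begin{bmatrix}a & -b^2\\ 1 & 0\end{bmatrix}S = \begin{bmatrix}\lambda_+ & 0\\ 0 & \lambda_-\end{bmatrix} \tag{10.118}$$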

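Inserting this in Eq. (10.114), we obtain

$$\begin{bmatrix}D_{N-1}\\ D_{N-2}\end{bmatrix} = S\begin{bmatrix}\lambda_+^{N-2} & 0\\ 0 & \lambda_-^{N-2}\end{bmatrix}S^{-1}\begin{bmatrix}a\\ 1\end{bmatrix} \tag{10.119}$$

Inserting Eq. (10.117) and Eq. (10.116) in Eq. (10.119)

$$D_{N-1} = \frac{\lambda_+^N-\lambda_-^N}{\lambda_+-\lambda_-} \tag{10.120}$$

This formula is consistent with Eqs. (10.112). Using Eq. (10.116a) and Eq. (10.108b)

$$\lambda_+-\lambda_- = \sqrt{a^2-4b^2} = (-4\varepsilon^2\omega^2)^{1/2} = 2i\varepsilon\omega \tag{10.121a}$$

$$\lambda_+^N = \left[\frac{2\left(1-\frac{\varepsilon^2\omega^2}{4}\right)+2i\varepsilon\omega}{2}\right]^N = \left[1+i\varepsilon\omega+O(\varepsilon^2)\right]^N \tag{10.121b}$$

$$\lambda_-^N = \left[\frac{2\left(1-\frac{\varepsilon^2\omega^2}{4}\right)-2i\varepsilon\omega}{2}\right]^N = \left[1-i\varepsilon\omega+O(\varepsilon^2)\right]^N \tag{10.121c}$$

Substituting Eqs. (10.121) in Eq. (10.120), we obtain

$$\begin{aligned}
\lim_{\varepsilon\to 0}\varepsilon D_{N-1} &= \lim_{\varepsilon\to 0}\varepsilon\,\frac{\lambda_+^N-\lambda_-^N}{\lambda_+-\lambda_-} = \lim_{N\to\infty}\frac{1}{2i\omega}\left[(1+i\varepsilon\omega)^N-(1-i\varepsilon\omega)^N\right]\\
&= \lim_{N\to\infty}\frac{1}{2i\omega}\left[\left(1+\frac{i\omega T}{N}\right)^N-\left(1-\frac{i\omega T}{N}\right)^N\right] = \frac{\sin\omega T}{\omega}
\end{aligned} \tag{10.122}$$

We obtain the propagator for the harmonic oscillator as

$$K_{\rm osc}(f,i) = \lim_{\substack{N\to\infty\\ \varepsilon\to 0}}\left(\frac{m}{2\pi i\hbar\,\varepsilon\det M}\right)^{1/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]} = \left(\frac{m\omega}{2\pi i\hbar\sin\omega T}\right)^{1/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]},\qquad T = t_f-t_i \tag{10.123}$$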
Note all the potentially singular terms have disappeared and the end result is finite. 
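The limit in Eq. (10.122) is easy to probe numerically by iterating the recursion of Eq. (10.112) directly. The following is a minimal sketch, assuming the coefficients a and b of Eq. (10.108b); the function name and the sample values of ω, T and N are illustrative choices.

```python
import math


def eps_times_det(omega, T, N):
    """Return eps * D_{N-1} from the recursion D_{j+1} = a D_j - b^2 D_{j-1}.

    a and b are the tridiagonal entries of Eq. (10.108b), with eps = T / N.
    """
    eps = T / N
    a = 2.0 * (1.0 - (eps * omega) ** 2 / 4.0)
    b = -(1.0 + (eps * omega) ** 2 / 4.0)
    d_prev, d_curr = 1.0, a              # D_0 and D_1
    for _ in range(N - 2):               # advance from D_1 to D_{N-1}
        d_prev, d_curr = d_curr, a * d_curr - b * b * d_prev
    return eps * d_curr


omega, T = 1.3, 2.0
for N in (10, 100, 1000, 20000):
    print(N, eps_times_det(omega, T, N), math.sin(omega * T) / omega)
```

For ω → 0 the recursion gives $D_{N-1} = N$, so $\varepsilon D_{N-1} = T$, which is the free-particle normalization of Eq. (10.95).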

Problem 10.4 

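Show that for $L = \frac{1}{2}m\dot q^2-\frac{1}{2}m\omega^2q^2$ with boundary condition $q(t_i) = q_i$, $q(t_f) = q_f$

$$e^{\frac{i}{\hbar}S[q_{\rm cl}]} = \exp\left[\frac{im\omega}{2\hbar\sin\omega T}\left\{(q_f^2+q_i^2)\cos\omega T-2q_fq_i\right\}\right] \tag{10.124a}$$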

-«-(=)''' (10 . mb) 



(2) Operator Method 

It is a well-known fact that a linear operator 5) has matrix representations and thus
the matrix method developed in the previous section can easily be extended to the
case of operators. Here, we calculate the determinant of the hermitian operator
explicitly, which has more applications later in field theory. The important part of
the path integral for the harmonic oscillator has the form

$$I = \int\mathcal{D}\eta\,\exp\left[i\int_{t_i}^{t_f}dt\,(\dot\eta^2-\omega^2\eta^2)\right] \tag{10.126}$$

By partial integration, it can be rewritten as

$$I = \int\mathcal{D}\eta\,\exp\left[i\int_{t_i}^{t_f}dt\,\eta(t)\left(-\frac{d^2}{dt^2}-\omega^2\right)\eta(t)\right] \tag{10.127}$$

5) If an operator satisfies the relation

$$\hat O\,(a|\psi_1\rangle+b|\psi_2\rangle) = a\,\hat O|\psi_1\rangle+b\,\hat O|\psi_2\rangle \tag{10.125}$$

it is said to be linear.

To handle the operator as a matrix we introduce two mathematical tricks. 

1) First we have to make sure the integral is well behaved, i.e. convergent. This is
made possible when the function has a certain property so that we can analytically
continue into the complex plane by rotating the time axis $t\to te^{-i\theta}$. This is called
Wick rotation. The technique is widely used, and when we set the rotation angle
$\theta = \pi/2$, or $t\to -i\tau$, and correspondingly $d^2/dt^2\to -d^2/d\tau^2$, we have moved
from the Minkowski space to the Euclidean space. Namely,

$$-S_E(x,\tau) = iS_M(x,-i\tau) \tag{10.128}$$

The sense of the rotation is completely fixed by the singularity structure of the
theory. We shall discuss it further in Sect. 10.6 when we define the generating
functional. Then we have the Euclidean action

$$S_E = \int_{\tau_i}^{\tau_f}d\tau\,\eta(\tau)\left(-\frac{d^2}{d\tau^2}+\omega^2\right)\eta(\tau) = \int_{\tau_i}^{\tau_f}d\tau\,\eta(\tau)A\,\eta(\tau) \tag{10.129}$$

and the path integral

$$I = \int\mathcal{D}\eta\,e^{-S_E} \tag{10.130}$$

becomes a well-behaved integral as long as $S_E$ is positive definite. After performing
the calculation, we can go back to the original Minkowski space by setting $\tau$ back
to $it$.
2) To show the matrix nature of the operator A, we start as usual from the discrete 
time interval. 

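If we divide the Euclidean time interval into N segments and put

$$\varepsilon = \varepsilon' = \frac{\tau_f-\tau_i}{N},\qquad \eta_j = \eta(j\varepsilon),\qquad \eta_k = \eta(k\varepsilon'),\qquad \frac{d^2}{d\tau^2}\eta(\tau) = \frac{1}{\varepsilon^2}(\eta_{j+1}-2\eta_j+\eta_{j-1}) \tag{10.131}$$

The time integral is expressed as the continuous limit of the discrete sum

$$\begin{aligned}
S_E &= -\sum_j\varepsilon\,\eta_j\left[\frac{1}{\varepsilon^2}(\eta_{j+1}-2\eta_j+\eta_{j-1})-\omega^2\eta_j\right]\\
&= \sum_{j,k}\varepsilon\varepsilon'\,\eta_jA_{jk}\eta_k = \sum\varepsilon\varepsilon'\,\boldsymbol{\eta}^TA\,\boldsymbol{\eta}
\end{aligned} \tag{10.132}$$

where

$$A_{jk} = \frac{1}{\varepsilon'}\left[-\frac{1}{\varepsilon^2}(\delta_{j+1,k}-2\delta_{jk}+\delta_{j-1,k})+\omega^2\delta_{jk}\right] \tag{10.133}$$

In the last line of Eq. (10.132) we used bold letters for $\boldsymbol{\eta}$ to emphasize its vector
property $\boldsymbol{\eta} = (\eta_1,\cdots,\eta_N)$. In the continuous limit, this gives

$$S_E = \int d\tau\,\eta^T(\tau)A(\tau)\eta(\tau) = \int d\tau\,d\tau'\,\eta^T(\tau)A(\tau,\tau')\eta(\tau') \tag{10.134}$$

where we have used the relation

$$\lim_{\varepsilon'\to 0}\frac{\delta_{jk}}{\varepsilon'} = \delta(\tau-\tau') \tag{10.135}$$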

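In this form the operator is clearly seen to be a matrix whose elements are specified
by two continuous variables. Referring to Eq. (10.133), $A(\tau,\tau')$ is a symmetric
operator and in the proper basis satisfies the eigenequation

$$A\,\phi_j(\tau) = \lambda_j\phi_j(\tau) \tag{10.136}$$

As the eigenfunctions constitute a complete basis, the variable $\eta$ can be expanded
in terms of $\phi_j$:

$$\eta(\tau) = \sum_ja_j\phi_j(\tau) \tag{10.137}$$

The boundary condition $\eta(\tau_i) = \eta(\tau_f) = 0$ translates to $\phi_j(\tau_i) = \phi_j(\tau_f) = 0$.
The normalized eigenfunction is easily obtained as

$$\phi_j(\tau) = \sqrt{\frac{2}{\tau_f-\tau_i}}\,\sin\frac{j\pi(\tau-\tau_i)}{\tau_f-\tau_i},\qquad j = 1,\cdots,N-1 \tag{10.138a}$$

$$\lambda_j = \left(\frac{j\pi}{\tau_f-\tau_i}\right)^2+\omega^2 \tag{10.138b}$$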

We note that the eigenvalues are positive definite and ensure the convergence of 
the Euclidean integral. Note, also, that if we assume the time integral is discretized 
into N intervals as before, there are N — 1 intermediate time points and there can 
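be just as many (N − 1) independent coefficients $a_j$ in the Fourier expansion in
Eq. (10.137). Therefore it can be considered as an orthogonal transformation to
diagonalize the matrix A. Equation (10.137) is still valid if $N\to\infty$ and remains an
orthogonal transformation that keeps the value of the Jacobian at unity. When we
change the path integral variable from $\eta$ to $a_j$, we obtain

$$\int d\tau\,(\eta^TA\,\eta) = \int d\tau\,d\tau'\sum_{j,k}a_j\phi_j(\tau)A(\tau,\tau')a_k\phi_k(\tau') = \sum_j\lambda_ja_j^2$$

$$I = \int\mathcal{D}\eta\,e^{-S_E} = \int\prod_jda_j\,\exp\left[-\sum_j\lambda_ja_j^2\right] = C(\det A)^{-1/2} \tag{10.139}$$

The determinant of the hermitian operator is expressed formally as

$$\begin{aligned}
\det\left(-\frac{d^2}{d\tau^2}+\omega^2\right) &= \det A = \prod_j\lambda_j\\
&= \prod_j\left(\frac{j\pi}{\tau_f-\tau_i}\right)^2\prod_j\left[1+\left(\frac{\omega(\tau_f-\tau_i)}{j\pi}\right)^2\right]\\
&= C'\,\frac{\sinh\omega(\tau_f-\tau_i)}{\omega(\tau_f-\tau_i)} \quad 6)
\end{aligned}$$

C, C' are divergent constants that do not contain dynamical variables and can be
absorbed in the normalization constant. Analytically continuing this back to real
time, we obtain

$$\det\left(-\frac{d^2}{d\tau^2}+\omega^2\right)\xrightarrow{\;\tau = it\;}\det\left(-\frac{d^2}{dt^2}-\omega^2\right) = N'^{-2}\,\frac{\sin\omega(t_f-t_i)}{\omega(t_f-t_i)} \tag{10.142}$$

where N' is a normalization constant. It may be divergent, but as long as it is
independent of dynamical variables, the actual value is irrelevant for physical outcomes,
as will be clarified in later discussions. As a result we obtain

$$\int\mathcal{D}\eta\,\exp\left[i\int_{t_i}^{t_f}dt\,\eta(t)\left(-\frac{d^2}{dt^2}-\omega^2\right)\eta(t)\right] = N'\left[\frac{\sin\omega(t_f-t_i)}{\omega(t_f-t_i)}\right]^{-1/2} \tag{10.143}$$

6) We used the formulae

$$\prod_{n=1}^{\infty}\left(1+\frac{x^2}{n^2\pi^2}\right) = \frac{\sinh x}{x},\qquad \prod_{n=1}^{\infty}\left(1-\frac{x^2}{n^2\pi^2}\right) = \frac{\sin x}{x}$$

The constant N' can be determined by noting that, in the limit $\omega\to 0$, the
propagator should agree with that of the free particle. The final result is

$$K_{\rm osc}(f,i) = \left[\frac{m\omega}{2\pi i\hbar\sin\omega(t_f-t_i)}\right]^{1/2}e^{\frac{i}{\hbar}S[q_{\rm cl}]} \tag{10.144}$$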

which agrees with what we obtained using the matrix method. Our result was ob- 
tained for the harmonic oscillator, but the result can be extended to general quadrat- 
ic actions of the form 



$$I = \int\mathcal{D}\eta\,\exp\left[i\int_{t_i}^{t_f}dt\,\eta^T(t)A\,\eta(t)\right] = C(\det A)^{-1/2} \tag{10.145}$$

So far, we have used only real variables. The path integral can be generalized to
complex variables. We note

$$\frac{2\pi}{a} = \int dx\,e^{-ax^2/2}\int dy\,e^{-ay^2/2} = \int dx\,dy\,e^{-a(x^2+y^2)/2} \tag{10.146}$$

Introducing a complex variable $z = (x+iy)/\sqrt{2}$, we write

$$\frac{2\pi}{a} = \int dz\,dz^*\,e^{-az^*z} \tag{10.147}$$

Note, z and z* are independent variables. Generalizing to n complex variables and
writing $z = (z_1,z_2,\ldots)$,

$$\int d^nz\,d^nz^*\,e^{-z^{\dagger}Az} = \frac{(2\pi)^n}{\det A} \tag{10.148}$$

In this case, it goes without saying that the N-dimensional symmetric matrix
should be replaced by the hermitian matrix. Generalization of the path integral
formula Eq. (10.145) to complex variables is now straightforward. Introducing the
complex variable $\varphi(t) = (\eta+i\xi)/\sqrt{2}$

$$I = \int\mathcal{D}\varphi^*\mathcal{D}\varphi\,\exp\left[i\int_{t_i}^{t_f}dt\,\varphi^{\dagger}(t)A\,\varphi(t)\right] = \frac{C}{\det A} \tag{10.149}$$

where A is a hermitian operator. 
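The infinite products used in the operator method (footnote 6) converge slowly but steadily, which can be seen with a few lines of code. The sketch below is illustrative only: the function name, the truncation order and the sample argument are arbitrary choices. It evaluates truncated versions of $\prod_n(1\pm x^2/n^2\pi^2)$ and compares them with $\sinh x/x$ and $\sin x/x$.

```python
import math


def truncated_product(x, n_terms, sign=+1):
    """Product over n = 1..n_terms of (1 + sign * x^2 / (n pi)^2).

    sign = +1 approaches sinh(x)/x, sign = -1 approaches sin(x)/x.
    """
    product = 1.0
    for n in range(1, n_terms + 1):
        product *= 1.0 + sign * (x / (n * math.pi)) ** 2
    return product


x = 1.7
for n_terms in (10, 1000, 200000):
    print(n_terms,
          truncated_product(x, n_terms, +1), math.sinh(x) / x,
          truncated_product(x, n_terms, -1), math.sin(x) / x)
```

The relative error of the truncated product falls off roughly like $x^2/(\pi^2 n)$, so the eigenvalue product only makes sense after the divergent, ω-independent factor has been split off, as done in the text.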



10.5 

Scattering Matrix 

In this section, we try to calculate the scattering amplitude when we have a scattering
potential in the Hamiltonian. In general, the propagator is not exactly calculable
and we rely on the perturbation expansion. This is valid when the potential or,
more precisely, the time integral of V(tq) is small compared with $\hbar$.

10.5.1

Perturbation Expansion 



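The propagator is given by Eq. (10.56)

$$\begin{aligned}
K(t_fq_f;t_iq_i) &= C\int\mathcal{D}q\,\exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f}dt\left\{\frac{1}{2}m\dot q^2-V(q)\right\}\right]\\
&= \lim_{N\to\infty}\left(\frac{m}{2\pi i\hbar\varepsilon}\right)^{N/2}\int\cdots\int dq_1\cdots dq_{N-1}\,\exp\left[\frac{i\varepsilon}{\hbar}\sum_{j=1}^{N}\{T_j-V_j\}\right]
\end{aligned} \tag{10.150}$$

where $T_j$ and $V_j$ are the kinetic and potential part of the Lagrangian at $t = t_j$. As
the action in the path integral is a classical action, the Lagrangian can be separated
into the kinetic and potential terms. We then expand the potential part

$$\exp\left(-\frac{i}{\hbar}\int_{t_i}^{t_f}dt\,V(tq)\right) = 1-\frac{i\varepsilon}{\hbar}\sum_{j}V(t_j,q_j)+\frac{1}{2!}\left[-\frac{i\varepsilon}{\hbar}\sum_{k}V(t_k,q_k)\right]^2+\cdots \tag{10.151}$$

When Eq. (10.151) is substituted back into Eq. (10.150), we obtain a series expansion

$$K = K_0+K_1+K_2+\cdots \tag{10.152}$$

The first term does not contain V and is the free propagator:

$$K_0 = C\int\mathcal{D}q\,\exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f}dt\,\frac{1}{2}m\dot q^2\right] = \theta(t_f-t_i)\left[\frac{m}{2\pi i\hbar(t_f-t_i)}\right]^{1/2}\exp\left[\frac{im(q_f-q_i)^2}{2\hbar(t_f-t_i)}\right] \tag{10.153}$$

$K_1$ includes only one $V(t_k,q_k)$ and is expressed as

$$\begin{aligned}
K_1 &= -\frac{i}{\hbar}\lim_{N\to\infty}\sum_{k=1}^{N-1}\varepsilon\int dq_k\left[C^{N-k}\int dq_{k+1}\cdots dq_{N-1}\,\exp\left\{\frac{im}{2\hbar\varepsilon}\sum_{j=k+1}^{N}(q_j-q_{j-1})^2\right\}\right]V(t_k,q_k)\\
&\qquad\times\left[C^{k}\int dq_1\cdots dq_{k-1}\,\exp\left\{\frac{im}{2\hbar\varepsilon}\sum_{j=1}^{k}(q_j-q_{j-1})^2\right\}\right]\\
&= -\frac{i}{\hbar}\int_{t_i}^{t_f}dt\int_{-\infty}^{\infty}dq\,K_0(t_fq_f;tq)\,V(tq)\,K_0(tq;t_iq_i)
\end{aligned} \tag{10.154}$$

where $C = \sqrt{m/2\pi i\hbar\varepsilon}$. As $K_0(t_fq_f;tq)$ vanishes if $t > t_f$, and $K_0(tq;t_iq_i)$ if $t < t_i$,
the integration over time can be extended to all values of t to give

$$K_1 = -\frac{i}{\hbar}\int_{-\infty}^{\infty}dt\int_{-\infty}^{\infty}dq\,K_0(t_fq_f;tq)\,V(tq)\,K_0(tq;t_iq_i) \tag{10.155}$$

Similarly, we can prove that the second-order term is expressed as

$$K_2 = \left(-\frac{i}{\hbar}\right)^2\int_{-\infty}^{\infty}dt_1\,dt_2\int_{-\infty}^{\infty}dq_1\,dq_2\,\left[K_0(t_fq_f;t_2q_2)\,V(t_2q_2)\,K_0(t_2q_2;t_1q_1)\,V(t_1q_1)\,K_0(t_1q_1;t_iq_i)\right] \tag{10.156}$$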

Analogous expressions hold for all Kj in the expansion Eq. (10.152). Note, there 
is no 1/2! in Eq. (10.156). The reason for this is that the two interactions occur at 
different times, but they are integrated over all space-time and they are indistin- 
guishable. The effect is merely to compensate 1/2!. Similarly, there are j ! different 
combinations that give identical contributions, and no 1/j ! appears in the expres- 
sion for Kj . Equation (10.152) is now expressed as 

$$\begin{aligned}
K(t_fq_f;t_iq_i) &= K_0(t_fq_f;t_iq_i)-\frac{i}{\hbar}\int dt\int dq\,K_0(t_fq_f;tq)\,V(tq)\,K_0(tq;t_iq_i)+\cdots\\
&= K_0(t_fq_f;t_iq_i)-\frac{i}{\hbar}\int dt\int dq\,K_0(t_fq_f;tq)\,V(tq)\,K(tq;t_iq_i)
\end{aligned} \tag{10.157}$$

The first equation is the perturbation expansion of the propagator, called the Born
series. The second equation is obtained by replacing $K_0$ in the second term by
K, which is equivalent to including all the successive terms. It is exact and is an
integral equation for the propagator.
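The relation between the Born series and the integral equation in Eq. (10.157) can be illustrated with a finite-dimensional toy model. In the sketch below, everything is an assumption made for illustration: the integrals over t and q are replaced by products of 2 × 2 matrices, the entries of `K0` and `V` are arbitrary small numbers, and `COUPLING` plays the role of $-i/\hbar$ with $\hbar = 1$. Iterating $K\leftarrow K_0+cK_0VK$ reproduces the partial sums of the series term by term, and the converged result satisfies the integral equation.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def matadd(a, b):
    return [[x + y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def max_abs_diff(a, b):
    return max(abs(x - y) for row_a, row_b in zip(a, b) for x, y in zip(row_a, row_b))


# Toy stand-ins (hypothetical values) for the free propagator and the potential.
K0 = [[0.9 + 0.0j, 0.1j], [0.1j, 0.8 + 0.0j]]
V = [[0.2, 0.05], [0.05, -0.1]]
COUPLING = -1j  # plays the role of -i/hbar with hbar = 1


def born_partial_sum(order):
    """K0 + c K0 V K0 + c^2 K0 V K0 V K0 + ... up to the given order."""
    term = K0
    total = K0
    for _ in range(order):
        term = scale(COUPLING, matmul(matmul(K0, V), term))
        total = matadd(total, term)
    return total


def iterate_integral_equation(steps):
    """Fixed-point iteration of K = K0 + c K0 V K, starting from K = K0."""
    K = K0
    for _ in range(steps):
        K = matadd(K0, scale(COUPLING, matmul(matmul(K0, V), K)))
    return K


def residual(K):
    """How badly K violates the toy version of the integral equation."""
    return max_abs_diff(K, matadd(K0, scale(COUPLING, matmul(matmul(K0, V), K))))


print(max_abs_diff(born_partial_sum(40), iterate_integral_equation(40)))
print(residual(born_partial_sum(2)), residual(born_partial_sum(40)))
```

The residual shrinks with the order only because the toy potential is weak; this mirrors the condition stated at the start of the section that the time integral of V must be small compared with $\hbar$ for the expansion to be useful.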

We have elaborated on using the path integral method to derive the perturbation 
series. In fact, it can be derived directly from Eq. (10.35). Separating the potential 
from the Hamiltonian, $H = H_0+V$, it can be rewritten as

$$\left(i\hbar\frac{\partial}{\partial t_1}-H_0\right)K(t_1q_1;t_2q_2) = i\hbar\,\delta(t_1-t_2)\,\delta(q_1-q_2)+V(t_1q_1)\,K(t_1q_1;t_2q_2) \tag{10.158}$$

Since $K_0$ satisfies the same equation with V = 0, the equation can be transformed
to an integral equation that reproduces Eq. (10.157).

Using the propagator, we can now express the time development of a wave function.
Inserting Eq. (10.157) in Eq. (10.40),

$$
\begin{aligned}
\psi(t_f q_f) &= \int dq_i\, K(t_f q_f; t_i q_i)\,\psi(t_i q_i) \\
&= \int dq_i\, K_0(t_f q_f; t_i q_i)\,\psi(t_i q_i)
 - \frac{i}{\hbar}\int dt\,dq \int dq_i\, K_0(t_f q_f; tq)\,V(tq)\,K(tq; t_i q_i)\,\psi(t_i q_i) \\
&= \int dq_i\, K_0(t_f q_f; t_i q_i)\,\psi(t_i q_i)
 - \frac{i}{\hbar}\int dt\,dq\, K_0(t_f q_f; tq)\,V(tq)\,\psi(tq)
\end{aligned}
\tag{10.159}
$$

Note that the time sequence is such that $t_f > t > t_i$, but the integral can be made
over the entire time region from $-\infty$ to $+\infty$, because the existence of the retarded
propagator guarantees the vanishing of the integrand for $t > t_f$ or $t < t_i$. Here
we have changed from one space dimension to three. This is an integral equation
for the wave function. Now we assume that for $t \to -\infty$, $\psi$ becomes a free plane
wave function, which we denote as $\phi$. Then, the first term of the rhs of Eq. (10.159)
is also a plane wave, because the free propagation of a plane wave is also a plane
wave:

$$
\psi(t_f q_f) = \phi(t_f q_f) - \frac{i}{\hbar}\int dt\,d^3q\, K_0(t_f q_f; tq)\,V(tq)\,\psi(tq)
\tag{10.160}
$$

As $\psi(tq)$ obeys the Schrödinger equation, namely

$$
\left(i\hbar\frac{\partial}{\partial t} - H_0\right)\psi(tq) = V(tq)\,\psi(tq)
\tag{10.161}
$$

where $H_0$ is the free Hamiltonian and $\phi(tq)$ is the solution to the free equation
($V = 0$), the propagator must obey the equation

$$
\left(i\hbar\frac{\partial}{\partial t} - H_0\right) K_0(tq; t'q') = i\hbar\,\delta(t - t')\,\delta^3(q - q')
$$

This is the equation for the Green's function of Eq. (10.161), which reproduces
Eq. (10.35).

Problem 10.5 

Using the free-particle propagator Eq. (10.82) or Eq. (10.95), prove that 



Extension to three dimensions is straightforward. 



10.5.2 

S-Matrix in the Path Integral 7)

Now we want to apply our results to more practical situations, namely how to relate
the wave function to experimental measurements. In a typical scattering experiment
a bunch of particles is provided with a definite initial condition, say, fixed
momentum $p_i$; they illuminate a target, and the probability that they are scattered
to a specific final state, at momentum $p_f$, is measured. The incident and scattered
particles are considered free in the distant past $t \to -\infty$ or in the far future
$t \to +\infty$. We represent the initial- and final-state wave functions of particles with
momentum $p_i$ and $p_f$ by

$$
\psi_{\rm in}(tq) = N e^{\frac{i}{\hbar}(p_i\cdot q - E_i t)}
\tag{10.164a}
$$

$$
\psi_{\rm out}(tq) = N e^{\frac{i}{\hbar}(p_f\cdot q - E_f t)}
\tag{10.164b}
$$

7) Compare the discussion here with that in Sects. 6.1-6.3.

One problem with the expressions is that the plane wave spreads over all space
and time, including the scattering center $V(tq)$, so the particles can never be free.
We could use a wave packet formalism to circumvent the problem but usually this
involves complicated calculation. The conventional way around the problem is the
adiabatic treatment, in which the potential is switched on and off slowly as a function
of time. Namely, we replace $V$ by $Ve^{-\varepsilon|t|}$ so that $V = 0$ at $t = \pm\infty$. 8) Assuming
that is the case, and with the initial conditions $\psi = \psi_{\rm in}$, $V \to 0$ for $|t| \to \infty$,
the wave function at time $t = t_f$ is expressed as



$$
\psi^+(t_f q_f) = \int dq_i\, K_0(t_f q_f; t_i q_i)\,\psi_{\rm in}(t_i q_i)
 - \frac{i}{\hbar}\int dt\,dq\, K_0(t_f q_f; tq)\,V(tq)\,\psi^+(tq)
$$

This is the same equation as Eq. (10.159) except that $\psi(t_i q_i)$ in the first term
is replaced by $\psi_{\rm in}$, with the implicit assumption that $|V| \to 0$ at $t \to -\infty$. The
superscript on $\psi^+(tq)$ denotes that it corresponds to a wave that was free at
$t \to -\infty$, which is the result of the retarded nature of the propagator such that
$K_0(tq; t'q') = 0$ for $t < t'$. To eliminate $\psi^+$ on the rhs, we make successive
iterations, using the first term as the first input to $\psi^+$ in the second term. The
result is

$$
\begin{aligned}
\psi^+(t_f q_f) &= \int dq_i\, K_0(t_f q_f; t_i q_i)\,\psi_{\rm in}(t_i q_i) \\
&\quad - \frac{i}{\hbar}\int dt\,dq\,dq_i\, K_0(t_f q_f; tq)\,V(tq)\,K_0(tq; t_i q_i)\,\psi_{\rm in}(t_i q_i) + \cdots
\end{aligned}
$$

8) This is equivalent to introducing a small negative imaginary part for the energy, i.e. $E \to E - i\varepsilon$,
which in turn is Feynman's recipe for handling the singularity in the propagator.




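Figure 10.3 First- and second-order scattering diagrams.

We are interested in the transition amplitude from the state $|\psi_{\rm in}\rangle$ at $t = -\infty$ to
$|\psi_{\rm out}\rangle$ at $t = \infty$, which is called the scattering amplitude. It is defined as

$$
\begin{aligned}
S_{fi} &= \langle\psi_{\rm out}|\psi^+\rangle_{t_f\to\infty} = \int dq_f\, \langle\psi_{\rm out}|t_f q_f\rangle\langle t_f q_f|\psi^+\rangle
 = \int dq_f\, \psi^*_{\rm out}(t_f q_f)\,\psi^+(t_f q_f) \\
&= \int dq_f\,dq_i\, \psi^*_{\rm out}(t_f q_f)\,K_0(t_f q_f; t_i q_i)\,\psi_{\rm in}(t_i q_i) \\
&\quad - \frac{i}{\hbar}\int dt\,dq_f\,dq\,dq_i\, \psi^*_{\rm out}(t_f q_f)\,K_0(t_f q_f; tq)\,V(tq)\,K_0(tq; t_i q_i)\,\psi_{\rm in}(t_i q_i) \\
&\quad + \left(-\frac{i}{\hbar}\right)^2\int dt_1\,dt_2\,dq_f\,dq_2\,dq_1\,dq_i\, \psi^*_{\rm out}(t_f q_f)\,K_0(t_f q_f; t_2 q_2)\,V(t_2 q_2)\,K_0(t_2 q_2; t_1 q_1) \\
&\qquad\qquad \times V(t_1 q_1)\,K_0(t_1 q_1; t_i q_i)\,\psi_{\rm in}(t_i q_i) + \cdots
\end{aligned}
\tag{10.167}
$$

where $\cdots$ are higher order terms. The formula can be depicted as in Fig. 10.3.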

1. Attach $\psi_{\rm in}(t_i, q_i)$ at point $(t_i, q_i)$.

2. Draw a line from $(t_i, q_i)$ to $(t, q)$. Attach $K_0(tq; t_i q_i)$ to the line.

3. Attach the vertex $[(-i/\hbar)\,V(t, q)]$ to the point $(t, q)$.

4. Draw a line from $(t, q)$ to $(t_f, q_f)$. Attach $K_0(t_f q_f; tq)$ to the line.

5. Attach $\psi^*_{\rm out}(t_f, q_f)$ at point $(t_f, q_f)$.

6. Integrate over time $t$ and over all the intermediate positions $q_i$, $q$, $q_f$.

This completes the first-order scattering process. The corresponding picture is
drawn in Fig. 10.3 (left). The second-order scattering process can be drawn in a
similar manner. The ordering of the factors is such that in writing each factor one
goes from the bottom upwards. In the nonrelativistic treatment, one does not go
downwards (backwards in time) in Fig. 10.3. The correspondence is called the Feynman
rules. 9) In nonrelativistic quantum mechanics, which we are dealing with at
present, the pictorial representations shown in Fig. 10.3 are hardly necessary, but
in quantum field theory they are a great aid to calculation.

As Problem 10.5 shows,

$$
\begin{aligned}
\int dq_i\, K_0(t_f q_f; t_i q_i)\,\psi_{\rm in}(t_i q_i) &= \psi_{\rm in}(t_f q_f) \\
\int dq_f\, \psi^*_{\rm out}(t_f q_f)\,K_0(t_f q_f; t_i q_i) &= \psi^*_{\rm out}(t_i q_i)
\end{aligned}
\tag{10.168}
$$

9) In field theory, particles with negative energy can be formally interpreted as going backwards in
time. The relativistic propagator includes both retarded and advanced propagation. One
can go upwards or downwards freely in time and need not worry about the time sequence. The
Feynman propagator takes care of it automatically (see discussion in Sect. 6.6.2).

Using Eq. (10.168), we can rewrite Eq. (10.167). The first term gives simply
$\delta^3(p_f - p_i)$, which means no scattering. The interesting part is included in the second and
following terms. Here we write only up to the second term, which is first order
in the power expansion of the potential $V$:

$$
\begin{aligned}
S_{fi} &= \delta^3(p_f - p_i) - \frac{i}{\hbar}\int dt\,dq\, \psi^*_{\rm out}(tq)\,V(tq)\,\psi_{\rm in}(tq) \\
&= \delta^3(p_f - p_i) - \frac{i}{\hbar}N^2\int dt\,dq\, e^{-\frac{i}{\hbar}(p_f\cdot q - E_f t)}\,V(tq)\,e^{\frac{i}{\hbar}(p_i\cdot q - E_i t)}
\end{aligned}
\tag{10.169}
$$

where $N$ is the normalization factor. For the time-independent Coulomb potential,
the time integration merely gives a $\delta$ function. Thus, the second term gives

$$
S_{fi} - \delta_{fi} = -2\pi i\,\delta(E_f - E_i)\,N^2\int e^{\frac{i}{\hbar}(p_i - p_f)\cdot q}\,V(q)\,dq
\tag{10.170}
$$

This is the well-known Born approximation in quantum mechanics.
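The spatial integral in Eq. (10.170) is the Fourier transform of the potential at the momentum transfer. As a hedged numerical sketch, the code below evaluates it for a screened Coulomb potential $V(r) = g\,e^{-\mu r}/r$ and compares with the closed form $4\pi g/(k^2 + \mu^2)$, where $k = |p_i - p_f|/\hbar$. The screening parameter `MU`, the coupling `G`, the cutoff `r_max` and all function names are assumptions made for this illustration; the pure Coulomb case corresponds to the limit $\mu \to 0$.

```python
import math

def born_form_factor(potential, k, r_max, steps=200_000):
    """Approximate the integral of exp(i k.r) V(r) over all space for a
    spherically symmetric V, using (4*pi/k) * int_0^r_max r sin(k r) V(r) dr
    evaluated with Simpson's rule."""
    if steps % 2:
        steps += 1  # Simpson's rule needs an even number of intervals
    h = r_max / steps

    def integrand(r):
        # r*V(r) stays finite for a 1/r potential; sin(k*r) vanishes at r = 0
        return r * math.sin(k * r) * potential(r) if r > 0.0 else 0.0

    total = integrand(0.0) + integrand(r_max)
    for n in range(1, steps):
        total += (4 if n % 2 else 2) * integrand(n * h)
    return 4.0 * math.pi / k * total * h / 3.0

G, MU = 1.0, 0.5  # hypothetical coupling and screening parameter

def screened_coulomb(r):
    return G * math.exp(-MU * r) / r

def exact_form_factor(k):
    """Closed form of the same integral for the screened Coulomb potential."""
    return 4.0 * math.pi * G / (k * k + MU * MU)

if __name__ == "__main__":
    for k in (0.5, 1.3, 3.0):
        print(k, born_form_factor(screened_coulomb, k, r_max=80.0),
              exact_form_factor(k))
```

The $1/(k^2 + \mu^2)$ dependence is what leads to the familiar Rutherford-type angular distribution once the amplitude is squared.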

Problem 10.6 
Prove Eqs. (10.169). 



10.6 

Generating Functional 

10.6.1 

Correlation Functions 

So far we have considered the transition amplitude in the form of a path integral: 
$$
\langle t_f q_f|t_i q_i\rangle = N\int\mathcal{D}q\, e^{\frac{i}{\hbar}S[q]}
\tag{10.171}
$$

Later on, we shall need matrix elements of operator products such as

$$
\begin{aligned}
D_{12} &= \langle t_f q_f|\hat q(t_1)\,\hat q(t_2)|t_i q_i\rangle \\
&= \int\mathcal{D}q\; q_1 q_2 \exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f} dt\, L(q)\right], \qquad t_f > t_1 > t_2 > t_i
\end{aligned}
\tag{10.172}
$$

where the boundary conditions on the path integral are

$$
q_f = q(t_f), \quad q_i = q(t_i), \quad q_1 = q(t_1), \quad q_2 = q(t_2)
\tag{10.173}
$$

We can show that it is related to the two-point vacuum correlation function

$$
\langle\Omega|T[\hat q(t_1)\,\hat q(t_2)]|\Omega\rangle
\tag{10.174}
$$

where $|\Omega\rangle$ is the vacuum state in the Heisenberg picture. We add a hat, e.g. $\hat q$, when
we want to distinguish operators from their representations $q$, which are ordinary
numbers. To evaluate Eq. (10.172) we break up the functional integral as follows:

$$
\int\mathcal{D}q = \int dq_1\int dq_2 \int_{q(t_1)=q_1,\; q(t_2)=q_2}\mathcal{D}q, \qquad t_f > t_1 > t_2 > t_i
\tag{10.175}
$$

The main functional integral $\int\mathcal{D}q$ is now constrained at times $t_1$ and $t_2$ (in addition
to the endpoints $t_f$ and $t_i$). There $q_1 = q(t_1)$ and $q_2 = q(t_2)$ are held constant. Their
integration has to be done separately and is placed in front of the main functional
integral. Then the numbers $q_1$, $q_2$ can be taken outside the main integral. The
main integral is now divided into three transition amplitudes:

$$
\begin{aligned}
D_{12} &= \int dq_1\int dq_2\; q_1 q_2 \int_{q(t_1)=q_1,\; q(t_2)=q_2}\mathcal{D}q\, \exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f} dt\, L\right] \\
&= \int dq_1\int dq_2\; q_1 q_2\, \langle t_f q_f|t_1 q_1\rangle\langle t_1 q_1|t_2 q_2\rangle\langle t_2 q_2|t_i q_i\rangle \\
&= \int dq_1\int dq_2\; q_1 q_2\, \langle q_f|e^{-\frac{i}{\hbar}H(t_f - t_1)}|q_1\rangle\langle q_1|e^{-\frac{i}{\hbar}H(t_1 - t_2)}|q_2\rangle\langle q_2|e^{-\frac{i}{\hbar}H(t_2 - t_i)}|q_i\rangle
\end{aligned}
\tag{10.176}
$$

We can turn the c-number $q_1$ into a Schrödinger operator using $q_1|q_1\rangle = \hat q_S|q_1\rangle$
[see Eq. (10.36b)]. The completeness relation $\int dq_1\,|q_1\rangle\langle q_1| = 1$ allows us to eliminate
the intermediate state $|q_1\rangle\langle q_1|$. The $q_2$ integration can be eliminated similarly,
yielding the expression

$$
D_{12} = \langle q_f|e^{-\frac{i}{\hbar}H(t_f - t_1)}\,\hat q_S\, e^{-\frac{i}{\hbar}H(t_1 - t_2)}\,\hat q_S\, e^{-\frac{i}{\hbar}H(t_2 - t_i)}|q_i\rangle
\tag{10.177}
$$

The Schrödinger operator can be converted into the Heisenberg operator using the
relation (10.36a). For the case $t_2 > t_1$, $q_1$ enters the path integral earlier than $q_2$.
Therefore the ordering of the operators $\hat q(t_1)\hat q(t_2)$ in $D_{12}$ [Eq. (10.172)] should be
reversed. Using a time-ordered product we can combine the two cases into one to
give

$$
D_{12} = \langle q_f|e^{-\frac{i}{\hbar}Ht_f}\,T[\hat q_H(t_1)\,\hat q_H(t_2)]\,e^{\frac{i}{\hbar}Ht_i}|q_i\rangle = \langle t_f q_f|T[\hat q_H(t_1)\,\hat q_H(t_2)]|t_i q_i\rangle
\tag{10.178}
$$

We see that the path integral has a built-in structure to give the time-ordered correlation
function as its moments. A similar formula holds for arbitrary functions of
the operators:

$$
\langle t_f q_f|T[O_1(\hat q_H(t_1))\cdots O_N(\hat q_H(t_N))]|t_i q_i\rangle
 = N\int\mathcal{D}q\; O_1(q(t_1))\cdots O_N(q(t_N))\, e^{\frac{i}{\hbar}S[q]}
$$



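The salient feature of the transition amplitude is that we can extract the vacuum
state by sending $t_i$, $t_f$ to $\mp\infty$ in a slightly imaginary direction ($t \to t' = te^{-i\delta}$).
This will extract the vacuum state $|\Omega\rangle$ from $|q_f\rangle$ and $|q_i\rangle$, because we can get rid
of all the $n \neq 0$ states by making $t_f$, $t_i$ large with a small negative imaginary part,
$t_f, t_i \to \pm\infty(1 - i\delta)$:

$$
\begin{aligned}
\lim_{t_i\to-\infty(1-i\delta)} e^{\frac{i}{\hbar}Ht_i}|q_i\rangle
&= \lim_{t_i\to-\infty(1-i\delta)}\left[e^{\frac{i}{\hbar}E_0 t_i}|\Omega\rangle\langle\Omega|q_i\rangle + \sum_{n\neq 0} e^{\frac{i}{\hbar}E_n t_i}|n\rangle\langle n|q_i\rangle\right] \\
&\Rightarrow \lim\; |\Omega\rangle\left(\langle\Omega|q_i\rangle\, e^{-\frac{i}{\hbar}E_0\cdot\infty(1-i\delta)}\right)
\end{aligned}
\tag{10.180a}
$$

where $E_0 = \langle\Omega|H|\Omega\rangle$. Any state $|n\rangle$ for $n \neq 0$ has a larger damping factor than
the $n = 0$ state and hence the ground state is the only surviving state after $|t_i|$ is
made large. Similarly, it is shown that

$$
\lim_{t_f\to\infty(1-i\delta)} \langle q_f|e^{-\frac{i}{\hbar}Ht_f}
 \Rightarrow \lim\; \left(\langle q_f|\Omega\rangle\, e^{-\frac{i}{\hbar}E_0\cdot\infty(1-i\delta)}\right)\langle\Omega|
\tag{10.180b}
$$

Using Eqs. (10.180b) and (10.180a), the time-ordered correlation function is expressed
as the vacuum expectation value:

$$
\begin{aligned}
&\lim_{t_f,t_i\to\pm\infty(1-i\delta)} \langle t_f q_f|T[\hat q(t_1)\hat q(t_2)]|t_i q_i\rangle \\
&\quad = \left(\langle q_f|\Omega\rangle\, e^{-\frac{i}{\hbar}E_0\cdot\infty(1-i\delta)}\right)\langle\Omega|T[\hat q(t_1)\hat q(t_2)]|\Omega\rangle\left(\langle\Omega|q_i\rangle\, e^{-\frac{i}{\hbar}E_0\cdot\infty(1-i\delta)}\right)
\end{aligned}
\tag{10.181}
$$

$$
\therefore\quad \langle\Omega|T[\hat q(t_1)\hat q(t_2)]|\Omega\rangle
 = \lim_{t_f,t_i\to\pm\infty(1-i\delta)} \frac{\langle t_f q_f|T[\hat q(t_1)\hat q(t_2)]|t_i q_i\rangle}{\langle q_f|\Omega\rangle\langle\Omega|q_i\rangle\, e^{-\frac{2i}{\hbar}E_0\cdot\infty(1-i\delta)}}
\tag{10.182}
$$

The lhs is independent of the endpoints and therefore the rhs must also be independent
of the endpoints. The awkward extra phase and overlap factors can be
eliminated by dividing by the same expression as $D_{12}$ but without $q_1$ and $q_2$. Our
final expression for the n-point correlation function is

$$
\langle\Omega|T[\hat q_H(t_1)\cdots\hat q_H(t_n)]|\Omega\rangle
 = \lim_{t_f,t_i\to\pm\infty(1-i\delta)} \frac{\int\mathcal{D}q\; q_1\cdots q_n \exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f} dt\, L\right]}{\int\mathcal{D}q\, \exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f} dt\, L\right]}
\tag{10.183}
$$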
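The way the tilted time limit isolates the ground state can be made concrete with a few energy levels. The following sketch is an illustration only: it assumes harmonic-oscillator-like levels $E_n = n + 1/2$ (with $\hbar = \omega = 1$), equal overlaps $\langle n|q_i\rangle$ for the five lowest states, and a hypothetical helper `excited_state_fraction`. It evaluates the weights of the terms in Eq. (10.180a) at $t_i = -T(1 - i\delta)$.

```python
import cmath

def excited_state_fraction(big_t, delta, overlaps, energies, hbar=1.0):
    """Fraction of the norm of exp(i H t_i / hbar)|q_i> carried by the n != 0
    states, for t_i = -big_t * (1 - i*delta).  Each energy eigenstate picks up
    the factor exp(i E_n t_i / hbar) = exp(-i E_n T) * exp(-E_n T delta)."""
    t_i = -big_t * (1.0 - 1j * delta)
    amplitudes = [c * cmath.exp(1j * e * t_i / hbar)
                  for c, e in zip(overlaps, energies)]
    weights = [abs(a) ** 2 for a in amplitudes]
    return 1.0 - weights[0] / sum(weights)

# Assumed toy spectrum and overlaps (not taken from the text).
ENERGIES = [n + 0.5 for n in range(5)]
OVERLAPS = [5 ** -0.5] * 5

if __name__ == "__main__":
    for big_t in (0.0, 10.0, 100.0, 400.0):
        print(big_t,
              excited_state_fraction(big_t, 0.0, OVERLAPS, ENERGIES),
              excited_state_fraction(big_t, 0.05, OVERLAPS, ENERGIES))
```

With $\delta = 0$ the factors are pure phases and the excited-state fraction stays at its initial value, whereas any $\delta > 0$ suppresses the state $n$ relative to the ground state by $e^{-2(E_n - E_0)T\delta}$.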

10.6.2 

Note on Imaginary Time 

Tilting the time axis is a mathematical trick called the Wick rotation (Fig. 10.4a).
The direction of the rotation is mandated by the requirement that there are no
singularities of the integrand encountered in the rotation of the time axis. This is
the case for

$$
0 < \delta < \pi
\tag{10.184}
$$

For larger angles, the rhs of Eq. (10.180) develops essential singularities for $t_i \to -\infty$
and $t_f \to \infty$. This rotation corresponds to an analytic continuation in the
variable $t$. The real four-dimensional Minkowski space can be viewed as a subspace
of a four-dimensional complex space (or of a five-dimensional space with three
real space coordinates and one complex time coordinate). For $\delta = \pi/2$, this corresponds
to $t = -i\tau$ ($\tau$ real) and the space becomes Euclidean. Physical results are
obtained by evaluating the relevant expressions in Euclidean time because their oscillatory
behavior is converted to an adequate damping factor and is well behaved.
Then after the calculation, one can rotate the result ($\tau = it$) back to the real time
axis (Minkowski time $t$) to obtain physically meaningful results. The method works
well only if no singularities are encountered on the way.

We note that the end result of the Wick rotation and rotation back to Minkowski
time is equivalent to addition of a small negative imaginary part in the energy. This
produces the proper damping of the oscillating exponentials to make the integral
converge. We know from our study of the Green's function in the complex energy
plane (Fig. 10.4b) that the singularities have to occur at $\pm\omega \mp i\varepsilon$ to incorporate
the causal boundary conditions (see Sect. 6.6.2). Therefore, an analytic continuation
from Re($k^0$) to Im($k^0$) is meaningful only if the rotation is counterclockwise. The
two rotations are equivalent since we can represent $k^0 \to i\hbar\,\partial/\partial t$.

Furthermore, the Wick rotation also rationalizes the treatment of the scattering 
matrix in terms of plane waves, which are not local. The interaction is made to 
be turned on and off slowly without any energy transfer (i.e. adiabatic approxima- 
tion) so that the notion of an asymptotically free field at large time is justified (see 
arguments in Sect. 10.5.2). All the reasonings stem from the constraint that the 
transition amplitude has to be finite at large time. In the following, we shall not 
always indicate it explicitly, but it should be understood as being done. 






Figure 10.4 (a) Rotation of the time axis to ensure the convergence of the path integral. (b)
Rotation in the complex energy plane to incorporate the causal boundary condition.

10.6.3

Correlation Functions as Functional Derivatives 

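Next, we show that we can generate various correlation functions by simply adding
an appropriate external source. For instance, a charged particle generates an electromagnetic
field, and its interaction is given by $A_\mu J^\mu$. The current $J^\mu$ plays the
role of the source to generate the interactions. We define a modified action of the
form

$$
S[q, J] = S[q] + \int_{t_i}^{t_f} dt\, q(t)\,J(t)
\tag{10.185}
$$

and further define

$$
Z[J] = \langle\Omega|\Omega\rangle_J = N\int\mathcal{D}q\, e^{\frac{i}{\hbar}S[q,J]}
$$

where $N$ is a normalization constant to make $Z[0] = 1$. We know how to rotate
the time axis in the definition of $Z[J]$ to isolate the vacuum state. Equivalently, we
may add a small negative imaginary part to the Hamiltonian. Adding $i\varepsilon q^2$ to the
Lagrangian will achieve this:

$$
Z[J] = N\int\mathcal{D}q\, \exp\left[\frac{i}{\hbar}\int_{t_i}^{t_f} dt\,(L + Jq + i\varepsilon q^2)\right]
\tag{10.187}
$$

Then

$$
\frac{\delta Z[J]}{\delta J(t_1)} = N\int\mathcal{D}q\, \frac{i}{\hbar}\frac{\delta S[q,J]}{\delta J(t_1)}\, e^{\frac{i}{\hbar}S[q,J]}
 = N\int\mathcal{D}q\, \frac{i}{\hbar}\,q(t_1)\, e^{\frac{i}{\hbar}S[q,J]}
\tag{10.188}
$$

Therefore,

$$
\left.\frac{\delta Z[J]}{\delta J(t_1)}\right|_{J=0} = N\int\mathcal{D}q\, \frac{i}{\hbar}\,q(t_1)\, e^{\frac{i}{\hbar}S[q]} = \frac{i}{\hbar}\,N\langle\Omega|\hat q(t_1)|\Omega\rangle
\tag{10.189}
$$

Repeating the functional differentiation, we have

$$
\left.\frac{\delta^2 Z[J]}{\delta J(t_1)\,\delta J(t_2)}\right|_{J=0} = \left(\frac{i}{\hbar}\right)^2 N\langle\Omega|T[\hat q(t_1)\hat q(t_2)]|\Omega\rangle
$$

In general, we have

$$
\left.\frac{\delta^n Z[J]}{\delta J(t_1)\cdots\delta J(t_n)}\right|_{J=0} = \left(\frac{i}{\hbar}\right)^n N\langle\Omega|T[\hat q(t_1)\cdots\hat q(t_n)]|\Omega\rangle
$$

Then we have

$$
Z[J] = \sum_{n=0}^{\infty}\frac{1}{n!}\left(\frac{i}{\hbar}\right)^n\int dt_1\cdots dt_n\, J(t_1)\cdots J(t_n)\,\langle\Omega|T[\hat q(t_1)\cdots\hat q(t_n)]|\Omega\rangle
\tag{10.192}
$$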

Namely, $Z[J]$ is a generating functional of n-point time-ordered correlation functions
or the Green's functions in the vacuum. It is known as the vacuum functional
or the generating functional of the vacuum Green's functions. Using Eqs. (10.192)
and (10.182), the time-ordered n-point function is expressed as

$$
\langle\Omega|T[\hat q(t_1)\cdots\hat q(t_n)]|\Omega\rangle = \left(\frac{\hbar}{i}\right)^n \left.\frac{1}{Z[J]}\frac{\delta^n Z[J]}{\delta J(t_1)\cdots\delta J(t_n)}\right|_{J=0}
\tag{10.193}
$$

The usefulness of Eq. (10.193) lies in the fact that any scattering amplitude can
be converted to n-point vacuum expectation values using the reduction formula (see
Sect. 11.5). Therefore, if one knows all the vacuum Green's functions, one can
construct the S-matrix of the theory and therefore solve the problem. In quantum
field theory, these correlation functions play a central role.

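It is convenient to introduce another related functional $W[J]$ defined by

$$
Z[J] = e^{\frac{i}{\hbar}W[J]} \quad\text{or}\quad W[J] = -i\hbar\ln Z[J]
\tag{10.194}
$$

Then taking the functional derivative, we obtain

$$
\left.\frac{\delta W[J]}{\delta J(t_1)}\right|_{J=0} = \langle\Omega|\hat q(t_1)|\Omega\rangle \equiv \langle\hat q(t_1)\rangle
\tag{10.195}
$$

where $\langle\cdots\rangle$ means the vacuum expectation value. We have learned that in the case
of free particles and harmonic oscillators, the path integral for the transition amplitude
is proportional to the exponential of the classical action [Eqs. (10.89), (10.123)].
For this reason $W[J]$ is also called the effective action. The advantage of using
$W[J]$ over $Z[J]$ is that it gives statistical deviations from the mean values. This can
be shown by further differentiating Eq. (10.195):

$$
\begin{aligned}
\frac{\hbar}{i}\left.\frac{\delta^2 W[J]}{\delta J(t_1)\,\delta J(t_2)}\right|_{J=0}
&= \left(\frac{\hbar}{i}\right)^2\left[\frac{1}{Z[J]}\frac{\delta^2 Z[J]}{\delta J(t_1)\,\delta J(t_2)} - \frac{1}{Z^2[J]}\frac{\delta Z[J]}{\delta J(t_1)}\frac{\delta Z[J]}{\delta J(t_2)}\right]_{J=0} \\
&= \langle T[\hat q(t_1)\hat q(t_2)]\rangle - \langle\hat q(t_1)\rangle\langle\hat q(t_2)\rangle \\
&= \langle T[(\hat q(t_1) - \langle\hat q(t_1)\rangle)(\hat q(t_2) - \langle\hat q(t_2)\rangle)]\rangle
\end{aligned}
\tag{10.196}
$$

We recognize this as the second-order deviation from the mean. The third-order
mean deviation is given by

$$
\begin{aligned}
\left(\frac{\hbar}{i}\right)^2\left.\frac{\delta^3 W[J]}{\delta J(t_1)\,\delta J(t_2)\,\delta J(t_3)}\right|_{J=0}
&= \left(\frac{\hbar}{i}\right)^3\left[\frac{1}{Z[J]}\frac{\delta^3 Z[J]}{\delta J(t_1)\,\delta J(t_2)\,\delta J(t_3)}
 - \frac{1}{Z^2[J]}\frac{\delta^2 Z[J]}{\delta J(t_1)\,\delta J(t_2)}\frac{\delta Z[J]}{\delta J(t_3)}\right. \\
&\qquad - \frac{1}{Z^2[J]}\frac{\delta^2 Z[J]}{\delta J(t_2)\,\delta J(t_3)}\frac{\delta Z[J]}{\delta J(t_1)}
 - \frac{1}{Z^2[J]}\frac{\delta^2 Z[J]}{\delta J(t_3)\,\delta J(t_1)}\frac{\delta Z[J]}{\delta J(t_2)} \\
&\qquad \left. + \frac{2}{Z^3[J]}\frac{\delta Z[J]}{\delta J(t_1)}\frac{\delta Z[J]}{\delta J(t_2)}\frac{\delta Z[J]}{\delta J(t_3)}\right]_{J=0} \\
&= \langle T[(\hat q(t_1) - \langle\hat q(t_1)\rangle)(\hat q(t_2) - \langle\hat q(t_2)\rangle)(\hat q(t_3) - \langle\hat q(t_3)\rangle)]\rangle
\end{aligned}
\tag{10.197}
$$

We can go on to higher order deviations from the mean. In quantum field theory,
$W[J]$ is known as the generating functional for the connected vacuum Green's
functions, which will be discussed later in Sect. 11.6.7.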
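The statement that $\ln Z$ generates deviations from the mean, while $Z$ generates plain moments, has a simple finite analogue. The sketch below is a stand-in under explicit assumptions: the path integral is replaced by a sum over three hypothetical values `VALUES` of a single variable $q$ with probabilities `WEIGHTS`, and the oscillating factor by a real one, so that $Z(J) = \sum_i w_i e^{Jq_i}$ and $W(J) = \ln Z(J)$ (a Euclidean-style convention without the factors of $i$ and $\hbar$). Derivatives at $J = 0$ are taken by finite differences.

```python
import math

# Hypothetical three-point distribution standing in for the path integral measure.
VALUES = [-1.0, 0.0, 2.0]
WEIGHTS = [0.2, 0.5, 0.3]

def Z(j):
    """Generating function of the moments, normalized so that Z(0) = 1."""
    return sum(w * math.exp(j * q) for w, q in zip(WEIGHTS, VALUES))

def W(j):
    """Generating function of the connected moments (cumulants)."""
    return math.log(Z(j))

def second_derivative(f, h=1e-3):
    return (f(h) - 2.0 * f(0.0) + f(-h)) / h ** 2

def third_derivative(f, h=1e-3):
    return (f(2 * h) - 2.0 * f(h) + 2.0 * f(-h) - f(-2 * h)) / (2.0 * h ** 3)

def central_moment(order):
    """Direct evaluation of <(q - <q>)^order> for comparison."""
    mean = sum(w * q for w, q in zip(WEIGHTS, VALUES))
    return sum(w * (q - mean) ** order for w, q in zip(WEIGHTS, VALUES))

if __name__ == "__main__":
    print("d2Z/dJ2 (raw second moment):", second_derivative(Z))
    print("d2W/dJ2 vs variance        :", second_derivative(W), central_moment(2))
    print("d3W/dJ3 vs third deviation :", third_derivative(W), central_moment(3))
```

The second derivative of $Z$ returns $\langle q^2\rangle$, while the same derivative of $W$ returns $\langle q^2\rangle - \langle q\rangle^2$, mirroring the structure of Eqs. (10.196) and (10.197).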



10.7 

Connection with Statistical Mechanics 

A typical application of the path integral outside the field of particle physics is 
to calculate the partition functions in statistical mechanics. Once it is given as a 
function of the temperature and volume (or pressure), the internal energy, entropy, 
pressure and other standard thermodynamical quantities are obtained by simply 
differentiating the partition function. 

In a statistical ensemble of many particles that are confined in a volume $V$ and
in thermal equilibrium with a heat bath at temperature $T$, the probability of each
particle being in a state $i$ is given by

$$
p_i = \frac{1}{Z}e^{-\beta E_i}, \qquad \beta = \frac{1}{k_B T}
\tag{10.198a}
$$

where $k_B = 8.617\times 10^{-5}\ \mathrm{eV\,K^{-1}}$ is the Boltzmann constant. The partition function
$Z$ and the Helmholtz free energy $F = U - TS$ are defined for a system in thermal
equilibrium, hence the Hamiltonian can be considered as time independent. They
are defined by

$$
Z = \sum_i e^{-\beta E_i} = \sum_i\langle i|e^{-\beta H}|i\rangle = \mathrm{Tr}[\exp(-\beta H)]
\tag{10.198b}
$$

$$
F = -\frac{1}{\beta}\ln Z = -k_B T\ln Z
\tag{10.198c}
$$

Thermodynamical variables such as entropy and pressure can be derived from the
partition function:

$$
S = -\left.\frac{\partial F}{\partial T}\right|_V = k_B\beta^2\left.\frac{\partial F}{\partial\beta}\right|_V, \qquad
P = -\left.\frac{\partial F}{\partial V}\right|_T
\tag{10.198d}
$$

Just to refresh our memory of thermodynamics, we give a derivation of the 
Helmholtz free energy in the boxed paragraph at the end of this chapter. 
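On the other hand the functional integral is used to calculate

$$
\begin{aligned}
G(x, t; y) &\equiv \langle t, x|t_i = 0, y\rangle = \langle x|\exp\left[-\frac{i}{\hbar}Ht\right]|y\rangle \\
&= \int\mathcal{D}q\,\mathcal{D}\!\left(\frac{p}{2\pi\hbar}\right)\exp\left[\frac{i}{\hbar}\int_0^t dt\,\{p\dot q - H(p, q)\}\right]
\end{aligned}
\tag{10.199}
$$

Namely, if one can calculate the propagator Eq. (10.199), all one has to do to obtain
the partition function is "to replace $it/\hbar$ with $\beta$ (or better to say analytically continue
to $t \to -i\hbar\beta$) and to take the trace". Note that the trace can be taken in any basis.
In particular, if we choose the basis in the Schrödinger picture, we can write

$$
Z = \int dq\,\langle q|e^{-\beta H}|q\rangle
\tag{10.200}
$$

If we introduce an imaginary time $\tau = it/\hbar$, denote the $\tau$ derivative of $q$ as $\dot q$ and
set $\tau_i = 0$ for simplicity, the partition function is given by

$$
Z = \int dq\,\langle q|e^{-\beta H}|q\rangle = \int dq\int\mathcal{D}q\,\mathcal{D}\!\left(\frac{p}{2\pi\hbar}\right)\exp\left[-\int_0^\beta d\tau\,\{H(p, q) - ip\dot q\}\right]
\tag{10.201a}
$$

For the special case $H(p, q) = p^2/2m + V(q)$, this becomes

$$
Z = \int dq\int\mathcal{D}q\,\exp\left[-\int_0^\beta d\tau\left\{\frac{1}{2}m\dot q^2 + V(q)\right\}\right]
\tag{10.201b}
$$

where

$$
\mathcal{D}q = \lim_{N\to\infty}\left(\frac{m}{2\pi\hbar\varepsilon}\right)^{N/2} dq_1\,dq_2\cdots dq_{N-1}
\tag{10.201c}
$$

We have already encountered this type of functional integral in Sect. 10.6.2 as the
Wick rotation.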

We add a few simple exercises in statistical mechanics just to remind the reader
of some basic uses of the partition function.

Example 1: Free Particle Using Eq. (10.201b) and substituting $t_f - t_i = -i\hbar\beta$,
$q_f = q_i$ in Eq. (10.95), we obtain

$$
Z = [(10.201b)]_{\rm free\ particle} = \int dq\,[(10.95)]_{t_f - t_i = -i\hbar\beta} = V\left(\frac{m}{2\pi\hbar^2\beta}\right)^{1/2}
\tag{10.202}
$$

where $\int dq = V$. Then the Helmholtz free energy is given by

$$
F = -\frac{1}{\beta}\ln\left[V\left(\frac{m}{2\pi\hbar^2\beta}\right)^{1/2}\right]
\tag{10.203}
$$

Using Eqs. (10.203) and (10.198d), we obtain the equation of state for the ideal gas
$PV = k_B T$.
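The step from the free energy to the equation of state can be checked numerically. The sketch below assumes the one-dimensional free-particle form $Z = V\,(m/2\pi\hbar^2\beta)^{1/2}$ in arbitrary units with $m = \hbar = k_B = 1$; the function names and the finite-difference step `dv` are choices made for this illustration. It evaluates $P = -\partial F/\partial V$ at fixed temperature and compares $PV$ with $1/\beta = k_B T$.

```python
import math

HBAR = 1.0  # arbitrary units, assumed for the illustration
MASS = 1.0

def partition_function(beta, volume):
    """Z = V * sqrt(m / (2 pi hbar^2 beta)) for one free particle in one dimension."""
    return volume * math.sqrt(MASS / (2.0 * math.pi * HBAR ** 2 * beta))

def free_energy(beta, volume):
    """Helmholtz free energy F = -(1/beta) ln Z."""
    return -math.log(partition_function(beta, volume)) / beta

def pressure(beta, volume, dv=1e-6):
    """P = -dF/dV at constant temperature, by a central finite difference."""
    return -(free_energy(beta, volume + dv) - free_energy(beta, volume - dv)) / (2.0 * dv)

if __name__ == "__main__":
    for beta, volume in ((2.0, 3.0), (0.5, 10.0)):
        print(beta, volume, pressure(beta, volume) * volume, 1.0 / beta)
```

Only the factor $V$ in $Z$ matters for the pressure, which is why the result is independent of the mass and of $\hbar$.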

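Example 2: Harmonic Oscillator

(1) Calculate the propagator for the harmonic oscillator and set $t = -i\hbar\beta$ to obtain

$$
\langle x|e^{-\beta H}|y\rangle = \left[\frac{m\omega}{2\pi\hbar\sinh(\beta\hbar\omega)}\right]^{1/2}
\exp\left[-\frac{m\omega}{2\hbar\sinh(\beta\hbar\omega)}\left\{(x^2 + y^2)\cosh(\beta\hbar\omega) - 2xy\right\}\right]
\tag{10.204}
$$

Hint: Use (10.124).

(2) Calculate the partition function and the Helmholtz free energy. Show they are
given by

$$
Z = \frac{1}{2\sinh(\beta\hbar\omega/2)}, \qquad F = \frac{1}{\beta}\ln\left[2\sinh\left(\frac{\beta\hbar\omega}{2}\right)\right]
\tag{10.205}
$$

(3) Using the energy eigenvalues of the harmonic oscillator, derive the partition
function directly by calculating Eq. (10.198b), and show it agrees with Eq. (10.205).

(4) Derive the Planck blackbody radiation formula from Eq. (10.205).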
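Parts (1) to (3) of the exercise can be cross-checked numerically before attempting the analytic derivation. The sketch below is illustrative only and assumes units $m = \hbar = 1$ by default, the standard closed form $Z = 1/[2\sinh(\beta\hbar\omega/2)]$, and the standard imaginary-time oscillator kernel on the diagonal $x = y$; the helper names, the level cutoff `n_max` and the integration range `x_max` are hypothetical choices. It computes $Z$ in three ways: from the level sum of Eq. (10.198b), from the closed form, and from the trace $\int dx\,\langle x|e^{-\beta H}|x\rangle$.

```python
import math

def z_from_levels(beta, omega, hbar=1.0, n_max=2000):
    """Direct sum over E_n = hbar*omega*(n + 1/2), truncated at n_max."""
    return sum(math.exp(-beta * hbar * omega * (n + 0.5)) for n in range(n_max))

def z_closed_form(beta, omega, hbar=1.0):
    return 1.0 / (2.0 * math.sinh(0.5 * beta * hbar * omega))

def z_from_kernel(beta, omega, mass=1.0, hbar=1.0, x_max=12.0, steps=4000):
    """Trace of the imaginary-time kernel: integrate <x|exp(-beta H)|x> over x
    with the trapezoidal rule.  On the diagonal the exponent reduces to
    -(m*omega/(hbar*sinh)) * x^2 * (cosh - 1)."""
    s = math.sinh(beta * hbar * omega)
    c = math.cosh(beta * hbar * omega)
    prefactor = math.sqrt(mass * omega / (2.0 * math.pi * hbar * s))
    a = mass * omega * (c - 1.0) / (hbar * s)
    h = 2.0 * x_max / steps
    total = 0.0
    for n in range(steps + 1):
        x = -x_max + n * h
        weight = 0.5 if n in (0, steps) else 1.0
        total += weight * math.exp(-a * x * x)
    return prefactor * total * h

def free_energy(beta, omega, hbar=1.0):
    """F = -(1/beta) ln Z, using the closed form of Z."""
    return -math.log(z_closed_form(beta, omega, hbar)) / beta

if __name__ == "__main__":
    for beta in (0.7, 1.5):
        print(beta, z_from_levels(beta, 1.0), z_closed_form(beta, 1.0),
              z_from_kernel(beta, 1.0), free_energy(beta, 1.0))
```

Agreement of the three numbers reflects the identity $\cosh u - 1 = 2\sinh^2(u/2)$, which is the key step in reducing the Gaussian trace to the closed form.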
Derivation of the Helmholtz Free Energy Having defined the partition function for
an ensemble of particles in a volume $V$ and in thermal equilibrium with a heat bath
at temperature $T$, we will prove that the Helmholtz free energy is given by

$$
F = U - TS = -k_B T\ln Z
\tag{10.206}
$$

Note that $E_i$ is a function of $V$, hence $Z$ is a function of $T$ and $V$. The internal energy $U$
of a system is given as the expectation (or average) value of the energy. Denoting the
probability of a particle in the state $i$ as $p_i$ and using $\sum_i E_i e^{-\beta E_i} = -\partial Z/\partial\beta$, the total
energy is given by

$$
U = \sum_i E_i p_i = \frac{1}{Z}\sum_i E_i e^{-\beta E_i} = -\frac{1}{Z}\frac{\partial Z}{\partial\beta} = -\left.\frac{\partial\ln Z}{\partial\beta}\right|_V
\tag{10.207}
$$

Similarly, the pressure is given as an energy change $\Delta U$ divided by an infinitesimal
volume change $\Delta V$ under isothermal conditions:

$$
P = -\sum_i\frac{\partial E_i}{\partial V}p_i = -\frac{1}{Z}\sum_i\frac{\partial E_i}{\partial V}e^{-\beta E_i} = \frac{1}{\beta}\left.\frac{\partial\ln Z}{\partial V}\right|_\beta
\tag{10.208}
$$

When the volume of the system changes, two things happen. First, each energy level
shifts slightly, and second, heat energy $dQ$ flows in from the heat bath to keep the
temperature constant. The energy increase $dU$ is given by

$$
dU = -P\,dV + dQ
\tag{10.209}
$$

Noting $d(\beta U) = \beta\,dU + U\,d\beta$, we can calculate $dQ/T$ from

$$
\frac{dQ}{T} = k_B\beta\,(dU + P\,dV) = k_B\left[d(\beta U) - U\,d\beta + \beta P\,dV\right]
\tag{10.210}
$$

Using Eqs. (10.207) and (10.208), we obtain

$$
\frac{dQ}{T} = k_B\left[d(\beta U) + \frac{\partial\ln Z}{\partial\beta}d\beta + \frac{\partial\ln Z}{\partial V}dV\right]
\tag{10.211}
$$

The rhs is a total derivative. Therefore writing $dQ/T = dS$, $k_B\beta = 1/T$ and integrating
we have

$$
S = \frac{U}{T} + k_B\ln Z(T, V) \equiv \frac{U - F}{T}, \quad\text{or}\quad F = U - TS
\tag{10.212}
$$

which proves Eq. (10.206). Using $F = -(\ln Z)/\beta$, $U = -\partial\ln Z/\partial\beta|_V = \partial(\beta F)/\partial\beta|_V$,
we can rewrite the entropy as

$$
S = \frac{U - F}{T} = k_B\beta\left(\left.\frac{\partial(\beta F)}{\partial\beta}\right|_V - F\right) = k_B\beta^2\left.\frac{\partial F}{\partial\beta}\right|_V = -\left.\frac{\partial F}{\partial T}\right|_V
\tag{10.213}
$$

From Eqs. (10.212), (10.207) and (10.198a), it can also be shown that

$$
S = -k_B\sum_i p_i\ln p_i
\tag{10.214}
$$

which is the well-known expression for the entropy.
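The chain of identities in Eqs. (10.212) to (10.214) can be verified on a small system. The sketch below is an illustration with assumptions: it uses a hypothetical three-level spectrum `LEVELS`, units with $k_B = 1$, and a finite-difference step `d_t`. It compares the entropy obtained as $(U - F)/T$, as $-k_B\sum_i p_i\ln p_i$, and as $-\partial F/\partial T$.

```python
import math

K_B = 1.0                  # units with k_B = 1 (assumption for the illustration)
LEVELS = [0.0, 1.0, 2.5]   # hypothetical energy levels E_i

def thermodynamics(temperature):
    """Return (Z, U, F, S_thermo, S_gibbs) for the toy spectrum."""
    beta = 1.0 / (K_B * temperature)
    boltzmann = [math.exp(-beta * e) for e in LEVELS]
    z = sum(boltzmann)
    probabilities = [b / z for b in boltzmann]
    u = sum(p * e for p, e in zip(probabilities, LEVELS))
    f = -K_B * temperature * math.log(z)
    s_thermo = (u - f) / temperature                              # S = (U - F)/T
    s_gibbs = -K_B * sum(p * math.log(p) for p in probabilities)  # S = -k_B sum p ln p
    return z, u, f, s_thermo, s_gibbs

def entropy_from_derivative(temperature, d_t=1e-5):
    """S = -dF/dT at fixed volume, by a central finite difference."""
    f_plus = thermodynamics(temperature + d_t)[2]
    f_minus = thermodynamics(temperature - d_t)[2]
    return -(f_plus - f_minus) / (2.0 * d_t)

if __name__ == "__main__":
    for temperature in (0.3, 0.8, 5.0):
        _, _, _, s_thermo, s_gibbs = thermodynamics(temperature)
        print(temperature, s_thermo, s_gibbs, entropy_from_derivative(temperature))
```

At high temperature the three values approach $k_B\ln 3$, the entropy of three equally populated levels.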



11 

Path Integral Approach to Field Theory 



Applications of the path integral method to field theory are given in this chapter. 
Scattering amplitudes of QED processes treated in Chap. 7 and the self-energy of 
the photon and electron already considered in Chap. 8 are rederived. 



From Particles to Fields 

Up to now we have discussed the one-particle system. We can extend the path
integral method to systems of particles with many degrees of freedom. Consider
a system of $N$ particles having coordinates $q^r$ ($r = 1, \cdots, N$). Then the transition
amplitude is expressed as

$$
\langle t_f q_f|t_i q_i\rangle \equiv \langle t_f\, q_f^1, \cdots, q_f^N|t_i\, q_i^1, \cdots, q_i^N\rangle = N\int\prod_r\mathcal{D}q^r(t)\, e^{\frac{i}{\hbar}S[q]}, \qquad
q_f^r = q^r(t_f), \quad q_i^r = q^r(t_i)
\tag{11.1}
$$

The path integral on the rhs is now $N$-fold. The action has the form

$$
S[q] = \int_{t_i}^{t_f} dt\sum_r L(q^r, \dot q^r)
\tag{11.2}
$$

and the integration is over all paths starting at $q_i^r$ at $t = t_i$ and ending at $q_f^r$ at
$t = t_f$. The field $\phi(x) = \phi(t, \boldsymbol{x})$ can be considered as an infinite set of $q^r$ indexed
by the position $\boldsymbol{x}$, namely

$$
q^r(t) \to \phi(t, \boldsymbol{x})
$$

To define the path integral, instead of dividing just time into segments, we divide
space and time, and the Minkowski space is broken down into four-dimensional
cubes of volume $\varepsilon^4$. Inside each cube at coordinate $(t_j, x_k, y_l, z_m)$, $\phi$ is taken to be

Elementary Particle Physics, Volume 1: Quantum Field Theory and Particles. Yorikiyo Nagashima 
Copyright © 2010 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 
ISBN: 978-3-57- '" 




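a constant:

$$
\phi(j, k, l, m) = \phi(t_j, x_k, y_l, z_m)
\tag{11.4}
$$

Derivatives are approximated by, for example,

$$
\left.\frac{\partial\phi}{\partial x}\right|_{j,k,l,m} \simeq \frac{\phi(t_j, x_k + \varepsilon, y_l, z_m) - \phi(t_j, x_k, y_l, z_m)}{\varepsilon}
$$

We replace the four indices $(j, k, l, m)$ formally by the single index $a$ and write

$$
\mathcal{L}\{\phi(t_j, x_k, y_l, z_m), \partial_\mu\phi(t_j, x_k, y_l, z_m)\} = \mathcal{L}(\phi_a, \partial_\mu\phi_a) \equiv \mathcal{L}_a
\tag{11.6a}
$$

$$
S[\phi] = \int d^4x\,\mathcal{L} \simeq \sum_a\varepsilon^4\mathcal{L}_a
\tag{11.6b}
$$

The path integral of the field is defined as

$$
\int\mathcal{D}\phi\,\exp\left[\frac{i}{\hbar}\int d^4x\,\mathcal{L}\right]
 = \lim_{N\to\infty}\int\prod_{a=1}^{N} d\phi_a\,\exp\left[\frac{i}{\hbar}\sum_{a=1}^{N}\varepsilon^4\mathcal{L}_a\right]
$$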



Real Scalar Field 



11.2.1 

Generating Functional 



Now we consider the real scalar field $\phi(x) = \phi(t, \boldsymbol{x})$, which obeys the Klein-Gordon
equation. From now on, we adopt natural units and omit $\hbar$. We continue
to consider that the field is interacting with an external field $J(x)$. Then the
classical equation that $\phi_{\rm cl}$ satisfies is

$$
[\partial_\mu\partial^\mu + m^2 - i\varepsilon]\,\phi_{\rm cl} = J(x)
\tag{11.8}
$$

We have introduced a small negative imaginary part to the mass squared to ensure
that the path integral is well behaved, as we discussed in Sect. 10.6.2.
Feynman's Green's function is defined as one that obeys

$$
\begin{aligned}
[\partial_\mu\partial^\mu + m^2 - i\varepsilon]\,\Delta_F(x - y) &= -\delta^4(x - y) \\
\Delta_F(x - y) = -[\partial_\mu\partial^\mu + m^2 - i\varepsilon]^{-1}\,\delta^4(x - y) &= \int\frac{d^4q}{(2\pi)^4}\frac{e^{-iq\cdot(x - y)}}{q^2 - m^2 + i\varepsilon}
\end{aligned}
\tag{11.9}
$$

The second equation is written to demonstrate that $\Delta_F$ is formally defined as the
inverse of the Klein-Gordon operator. The prescription of how to handle the singularity
of the Green's function is dictated by the small negative imaginary part of
the mass, which reproduces the recipe discussed in Sect. 6.6.2 to choose positive
energy for particles going forward in time and negative energy for backward-going
particles. The vacuum-to-vacuum amplitude is given by

$$
Z_0[J] = N\int\mathcal{D}\phi\,\exp\left[i\int d^4x\left(\frac{1}{2}\left\{\partial_\mu\phi\,\partial^\mu\phi - (m^2 - i\varepsilon)\phi^2\right\} + J\phi\right)\right]
\tag{11.10}
$$

We have attached the subscript "0" to $Z$ to denote that we have not included the
interaction of the scalar field apart from the artificial external source and hence
the field can be considered free except for the external source $J$. The notation $Z[J]$ is saved
for future use to denote the generating functional of interacting fields. We remind
the reader that $\phi$ in the path integral is not the field satisfying Eq. (11.8), but the
integration variable, which can have any value. We modify the action slightly, using
the identity

$$
\int d^4x\,\partial_\mu\phi\,\partial^\mu\phi = \int d^4x\,\partial_\mu(\phi\,\partial^\mu\phi) - \int d^4x\,\phi\,\partial_\mu\partial^\mu\phi
\tag{11.11}
$$

The first term is a surface integral and vanishes if $\phi \to 0$ at infinity. Then $Z_0[J]$ is
expressed as

$$
Z_0[J] = N\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\left\{\frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi - J\phi\right\}\right]
\tag{11.12}
$$

To re-formulate the path integral, we use the fact that $\phi$ is an integration variable
and we are free to shift the reference path. We choose the reference path as the
solution to Eq. (11.8):

$$
\phi(x) \to \phi'(x) = \phi(x) + \phi_{\rm cl}(x) = \phi(x) - \int d^4y\,\Delta_F(x - y)\,J(y)
\tag{11.13}
$$

The action in Eq. (11.12) can be rewritten as

$$
\begin{aligned}
&-i\int d^4x\left[\frac{1}{2}\phi'\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi' - J\phi'\right] \\
&\quad = -i\int d^4x\left[\frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi
 + \frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi_{\rm cl}
 + \frac{1}{2}\phi_{\rm cl}\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi\right. \\
&\qquad\qquad\left. + \frac{1}{2}\phi_{\rm cl}\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi_{\rm cl} - J\phi_{\rm cl} - J\phi\right]
\end{aligned}
\tag{11.14}
$$

The second and third terms on the rhs give the same contribution when integrated
over $x$ and are canceled by the last term because of Eq. (11.8). The fourth term
is $\frac{1}{2}J\phi_{\rm cl}$, again because of Eq. (11.8). Then using Eq. (11.13), $\int d^4x\,J\phi_{\rm cl}$ can be
rewritten as $-\iint d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)$.

Collecting all the terms, we can now write the action as

$$
S[\phi] = -\frac{1}{2}\int d^4x\,\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi - \frac{1}{2}\int d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)
\tag{11.15}
$$

The second term does not contain $\phi$, hence it can be taken out of the functional
integral. $Z_0[J]$ now takes the form

$$
Z_0[J] = N\exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)\right]
 \times\int\mathcal{D}\phi\,\exp\left[-\frac{i}{2}\int d^4x\,\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi\right]
\tag{11.16}
$$

The second factor, formally giving $(\det[i(\partial_\mu\partial^\mu + m^2 - i\varepsilon)])^{-1/2}$, is a constant because
it is integrated over all functions $\phi$. We will calculate $\det[i(\partial_\mu\partial^\mu + m^2 - i\varepsilon)]$
later (see Sect. 11.2.2). Here we simply denote it by $N^{-1}$. So the final expression
for $Z_0[J]$ is

$$
Z_0[J] = \exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)\right]
\tag{11.17}
$$

Alternatively, the above equation can be derived using the generalized Gaussian
integral. Note that just as integration of quadratic polynomials in an exponent is
given by

$$
\int_{-\infty}^{\infty} dq\, e^{-aq^2 + bq + c} = \sqrt{\frac{\pi}{a}}\,\exp\left(\frac{b^2}{4a} + c\right)
\tag{11.18}
$$

the $N$-dimensional Gaussian integral in such a form is generally expressed as

$$
\int dy_1\cdots dy_N\, e^{-\frac{1}{2}y^T A y + B^T y + C} = \frac{(2\pi)^{N/2}}{\sqrt{\det A}}\,\exp\left(\frac{1}{2}B^T A^{-1}B + C\right)
\tag{11.19}
$$

where $y = (y_1, y_2, \cdots, y_N)$ and $A$ and $B$ are $N\times N$ and $N\times 1$ matrices. The generalization
to the path integral of the bilinear scalar field action can easily be done.
It is to be understood that the integration can be carried out in Euclidean space
and analytically continued back to give the real space-time result so that formal application
of the Gaussian formula Eq. (11.19) is valid for the oscillating integrand.
Setting

$$
A = i(\partial_\mu\partial^\mu + m^2 - i\varepsilon), \quad B = iJ, \quad C = 0
\tag{11.20}
$$

the straightforward application of Eq. (11.19) results in

$$
\begin{aligned}
Z_0[J] &= N\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\left\{\frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi - J\phi\right\}\right] \\
&= N\frac{D}{\sqrt{\det A}}\,\exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)\right]
\end{aligned}
\tag{11.21}
$$

where $D$ is a diverging constant, which can be absorbed in the normalization constant.
Equation (11.21) reproduces the result Eq. (11.17).
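The Gaussian formulas used above can be tested by brute-force quadrature in low dimensions. The sketch below is a hedged check, not a derivation: it assumes real, positive-definite coefficients (the damped Euclidean situation rather than the oscillating one), the exponent conventions $-aq^2 + bq + c$ and $-\frac{1}{2}y^TAy + B^Ty + C$, and hypothetical sample values for $a$, $b$, $c$, $A$, $B$. The one-dimensional case and the $N = 2$ case are each compared with the closed form.

```python
import math

def gaussian_1d_numeric(a, b, c, half_width=12.0, steps=4000):
    """Trapezoidal estimate of the integral of exp(-a q^2 + b q + c) over q."""
    h = 2.0 * half_width / steps
    total = 0.0
    for n in range(steps + 1):
        q = -half_width + n * h
        weight = 0.5 if n in (0, steps) else 1.0
        total += weight * math.exp(-a * q * q + b * q + c)
    return total * h

def gaussian_1d_formula(a, b, c):
    return math.sqrt(math.pi / a) * math.exp(b * b / (4.0 * a) + c)

def gaussian_2d_numeric(A, B, C, half_width=10.0, steps=400):
    """Trapezoidal estimate of the 2D integral of
    exp(-(1/2) y^T A y + B^T y + C) for a symmetric 2x2 matrix A."""
    h = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        y1 = -half_width + i * h
        w1 = 0.5 if i in (0, steps) else 1.0
        for j in range(steps + 1):
            y2 = -half_width + j * h
            w2 = 0.5 if j in (0, steps) else 1.0
            quad = A[0][0] * y1 * y1 + 2.0 * A[0][1] * y1 * y2 + A[1][1] * y2 * y2
            total += w1 * w2 * math.exp(-0.5 * quad + B[0] * y1 + B[1] * y2 + C)
    return total * h * h

def gaussian_2d_formula(A, B, C):
    """(2 pi)^(N/2) / sqrt(det A) * exp((1/2) B^T A^-1 B + C) with N = 2."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    inv = [[A[1][1] / det, -A[0][1] / det], [-A[1][0] / det, A[0][0] / det]]
    b_ainv_b = sum(B[i] * inv[i][j] * B[j] for i in range(2) for j in range(2))
    return 2.0 * math.pi / math.sqrt(det) * math.exp(0.5 * b_ainv_b + C)

A_MATRIX = [[2.0, 0.5], [0.5, 1.0]]  # hypothetical positive-definite example
B_VECTOR = [0.3, -0.4]

if __name__ == "__main__":
    print(gaussian_1d_numeric(1.3, 0.7, -0.2), gaussian_1d_formula(1.3, 0.7, -0.2))
    print(gaussian_2d_numeric(A_MATRIX, B_VECTOR, 0.0),
          gaussian_2d_formula(A_MATRIX, B_VECTOR, 0.0))
```

In the field-theory application the role of $B^TA^{-1}B$ is played by the double integral of $J\,\Delta_F\,J$, and the prefactor containing $\det A$ is the source-independent constant absorbed in the normalization.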

Normalization of the Path Integral Let us settle the question of the normalization. 
In most field theoretical questions, one is primarily interested in calculating expec- 
tation values of time-ordered operator products between the vacuum states, name- 
ly vacuum to vacuum transition rate. Consequently, the value of the normalization 
constant N is irrelevant, because we are only interested in the normalized tran- 
sition amplitude. As Z [J] is the vacuum-to-vacuum transition amplitude in the 
presence of the source J, it is sensible to normalize it to Z [J = 0] = 1. Or equiva- 
lently we can eliminate the awkward extra constant by dividing the path integral by 
Z [0], which produces exactly the same constant. This is exactly what we have done 
in defining the n-point time-ordered correlation functions in Eq. (10.183). Then 
Eq. (11.10) must be rewritten as 

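$$
\begin{aligned}
Z_0[J] &= \frac{\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\left\{\frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi - J\phi\right\}\right]}{\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\left\{\frac{1}{2}\phi\,(\partial_\mu\partial^\mu + m^2 - i\varepsilon)\,\phi\right\}\right]} \\
&= \exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\,\Delta_F(x - y)\,J(y)\right]
\end{aligned}
\tag{11.22}
$$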

We have two equivalent expressions for the generating functional. The first, in the 
form of the path integral, is convenient for defining time-ordered n-point vacuum 
correlation functions. The second does not contain the path integral, and is conve- 
nient in calculating their functional derivatives. 

11.2.2 

Calculation of det A 

As we discussed in the previous section, the actual value of det A is not important as 
long as it does not contain any dynamical variables. It can always be factored away 
by proper normalization. However, it is helpful to become familiar with the actual 
calculation of the determinant, and we do it for the case of the most familiar Klein- 
Gordon operator. Perhaps it may be of interest to mention that det A in Euclidian 
time is the partition function (see discussion in Sect. 10.7) and that it has a practical 
use in statistical mechanics: 

$$
\det A = \det[i(\partial_\mu\partial^\mu + m^2)\,\delta^4(x - y)]
\tag{11.23}
$$

As we will show, this is a divergent integral, but it does no harm as stated above. To
evaluate it we go to Euclidean space ($x^0 = -ix_E^4$) and write

$$
\begin{aligned}
S &= -\frac{1}{2}\int d^4x\,\phi(x)\,(\partial_\mu\partial^\mu + m^2)\,\phi(x) \\
&= \frac{i}{2}\int d^4x_E\,\phi(x_E)\,(-\partial_\mu\partial^\mu + m^2)_E\,\phi(x_E) \\
&\equiv \frac{i}{2}\int d^4x_E\,d^4y_E\,\phi(x_E)\,A(x_E, y_E)\,\phi(y_E)
\end{aligned}
\tag{11.24a}
$$

$$
x_E = (x_E^1, x_E^2, x_E^3, x_E^4) = (x^1, x^2, x^3, ix^0)
\tag{11.24b}
$$

where $x_E^4$ is treated as a real number. Now

$$
A(x_E, y_E) = (-\partial_\mu\partial^\mu + m^2)_E\,\delta^4(x_E - y_E)
\tag{11.25}
$$

is a matrix whose row and column indices are continuous variables. The following 
reformulation is useful: 

det A = exp[lndet A] = exp ln]~[ Aj = exp{Tr[ln A]} (11.26) 

where the A's are the eigenvalues of the matrix A. The validity of the equalities can 
be most easily seen in the diagonalized form, noting that the trace is invariant if 
the matrix is diagonalized or not. The eigenvalues of the matrix A are easily seen 
in its Fourier transform 

A - H,S" + m\ d l (, c - n ) - (-3,3" + m') c f £^t'».-l".-v.) 

fe E = (fei, hi, k\, fe 4 ) = (fe 1 , fe 2 , k\ -ik°) (11.27b) 

We see A -1 is not singular, namely A has an inverse, since the eigenvalues are 
positive definite. Of course, we knew the inverse exists. It was already given by 
Eq. (11.9) as the Feynman propagator. 
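The statements of Eqs. (11.26) and (11.27) can be checked on a finite matrix. The sketch
below is an illustration only and is not taken from the text: it assumes a one-dimensional
periodic lattice with unit spacing as a stand-in for the Euclidean operator
$(-\partial^2 + m^2)_E$, and the names `N`, `m`, `lam_fourier` and `lam_direct` are
hypothetical. On such a lattice the Fourier modes diagonalize A with eigenvalues
$4\sin^2(k/2) + m^2$, which play the role of $k_E^2 + m^2$.

```python
import numpy as np

N, m = 16, 0.7   # lattice size and mass (illustrative values, not from the text)

# Periodic lattice analog of (-d^2/dx^2 + m^2) with unit lattice spacing (assumption)
A = (2.0 + m**2) * np.eye(N)
for i in range(N):
    A[i, (i + 1) % N] -= 1.0
    A[i, (i - 1) % N] -= 1.0

# Eigenvalues read off from the Fourier transform: the lattice version of k_E^2 + m^2
k = 2.0 * np.pi * np.arange(N) / N
lam_fourier = 4.0 * np.sin(k / 2.0) ** 2 + m**2

# Eigenvalues from direct diagonalization (returned in ascending order)
lam_direct = np.linalg.eigvalsh(A)

# det A = exp{Tr[ln A]}: compare ln det A with the sum of ln(eigenvalue)
log_det_trace = float(np.sum(np.log(lam_fourier)))
sign, log_det = np.linalg.slogdet(A)
```

All eigenvalues are positive for $m \neq 0$, so the lattice matrix is invertible, in line
with the remark above. As N grows, $\ln\det A$ grows in proportion to N, which is the
lattice counterpart of the divergent volume factor met below.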

Next we prove that a function of matrices $f[A(x, y)]$ is the Fourier transform of
the same function of the Fourier-transformed matrices, namely

$$f[A(x, y)] = \int\frac{dk}{2\pi}\frac{dk'}{2\pi}\,e^{i(kx - k'y)}\,f[A(k, k')] \tag{11.28a}$$

$$A(k, k') = \int dx\,dy\,e^{-i(kx - k'y)}\,A(x, y) \tag{11.28b}$$

Proof: For simplicity we consider one dimension. In the above statement, the
function is to be understood as expandable in a Taylor series:

$$\begin{aligned}
f(A) &= c_0 + c_1 A + c_2 A^2 + \cdots + c_n A^n + \cdots\\
\text{or}\quad f[A(x, y)] &= c_0 + c_1 A(x, y) + c_2\int dz_1\,A(x, z_1)A(z_1, y) + \cdots\\
&\quad + c_n\int dz_1\cdots dz_{n-1}\,A(x, z_1)A(z_1, z_2)\cdots A(z_{n-1}, y) + \cdots
\end{aligned} \tag{11.29}$$






For n = 2 it is easy to show that

$$\int dz_1\,A(x, z_1)A(z_1, y)
 = \int\frac{dk}{2\pi}\frac{dk'}{2\pi}\,e^{i(kx - k'y)}\int\frac{dp}{2\pi}\,A(k, p)A(p, k') \tag{11.30}$$

The nth term has n − 1 integrations, yielding n − 1 δ functions, and the result is

$$\int dz_1\cdots dz_{n-1}\,A(x, z_1)A(z_1, z_2)\cdots A(z_{n-1}, y)
 = \int\frac{dk}{2\pi}\frac{dk'}{2\pi}\,e^{i(kx - k'y)}\,A^n(k, k') \tag{11.31}$$

which proves Eq. (11.28a). If $A(k, k')$ is diagonal, namely
$A(k, k') = A(k)\,(2\pi)\delta(k - k')$, or, equivalently, if $A(x, y)$ is a function of
$x - y$, Eq. (11.28a) can be expressed as

$$f[A(x, y)] = \int\frac{dk}{2\pi}\,e^{ik(x - y)}\,f[A(k)] \tag{11.32}$$

and its trace is

$$\mathrm{Tr}\big[f[A(x, y)]\big] = \int dx\,dy\,\delta(x - y)\int\frac{dk}{2\pi}\,e^{ik(x - y)}\,f[A(k)]
 = \int dx\int\frac{dk}{2\pi}\,f[A(k)] \tag{11.33}$$

Now we go back to four dimensions. Equation (11.27) tells us that

$$A(k_E, k'_E) = (k_E^2 + m^2)\,(2\pi)^4\delta^4(k_E - k'_E) \tag{11.34}$$

Then setting $f(A) = \ln A$, the determinant of the matrix A of Eq. (11.27) is given as

$$\begin{aligned}
\det A &= \exp\left[\mathrm{Tr}\ln A(x_E, y_E)\right]\\
&= \exp\left[\int d^4x_E\,d^4y_E\,\delta^4(x_E - y_E)\int\frac{d^4k_E}{(2\pi)^4}\,
 e^{ik_E\cdot(x_E - y_E)}\ln(k_E^2 + m^2)\right]\\
&= \exp\left[\int d^4x_E\int\frac{d^4k_E}{(2\pi)^4}\,\ln(k_E^2 + m^2)\right]
\end{aligned} \tag{11.35}$$

Analytically continuing back to Minkowski space by setting $x_E^4 = ix^0$, $k_E^4 = -ik^0$,
det A is given by

$$\det A = \exp\left[\int d^4x\int\frac{d^4k}{(2\pi)^4}\,\ln(-k^2 + m^2)\right] \tag{11.36}$$

As announced, this is a divergent quantity. The factor $\int d^4x$ arose from the
translational symmetry. The action S stays constant for a class of field configurations
which can be obtained from one another by symmetry operations. Given a constant integrand
the trace gives a diverging contribution, so it should be removed from the path integral.
Our definition of normalization Eq. (11.22) takes care of this.

11.2.3 

n-Point Functions and the Feynman Propagator 

In Sect. 10.6, we derived a relation between n-point functions and generating
functionals. The field theoretical version of an n-point function is

$$\langle 0|T[\hat\phi(x_1)\cdots\hat\phi(x_n)]|0\rangle
 = \left(\frac{1}{i}\right)^n\left.\frac{\delta^n Z_0[J]}{\delta J(x_1)\cdots\delta J(x_n)}\right|_{J=0} \tag{11.37}$$

which is easily obtained by taking functional derivatives of the first expression of
the generating functional Eq. (11.22). Note, here we express the vacuum state by
$|0\rangle$, which is the vacuum of the free field, because we are using $Z_0[J]$, the
generating functional of the free field. Looking at the second expression of Eq. (11.22),
we see that n-point functions can be expressed in terms of Feynman propagators.

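First we show that for a quadratic action such as that of the scalar field, the
one-point vacuum function vanishes:

$$\frac{\delta Z_0[J]}{\delta J(x_1)} = -i\int d^4z\,\Delta_F(x_1 - z)J(z)\;Z_0[J] \tag{11.38}$$

Using Eq. (11.37), this clearly gives

$$\langle 0|T[\hat\phi(x)]|0\rangle = \langle 0|\hat\phi(x)|0\rangle
 = \frac{1}{i}\left.\frac{\delta Z_0[J]}{\delta J(x)}\right|_{J=0} = 0 \tag{11.39}$$

Next, as the two-point correlation function is given by

$$\langle 0|T[\hat\phi(x_1)\hat\phi(x_2)]|0\rangle
 = \left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_0[J]}{\delta J(x_1)\,\delta J(x_2)}\right|_{J=0} \tag{11.40}$$

we obtain by differentiating Eq. (11.38)

$$\frac{\delta^2 Z_0[J]}{\delta J(x_1)\,\delta J(x_2)}
 = -\left[i\Delta_F(x_1 - x_2) + \int d^4z_1\,\Delta_F(x_1 - z_1)J(z_1)
 \int d^4z_2\,\Delta_F(x_2 - z_2)J(z_2)\right]Z_0[J] \tag{11.41}$$

After setting J = 0, we obtain

$$\langle 0|T[\hat\phi(x_1)\hat\phi(x_2)]|0\rangle
 = \left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_0[J]}{\delta J(x_1)\,\delta J(x_2)}\right|_{J=0}
 = i\Delta_F(x_1 - x_2) \tag{11.42}$$

which shows that the two-point correlation function is nothing other than the Feynman
propagator. This agrees with Eq. (6.124), of course, which was derived using
the canonical quantization method.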

11.2.4 

Wick's Theorem 

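We continue with n-point functions. We note that the generating functional $Z_0[J]$
can be Taylor expanded around $J = 0$:

$$\begin{aligned}
Z_0[J] &= Z_0[0] + \int d^4x_1\left.\frac{\delta Z_0[J]}{\delta J(x_1)}\right|_{J=0}J(x_1) + \cdots\\
&\quad + \frac{1}{n!}\int d^4x_1\cdots d^4x_n
 \left.\frac{\delta^n Z_0[J]}{\delta J(x_1)\cdots\delta J(x_n)}\right|_{J=0}J(x_1)\cdots J(x_n) + \cdots\\
&= \sum_{n=0}^{\infty}\frac{1}{n!}\int d^4x_1\cdots d^4x_n\;
 i^n\langle 0|T[\hat\phi(x_1)\cdots\hat\phi(x_n)]|0\rangle\,J(x_1)\cdots J(x_n)
\end{aligned} \tag{11.43}$$

where we have used Eq. (11.37). Another way of expressing the generating functional is

$$\begin{aligned}
Z_0[J] &= \exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\Delta_F(x - y)J(y)\right]\\
&= 1 + \sum_{k=1}^{\infty}\frac{1}{k!}\left(-\frac{i}{2}\right)^k\int d^4x_1\cdots d^4x_{2k}\;
 \Delta_{12}\Delta_{34}\cdots\Delta_{2k-1,2k}\,J_1J_2\cdots J_{2k}
\end{aligned} \tag{11.44a}$$

$$\Delta_{ij} = \Delta_F(x_i - x_j), \qquad J_i = J(x_i) \tag{11.44b}$$

Since $Z_0[J]$ contains only even powers of J, it is obvious that

$$\langle 0|T[\hat\phi(x_1)\cdots\hat\phi(x_{2k-1})]|0\rangle
 = \left(\frac{1}{i}\right)^{2k-1}
 \left.\frac{\delta^{2k-1}Z_0[J]}{\delta J(x_1)\cdots\delta J(x_{2k-1})}\right|_{J=0} = 0 \tag{11.45a}$$

If n = 2k is even, the n-point function is a sum of products of two-point functions:

$$\langle 0|T[\hat\phi(x_1)\cdots\hat\phi(x_{2k})]|0\rangle
 = \sum_{\text{pairings}} i\Delta_F(x_{p_1} - x_{p_2})\cdots i\Delta_F(x_{p_{2k-1}} - x_{p_{2k}}) \tag{11.45b}$$

where the sum is taken over those permutations $(p_1, \ldots, p_{2k})$ of $(1, \ldots, 2k)$
that give distinct pairings of the 2k points. This is one version of Wick's theorem.

Note the factor $1/(2^k k!)$ has disappeared from the expression for the 2k-point
function. This can be seen as follows. The n functional derivatives are completely
symmetric and there are n! permutations (different orders) of the n variables
$x_1\cdots x_n$. When n = 2k, the coefficient of $J_1\cdots J_{2k}$ in Eq. (11.44a),
$\Delta_{12}\Delta_{34}\cdots\Delta_{2k-1,2k}$, has a $2^k$-fold pairwise symmetry because
$\Delta_{ij} = \Delta_{ji}$ and another k!-fold symmetry because the order of the product
of the k factors $\Delta_{ij}$ can be arbitrary. That leaves
$(2k)!/(2^k k!) = (2k-1)!! = (2k-1)\cdots 3\cdot 1$ irreducible combinations in
Eq. (11.45b). As an explicit example we consider the case n = 4. We then should have
$4!/(2^2\,2!) = 3$ terms for the 4-point function, which is given by

$$\begin{aligned}
\langle 0|T[\hat\phi(x_1)\hat\phi(x_2)\hat\phi(x_3)\hat\phi(x_4)]|0\rangle
 &= \frac{1}{2^2\,2!}\sum_{\text{all }4!\text{ perms}} i\Delta_{p_1p_2}\,i\Delta_{p_3p_4}\\
 &= -\Delta_F(x_1 - x_2)\Delta_F(x_3 - x_4) - \Delta_F(x_1 - x_3)\Delta_F(x_2 - x_4)\\
 &\quad - \Delta_F(x_1 - x_4)\Delta_F(x_2 - x_3)
\end{aligned} \tag{11.46}$$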
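The counting argument for Wick's theorem can be made concrete by listing the pairings
explicitly. The following is a minimal sketch, not part of the original text; the
function names `pairings` and `double_factorial_odd` are our own. It enumerates all
distinct ways of pairing 2k points and compares their number with
$(2k)!/(2^k k!) = (2k-1)!!$.

```python
from math import factorial


def pairings(points):
    """Yield every distinct way of splitting `points` into unordered pairs."""
    if not points:
        yield []
        return
    first, rest = points[0], points[1:]
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in pairings(remaining):
            yield [(first, partner)] + tail


def double_factorial_odd(k):
    """Return (2k-1)!! computed as (2k)! / (2^k k!)."""
    return factorial(2 * k) // (2**k * factorial(k))


# The three pairings that appear in the 4-point function, Eq. (11.46)
four_point = list(pairings([1, 2, 3, 4]))
print(four_point)

# Number of terms in the 2k-point function for the first few k
counts = {k: sum(1 for _ in pairings(list(range(2 * k)))) for k in range(1, 6)}
print(counts)
```

For four points the three pairings are (12)(34), (13)(24) and (14)(23), one for each
term of Eq. (11.46).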

Equations (11.43) and (11.45) are important because, as we will learn lat- 
er (Sect. 11.5), the transition rate of any physical process can be expressed in 
terms of n-point vacuum correlation functions using the Lehmann-Symanzik- 
Zimmermann (LSZ) reduction formula. 

So far we have treated only the generating functionals of the free Lagrangian. 
The relation between the n-point vacuum correlation functions and the functional 
derivatives is equally valid if the Lagrangian includes interaction terms. 

11.2.5 

Generating Functional of Interacting Fields 

So far we have dealt with the action of up to quadratic terms or exactly soluble 
cases. In general, the action includes higher order polynomials or transcendental 
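functions that are not analytically soluble. Consider adding a potential $V(\phi)$ to the
free Lagrangian:

$$\mathcal{L} = \frac{1}{2}\left[\partial_\mu\phi\,\partial^\mu\phi - (m^2 - i\varepsilon)\phi^2\right] - V(\phi) \tag{11.47}$$

It is not possible to solve this problem exactly even when $V \ll 1$. But as long as V
is small, we can solve it perturbatively in the following way. We retain the external
source as before and write

$$\begin{aligned}
S[\phi, J] &= S_0[\phi, J] - \int d^4x\,V(\phi)\\
S_0[\phi, J] &= \int d^4x\left\{\frac{1}{2}\left[\partial_\mu\phi\,\partial^\mu\phi
 - (m^2 - i\varepsilon)\phi^2\right] + J(x)\phi(x)\right\}
\end{aligned} \tag{11.48}$$

so that the generating functional becomes

$$Z[J] = N\int\mathcal{D}\phi\,e^{iS[\phi, J]}
 = N\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\,V(\phi)\right]e^{iS_0[\phi, J]} \tag{11.49}$$

We use the relation

$$\frac{\delta}{\delta J(x)}Z_0[J] = \frac{\delta}{\delta J(x)}\,N\int\mathcal{D}\phi\,e^{iS_0[\phi, J]}
 = N\int\mathcal{D}\phi\;i\phi(x)\,e^{iS_0[\phi, J]} \tag{11.50}$$

This relation is also true for any power of $\phi$, or for a potential $V(\phi)$ that is
Taylor expandable:

$$V\!\left(\frac{1}{i}\frac{\delta}{\delta J(x)}\right)Z_0[J]
 = N\int\mathcal{D}\phi\;V(\phi(x))\,e^{iS_0[\phi, J]} \tag{11.51}$$

This relation allows us to take the V-dependent factor in Eq. (11.49) out of the path
integral. Consequently, after exponentiation, we obtain

$$\begin{aligned}
Z[J] &= N\int\mathcal{D}\phi\,\exp\left[-i\int d^4x\,
 V\!\left(\frac{1}{i}\frac{\delta}{\delta J(x)}\right)\right]e^{iS_0[\phi, J]}\\
&= \exp\left[-i\int d^4x\,V\!\left(\frac{1}{i}\frac{\delta}{\delta J(x)}\right)\right]Z_0[J]
\end{aligned} \tag{11.52}$$

where $Z_0[J]$ is the vacuum functional without interaction. Using the expression (11.22)
for $Z_0[J]$

$$Z[J] = \exp\left[-i\int d^4x\,V\!\left(\frac{1}{i}\frac{\delta}{\delta J(x)}\right)\right]
 \times\exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J(x)\Delta_F(x - y)J(y)\right] \tag{11.53}$$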



If V is small we can expand the first exponential in powers of V, and the problem 
can be treated perturbatively order by order. 
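The order-by-order expansion can be tried out on a toy model. The sketch below is our
own illustration and not the field theory of the text: it assumes a zero-dimensional,
Euclidean analog in which the "path integral" is an ordinary integral,
$Z = \int d\phi\,\exp(-\phi^2/2 - g\phi^4)$, with a hypothetical coupling `g`. Expanding
the interaction factor in powers of g, as suggested by Eq. (11.53), expresses Z through
moments of the free Gaussian weight.

```python
import numpy as np

g = 0.01                                   # small coupling (illustrative value)
phi = np.linspace(-12.0, 12.0, 240001)     # integration grid for the single variable
dphi = phi[1] - phi[0]


def integrate(values):
    """Plain Riemann sum; the integrands are negligible at the grid edges."""
    return float(np.sum(values) * dphi)


free_weight = np.exp(-0.5 * phi**2)
Z0 = integrate(free_weight)                               # free "functional", sqrt(2 pi)
Z_exact = integrate(np.exp(-0.5 * phi**2 - g * phi**4))   # full toy integral

# Free-theory moments, the toy analog of n-point functions
m4 = integrate(phi**4 * free_weight) / Z0                 # equals 3
m8 = integrate(phi**8 * free_weight) / Z0                 # equals 105

# Expansion of exp(-g phi^4) to first and second order in g
Z_first = Z0 * (1.0 - g * m4)
Z_second = Z0 * (1.0 - g * m4 + 0.5 * g**2 * m8)

print(Z_exact / Z0, Z_first / Z0, Z_second / Z0)
```

The moments 3 and 105 are the numbers of pairings of four and eight points, so each
order of the expansion is a sum of products of two-point functions, just as in Wick's
theorem.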



11.3 
Electromagnetic Field 

11.3.1 

Gauge Fixing and the Photon Propagator 

The electromagnetic field is gauge invariant and this produces some difficulties in 
the path integral. The action of the free electromagnetic field is given by 

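$$\begin{aligned}
S &= \int d^4x\left(-\frac{1}{4}F_{\mu\nu}F^{\mu\nu}\right)
 = -\frac{1}{4}\int d^4x\,(\partial_\mu A_\nu - \partial_\nu A_\mu)(\partial^\mu A^\nu - \partial^\nu A^\mu)\\
&= -\frac{1}{4}\int d^4x\left[\partial_\mu A_\nu\partial^\mu A^\nu - \partial_\nu A_\mu\partial^\mu A^\nu
 - \partial_\mu A_\nu\partial^\nu A^\mu + \partial_\nu A_\mu\partial^\nu A^\mu\right]
\end{aligned} \tag{11.54}$$

By partial integration

$$\begin{aligned}
S &= -\frac{1}{4}\int d^4x\left(-2A_\nu\partial_\mu\partial^\mu A^\nu + 2A^\mu\partial_\mu\partial_\nu A^\nu\right)\\
&= \frac{1}{2}\int d^4x\,A^\mu(x)\left[g_{\mu\nu}\partial_\rho\partial^\rho - \partial_\mu\partial_\nu\right]A^\nu(x)\\
&= \frac{1}{2}\int\frac{d^4k}{(2\pi)^4}\,A_\mu(-k)\left[-g^{\mu\nu}k^2 + k^\mu k^\nu\right]A_\nu(k)
\end{aligned} \tag{11.55}$$

where $A(k)$ is the Fourier transform of $A(x)$. The action is gauge invariant, which
means it does not change if the electromagnetic field undergoes a gauge transformation:

$$A_\mu(k) \to A'_\mu(k) = A_\mu(k) + k_\mu\chi(k) \tag{11.56}$$

so the action stays constant for a vast class of $\chi$, which makes the path integral
badly divergent:

$$\int\mathcal{D}A\,e^{iS[A]} \sim \prod_{\mu,\alpha}\int dA^\mu_\alpha \times \text{constant} \to \infty \tag{11.57}$$

where $\alpha$ stands for $t_j, x_k, y_l, z_m$. Namely, $A_\mu = \partial_\mu\chi(x)$ is
equivalent to $A_\mu(x) = 0$ and should not be included in the path integral. The
divergence problem is linked to another difficulty: the operator
$\Lambda_{\mu\nu} = g_{\mu\nu}\partial_\rho\partial^\rho - \partial_\mu\partial_\nu$ does not
have an inverse. This means that the equation of the Feynman propagator

$$\begin{aligned}
\left[g_{\mu\nu}\partial_\rho\partial^\rho - \partial_\mu\partial_\nu\right]D_F^{\nu\sigma}(x - y)
 &= \delta_\mu^{\ \sigma}\,\delta^4(x - y)\\
\text{or}\quad\left[-g_{\mu\nu}k^2 + k_\mu k_\nu\right]\tilde D_F^{\nu\sigma}(k) &= \delta_\mu^{\ \sigma}
\end{aligned} \tag{11.58}$$

has no solution. To see this, put $\tilde D_F^{\nu\sigma}(k) = A\,g^{\nu\sigma} + B\,k^\nu k^\sigma$,
insert it in Eq. (11.58) and solve for A and B. The B terms cancel identically, leaving
$-Ak^2\delta_\mu^{\ \sigma} + Ak_\mu k^\sigma = \delta_\mu^{\ \sigma}$, which would require
both $A = -1/k^2$ and $A = 0$. The remedy we discussed in the Lorentz covariant treatment
of the canonical quantization was to add a gauge-fixing term to the Lagrangian: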

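$$\mathcal{L} = \mathcal{L}_{EM} + \mathcal{L}_{GF}
 = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} - \frac{1}{2\lambda}(\partial_\mu A^\mu)^2 \tag{11.59}$$

In particular, we saw that setting $\lambda = 1$ in the canonical quantization produces the
Maxwell equation, which satisfies the Lorentz condition

$$\langle\varphi|\partial_\mu\hat A^\mu|\psi\rangle = 0 \tag{11.60b}$$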

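where $\langle\varphi|$ and $|\psi\rangle$ are physical state vectors. The same prescription
works for the path integral. If we add the gauge-fixing term to the Lagrangian, it is no
longer gauge invariant and there is no redundant integration which produces unwanted
divergence. Following the same reformulating procedure as led to Eq. (11.55), we
obtain a new action

$$\begin{aligned}
S &= \int d^4x\left[-\frac{1}{4}F_{\mu\nu}F^{\mu\nu} - \frac{1}{2\lambda}(\partial_\mu A^\mu)^2\right]\\
&= \frac{1}{2}\int d^4x\,A_\mu(x)\left[g^{\mu\nu}\partial_\rho\partial^\rho
 - \left(1 - \frac{1}{\lambda}\right)\partial^\mu\partial^\nu\right]A_\nu(x)\\
&= \frac{1}{2}\int\frac{d^4k}{(2\pi)^4}\,A_\mu(-k)\left[-g^{\mu\nu}k^2
 + \left(1 - \frac{1}{\lambda}\right)k^\mu k^\nu\right]A_\nu(k)
\end{aligned} \tag{11.61}$$

The operator

$$\Lambda_{\mu\nu} = g_{\mu\nu}\partial_\rho\partial^\rho - \left(1 - \frac{1}{\lambda}\right)\partial_\mu\partial_\nu \tag{11.62}$$

now has an inverse and the Feynman propagator for the electromagnetic field (the
photon) is a solution to the equation

$$\left[g_{\mu\nu}\partial_\rho\partial^\rho - \left(1 - \frac{1}{\lambda}\right)\partial_\mu\partial_\nu
 - i\varepsilon\right]D_F^{\nu\sigma}(x - y) = \delta_\mu^{\ \sigma}\,\delta^4(x - y) \tag{11.63}$$

where a small negative imaginary part is added to keep the path integral well behaved
at large time. In momentum space it is given by

$$\tilde D_{F\mu\nu}(k) = -\frac{1}{k^2 + i\varepsilon}\left[g_{\mu\nu} + (\lambda - 1)\frac{k_\mu k_\nu}{k^2}\right] \tag{11.64}$$

The constant $\lambda$ is arbitrary. Note, setting $\lambda \to \infty$ reproduces the gauge
invariant Lagrangian, but no propagator exists in this case. Choosing $\lambda = 1$
reproduces the familiar Feynman propagator, which we will adopt from now on. In this case

$$\begin{aligned}
D_{F\mu\nu}(x - y) &= \int\frac{d^4k}{(2\pi)^4}\,e^{-ik\cdot(x - y)}\,\frac{-g_{\mu\nu}}{k^2 + i\varepsilon}
 = -g_{\mu\nu}\,\Delta_F(x - y)\big|_{m^2 = 0}\\
\partial_\rho\partial^\rho D_{F\mu\nu}(x - y) &= g_{\mu\nu}\,\delta^4(x - y)
\end{aligned} \tag{11.65}$$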

The addition of the gauge-fixing term has made the action, and hence the path integral,
gauge dependent. Consequently the propagator is also gauge dependent, as is reflected
by the arbitrary choice of $\lambda$ in Eq. (11.64). However, all choices lead to the
same physics because in the transition amplitude $D_{F\mu\nu}$ appears coupled to
conserved currents, which fulfill $\partial_\mu j^\mu = 0 \to k_\mu j^\mu(k) = 0$. As a
consequence, the $\lambda$-dependent term in the propagator always drops out and the
physically observable transition amplitudes are gauge independent.
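These statements are easy to check numerically at a fixed momentum. The following sketch
is our own and makes several assumptions: an arbitrary off-shell test momentum `k_up`,
the metric signature (+,−,−,−), $\varepsilon \to 0$, and the momentum-space propagator
$-\frac{1}{k^2}\left[g^{\nu\sigma} + (\lambda - 1)k^\nu k^\sigma/k^2\right]$ obtained by
inverting the gauge-fixed operator. The helper names `operator`, `propagator` and
`conserved` are hypothetical.

```python
import numpy as np

g = np.diag([1.0, -1.0, -1.0, -1.0])     # metric, same entries with upper or lower indices
k_up = np.array([1.5, 0.4, -0.2, 0.7])   # arbitrary off-shell test momentum k^mu
k_dn = g @ k_up                          # k_mu
k2 = float(k_up @ k_dn)                  # k^2


def operator(lam):
    """[-g_{mu nu} k^2 + (1 - 1/lam) k_mu k_nu]; lam = inf is the gauge-invariant case."""
    c = 1.0 if np.isinf(lam) else 1.0 - 1.0 / lam
    return -g * k2 + c * np.outer(k_dn, k_dn)


def propagator(lam):
    """D^{nu sigma}(k) = -(1/k^2) [g^{nu sigma} + (lam - 1) k^nu k^sigma / k^2], eps -> 0."""
    return -(g + (lam - 1.0) * np.outer(k_up, k_up) / k2) / k2


def conserved(v_dn):
    """Project a covariant vector onto k.j = 0, mimicking a conserved current."""
    return v_dn - k_dn * float(k_up @ v_dn) / k2


# Without gauge fixing, k^nu is annihilated by the operator, so no inverse exists
null_vector = operator(np.inf) @ k_up

# With gauge fixing, operator times propagator gives the unit matrix delta_mu^sigma
products = {lam: operator(lam) @ propagator(lam) for lam in (0.5, 1.0, 3.0)}

# Contracted with conserved currents, the lam-dependent term drops out
j1 = conserved(np.array([0.3, 1.0, -0.5, 0.2]))
j2 = conserved(np.array([-0.7, 0.1, 0.9, 0.4]))
amplitudes = [float(j1 @ propagator(lam) @ j2) for lam in (0.5, 1.0, 3.0)]
print(amplitudes)
```

The three printed numbers agree, which is the finite-dimensional content of the claim
that observable amplitudes do not depend on $\lambda$.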

11.3.2 

Generating Functional of the Electromagnetic Field 

Once we have fixed the action for the electromagnetic field, the generating functional
can be constructed in a similar manner as for the scalar field. We add the external
field to the Lagrangian and reformulate the action in terms of the external
field and the Feynman propagator. The bilinear electromagnetic field term can be
integrated out just like the scalar field in deriving Eq. (11.22) and again we have two
alternative expressions for the generating functional, just as for the scalar field:

$$\begin{aligned}
Z_0^{EM}[J] &= N\int\mathcal{D}A\,\exp\left[i\int d^4x\left\{-\frac{1}{4}F_{\mu\nu}F^{\mu\nu}
 - \frac{1}{2}(\partial_\mu A^\mu)^2 + J_\mu A^\mu\right\}\right]\\
&= \frac{\int\mathcal{D}A\,\exp\left[i\int d^4x\left\{\frac{1}{2}A^\mu\Lambda_{\mu\nu}A^\nu + J_\mu A^\mu\right\}\right]}
 {\int\mathcal{D}A\,\exp\left[i\int d^4x\left\{\frac{1}{2}A^\mu\Lambda_{\mu\nu}A^\nu\right\}\right]}\\
&= \exp\left[-\frac{i}{2}\int d^4x\,d^4y\,J^\mu(x)D_{F\mu\nu}(x - y)J^\nu(y)\right]
\end{aligned} \tag{11.66}$$

where $\Lambda_{\mu\nu}$ is defined in Eq. (11.62). Note this is the generating functional
of the free electromagnetic field. That of the interacting field will be discussed in
Sect. 11.6.

The two-point vacuum expectation value can be derived from the above equation
and again proves to be the Feynman propagator of the photon:

$$\langle 0|T[\hat A_\mu(x)\hat A_\nu(y)]|0\rangle
 = \left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_0^{EM}[J]}{\delta J^\mu(x)\,\delta J^\nu(y)}\right|_{J=0}
 = iD_{F\mu\nu}(x - y) \tag{11.67}$$



11.4

Dirac Field 

The path integral deals only with classical numbers. However, we cannot write 
the Lagrangian for the anticommuting fermion fields using normal or classical 
c-numbers. We need the notion of anticommuting classical numbers, mathemati- 
cally known as Grassmann variables. 

11.4.1 

Grassmann Variables 

Polynomial  We use G-variables or G-numbers to denote Grassmann variables or
Grassmann generators. Two independent Grassmann variables $\eta_i$ and $\eta_j$ are
anticommuting, i.e.

$$\eta_i\eta_j = -\eta_j\eta_i \tag{11.68}$$

In particular,

$$\eta_i^2 = 0 \quad (i \text{ not summed}) \tag{11.69}$$

This means the G-numbers cannot have a length or an absolute value. Because
of Eq. (11.68), any function of Grassmann variables including a c-number has a
simple Taylor expansion:

$$f(\eta) = a + b\eta \tag{11.70}$$

where a and b are functions of ordinary c-numbers. For instance, an exponential
function of G-numbers is

$$e^{\eta} = 1 + \eta \tag{11.71}$$

Complex Conjugate  A complex Grassmann number is defined by using two real
G-numbers $\xi$ and $\eta$

$$\zeta = \xi + i\eta, \qquad \zeta^* = \xi - i\eta \tag{11.72}$$

If we require $\zeta^*\zeta$ to be real, i.e.

$$(\zeta^*\zeta)^* = \zeta^*\zeta = -\zeta\zeta^* \tag{11.73}$$

then

$$\begin{aligned}
(\xi - i\eta)(\xi + i\eta) &= \xi^2 + \eta^2 + 2i\xi\eta = 2i\xi\eta = (2i\xi\eta)^*\\
\therefore\ (\xi\eta)^* &= -\xi\eta \ \to\ (\xi\eta)^* = \eta\xi
\end{aligned} \tag{11.74}$$

Therefore

$$\begin{aligned}
\zeta_1\zeta_2 &= (\xi_1 + i\eta_1)(\xi_2 + i\eta_2) = (\xi_1\xi_2 - \eta_1\eta_2) + i(\xi_1\eta_2 + \eta_1\xi_2)\\
\zeta_2^*\zeta_1^* &= (\xi_2 - i\eta_2)(\xi_1 - i\eta_1) = \xi_2\xi_1 - \eta_2\eta_1 - i\xi_2\eta_1 - i\eta_2\xi_1\\
\therefore\ (\zeta_1\zeta_2)^* &= \zeta_2^*\zeta_1^*
\end{aligned} \tag{11.75}$$

This shows that the complex conjugate of a G-number product behaves like the
hermitian conjugate of matrix products.

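Derivative  As Grassmann variables are anticommuting, two distinguishable
derivatives are defined. A left derivative would give

$$\frac{\partial}{\partial\eta_i}(\eta_j\eta_k)
 = \left(\frac{\partial\eta_j}{\partial\eta_i}\right)\eta_k - \eta_j\left(\frac{\partial\eta_k}{\partial\eta_i}\right)
 = \delta_{ij}\eta_k - \delta_{ik}\eta_j \tag{11.76}$$

whereas a right derivative would give

$$(\eta_j\eta_k)\frac{\overleftarrow{\partial}}{\partial\eta_i}
 = \eta_j\left(\frac{\partial\eta_k}{\partial\eta_i}\right) - \left(\frac{\partial\eta_j}{\partial\eta_i}\right)\eta_k
 = \delta_{ik}\eta_j - \delta_{ij}\eta_k \tag{11.77}$$

Thus we have to be careful as to which one is used. In our discussion, we assume
we use the left derivatives unless otherwise noted. The left derivative can be
generalized to multivariables:

$$\frac{\partial}{\partial\eta_b}(\eta_a\cdots\eta_b\cdots\eta_c) = (-1)^P(\eta_a\cdots\widehat{\eta_b}\cdots\eta_c) \tag{11.78}$$

where P is the number of permutations to move $\eta_b$ to the left and
$\widehat{\eta_b}$ means that $\eta_b$ is omitted from the product. We note that two
derivatives anticommute just like G-numbers. Namely,

$$\frac{\partial}{\partial\eta_i}\frac{\partial}{\partial\eta_j} = -\frac{\partial}{\partial\eta_j}\frac{\partial}{\partial\eta_i} \tag{11.79}$$

which also means $(\partial/\partial\eta_i)^2 = 0$.
This relation implies that there is no inverse to the derivative. This can be seen by
multiplying the defining equation for the inverse

$$\frac{\partial}{\partial\eta}\left(\frac{\partial}{\partial\eta}\right)^{-1}f(\eta) = f(\eta) \tag{11.80}$$

from the left by $\partial/\partial\eta$:

$$\frac{\partial}{\partial\eta}\frac{\partial}{\partial\eta}\left(\frac{\partial}{\partial\eta}\right)^{-1}f(\eta)
 = 0\times\left(\frac{\partial}{\partial\eta}\right)^{-1}f(\eta) = \frac{\partial}{\partial\eta}f(\eta) \tag{11.81}$$

which is a contradiction. Thus the inverse does not exist.

The conventional commutation relation between the derivative and the coordinate
takes the form

$$\left\{\frac{\partial}{\partial\eta_i}, \eta_j\right\}
 = \frac{\partial}{\partial\eta_i}\eta_j + \eta_j\frac{\partial}{\partial\eta_i} = \delta_{ij} \tag{11.82}$$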

Integral  Because there is no inverse to the derivative, the integral in the space of
G-variables can only be defined formally, in a way which preserves some general
properties of ordinary integration. Since any function can be expressed as a polynomial
in the form of Eq. (11.70), there are only two types of integral:

$$\int d\eta, \qquad \int d\eta\,\eta \tag{11.83}$$

All other integrals are zero. To derive rules for the integrals, we require that
Grassmann variables respect translational invariance. Let $\xi$ be a constant G-number and
change the integration variable from $\eta$ to $\eta' = \eta - \xi$. If translational
invariance holds, the two integrals should give the same result:

$$d\eta' = d\eta \tag{11.84a}$$

$$\int d\eta'\,\eta' = \int d\eta\,(\eta - \xi) = \int d\eta\,\eta + \xi\int d\eta = \int d\eta\,\eta \tag{11.84b}$$

$$\therefore\ \int d\eta = 0 \tag{11.85a}$$

To evaluate $\int d\eta\,\eta$, we use the fact that a G-number commutes with a pair of other
G-numbers:

$$\int d\eta\,\eta\,\xi = \xi\int d\eta\,\eta$$

This means $\int d\eta\,\eta$ is a c-number and we can define it to take the value 1. Namely

$$\int d\eta\,\eta = 1 \tag{11.85b}$$

The definition can be justified if we consider the fact that the integral of a total
derivative must vanish if we ignore surface terms, and, the definite integral
being a constant, its derivative must also vanish, namely

$$\int d\eta\,\frac{\partial}{\partial\eta}f(\eta) = \frac{\partial}{\partial\eta}\int d\eta\,f(\eta) = 0 \tag{11.86}$$

The definitions of the integral, Eqs. (11.85a) and (11.85b), mean that for any
function we have

$$\int d\eta_j\,f(\eta_j) = \int d\eta_j\,(a + b\eta_j) = b = \frac{\partial}{\partial\eta_j}f(\eta_j) \tag{11.87}$$

Namely, the differential and integral operations are identical. Hence

$$\int d\eta\,\frac{\partial}{\partial\eta}f(\eta) = \frac{\partial}{\partial\eta}\int d\eta\,f(\eta)
 = \frac{\partial^2}{\partial\eta^2}f(\eta) = 0 \tag{11.88}$$

which confirms Eq. (11.86).

Because the integral is the same as the differential, the Jacobian of the variable
transformation differs from that of ordinary integration. If we redefine the variable
of the integration from $\eta$ to $\eta' = a\eta$, then

$$\int d\eta\,f(\eta) = \frac{\partial f(\eta)}{\partial\eta}
 = \frac{\partial\eta'}{\partial\eta}\frac{\partial f(\eta'/a)}{\partial\eta'}
 = a\int d\eta'\,f\!\left(\frac{\eta'}{a}\right) \tag{11.89}$$

The last equality is opposite to what happens to an ordinary integral. Namely, the
Jacobian that appears in the redefinition of the Grassmann variables is the inverse
of what one would expect for the ordinary integral.

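Multiple Integral  In multiple integrals the integration variable has to be brought
first next to the integration measure:

$$\int d\eta_1\int d\eta_2\,\eta_1\eta_2 = -\int d\eta_1\int d\eta_2\,\eta_2\eta_1 = -\int d\eta_1\,\eta_1 = -1 \tag{11.90}$$

Problem 11.1

Consider the N-variable transformation $\xi_i = \sum_j U_{ij}\eta_j$. Write

$$\int\prod_{i=1}^N d\eta_i\,f(\eta_1, \cdots, \eta_N) = J_G\int\prod_{i=1}^N d\xi_i\,g(\xi_1, \cdots, \xi_N),
 \qquad g(\xi) = f(U^{-1}\xi) \tag{11.91}$$

Using Eq. (11.85) and anticommutativity of the G-variables, prove

$$J_G = \frac{\partial(\xi_1, \cdots, \xi_N)}{\partial(\eta_1, \cdots, \eta_N)} = \det[U_{ij}] \tag{11.92}$$

δ function  A delta function in the space of Grassmann variables is written as

$$\delta(\eta - \eta') = \eta - \eta' \tag{11.93}$$

Proof: As $f(\eta) = a + b\eta$

$$\begin{aligned}
\int d\eta\,(\eta - \eta')f(\eta) &= \int d\eta\,(\eta - \eta')(a + b\eta)\\
&= \frac{\partial}{\partial\eta}\left[(\eta - \eta')(a + b\eta)\right] = a + b\eta' = f(\eta')
\end{aligned} \tag{11.94}$$

Note, as $\delta(\eta) = \eta$, $\delta(a\eta) = a\eta = \{\partial(a\eta)/\partial\eta\}\,\eta$. Generally

$$\delta(f(\eta)) = \frac{\partial f}{\partial\eta}\,\delta(\eta) \tag{11.95}$$

which is consistent with the Jacobian defined by Eq. (11.92).

An integral representation of the δ-function is obtained by noting that if $\eta$ is a
G-variable

$$\frac{1}{i}\int d\eta\,e^{i\eta(\xi - \xi')} = \frac{1}{i}\int d\eta\,[1 + i\eta(\xi - \xi')]
 = \xi - \xi' = \delta(\xi - \xi') \tag{11.96}$$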

Gaussian Integral  Next we treat the Gaussian integral of the Grassmann variables.
First we prove that for an integral of complex Grassmann variables

$$\int d\eta^*d\eta\,e^{-\eta^*\eta} = 1 \tag{11.97}$$

The above equality can easily be proved because

$$\int d\eta^*d\eta\,e^{-\eta^*\eta} = \int d\eta^*d\eta\,(1 - \eta^*\eta)
 = \frac{\partial}{\partial\eta^*}\frac{\partial}{\partial\eta}(1 - \eta^*\eta) = 1 \tag{11.98}$$

Next we generalize the Gaussian integral to n complex variables. Denoting a Grassmann
vector $\eta^T = (\eta_1, \eta_2, \cdots, \eta_n)$, we consider an integral

$$I_G = \int\prod_{i=1}^n d\eta_i^*d\eta_i\,\exp\left[-\eta^\dagger A\eta\right] \tag{11.99}$$

where A is an n × n matrix and we assume $A^\dagger = A$, because in the path integral
we are interested only in hermitian matrices. Then it can be diagonalized by using
a unitary matrix U:

$$D = UAU^\dagger \quad\text{or}\quad A = U^\dagger DU \tag{11.100}$$

where D is a diagonal matrix and is expressed as

$$D_{jk} = \delta_{jk}\lambda_j \tag{11.101}$$

Changing the variables to $\xi = U\eta$

$$\begin{aligned}
I_G &= \int\prod_{i=1}^n d\eta_i^*d\eta_i\,\exp\left[-\xi^\dagger UAU^\dagger\xi\right]\\
&= \det U^\dagger\det U\int\prod_{i=1}^n d\xi_i^*d\xi_i\,\exp\left[-\xi^\dagger D\xi\right]\\
&= \prod_{j=1}^n\left(\int d\xi_j^*d\xi_j\left[1 - \lambda_j\xi_j^*\xi_j\right]\right) = \prod_{j=1}^n\lambda_j = \det A
\end{aligned} \tag{11.102}$$

We have used the fact that the Jacobian is unity:

$$J = (\det U^\dagger)(\det U) = \det(U^\dagger U) = 1 \tag{11.103}$$

We also used the fact that the pair $d\xi^*d\xi$ is a c-number and can be moved freely in
the product. Note, the integral is 2n dimensional but the matrix is n dimensional.
This formula is to be contrasted with the n-fold Gaussian integral over ordinary
c-number complex variables [Eq. (10.148)], for which, up to a constant factor fixed by
the normalization of the measure,

$$I_c = \int dz_1^*dz_1\cdots dz_n^*dz_n\,e^{-z^\dagger Az} \propto \frac{1}{\det A} \tag{11.104}$$

We note that the determinant of the Grassmann Gaussian integral appears in the
numerator whereas that of the ordinary Gaussian appears in the denominator.
The generalization to the functional integral is straightforward. Change

$$\begin{aligned}
\eta_i &\to \eta(x)\\
\prod_i d\eta_i^*d\eta_i &\to \mathcal{D}\eta^*(x)\,\mathcal{D}\eta(x)\\
\eta^\dagger A\eta &\to \int d^4x\,d^4y\,\eta^*(x)A(x, y)\eta(y)
\end{aligned} \tag{11.105}$$

and we obtain

$$\int\mathcal{D}\eta^*(x)\,\mathcal{D}\eta(x)\,\exp\left[-i\int d^4x\,d^4y\,\eta^*(x)A(x, y)\eta(y)\right] = \det iA \tag{11.106}$$
Next, we evaluate an integral of the form

$$I_2 = \int\prod_{i=1}^n d\eta_i^*d\eta_i\;\eta_k\eta_l^*\,\exp\left[-\eta^\dagger A\eta\right] \tag{11.107}$$

It is clear that an integral containing one or an odd number of G-variables in front
of the exponential function vanishes. After unitary transformation the integral
becomes [see Eq. (11.102)]

$$I_2 = \int\prod_{j=1}^n d\xi_j^*d\xi_j\,(U^\dagger_{ks}\xi_s)(\xi_t^*U_{tl})\prod_{j=1}^n\left[1 - \lambda_j\xi_j^*\xi_j\right] \tag{11.108}$$

From this equation we see that only terms of the form

$$\int d\xi_j^*d\xi_j\,\xi_j\xi_j^*\,(1 - \lambda_j\xi_j^*\xi_j) = \int d\xi_j^*d\xi_j\,\xi_j\xi_j^* \tag{11.109}$$

give a non-zero contribution. Namely, $\xi_s\xi_t^*$ with s = t = j survive and
$\lambda_j$ is lost. Therefore, the effect of inserting an extra $\eta_k\eta_l^*$ in the
integral $I_G$ is to make the change

$$\lambda_j \to U^\dagger_{kj}U_{jl} \tag{11.110}$$

or equivalently to multiply $I_G$ [Eq. (11.102)] by

$$\sum_j U^\dagger_{kj}\frac{1}{\lambda_j}U_{jl} = (A^{-1})_{kl} \tag{11.111}$$

Thus the end result becomes

$$I_2 = \int\prod_{i=1}^n d\eta_i^*d\eta_i\;\eta_k\eta_l^*\,\exp\left[-\eta^\dagger A\eta\right] = \det A\,(A^{-1})_{kl} \tag{11.112}$$

Inserting another pair $\eta_m\eta_n^*$ in the integrand would yield a second factor
$(A^{-1})_{mn}$.
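The integration rules above are simple enough to be put on a computer. The sketch below
is our own illustration, not an algorithm from the text: the class `G` and the function
`grassmann_gaussian` are hypothetical names. A Grassmann element is stored as a dictionary
from ordered generator labels to coefficients, the product keeps track of the permutation
sign, and the integral over one generator is implemented as the left derivative, as in
Eq. (11.87). With these rules one can check Eqs. (11.90), (11.102) and (11.112) for a
small random hermitian matrix.

```python
import itertools
import numpy as np


class G:
    """Grassmann algebra element: {sorted tuple of generator labels: coefficient}."""

    def __init__(self, terms=None):
        self.terms = {k: v for k, v in (terms or {}).items() if v != 0}

    @staticmethod
    def gen(i):
        return G({(i,): 1.0})

    @staticmethod
    def const(c):
        return G({(): c})

    def __add__(self, other):
        out = dict(self.terms)
        for k, v in other.terms.items():
            out[k] = out.get(k, 0) + v
        return G(out)

    def scale(self, c):
        return G({k: c * v for k, v in self.terms.items()})

    def __mul__(self, other):
        out = {}
        for ka, va in self.terms.items():
            for kb, vb in other.terms.items():
                if set(ka) & set(kb):
                    continue                      # eta_i^2 = 0
                merged = ka + kb
                pairs = itertools.combinations(range(len(merged)), 2)
                inversions = sum(1 for a, b in pairs if merged[a] > merged[b])
                key = tuple(sorted(merged))       # reorder, keeping the sign
                out[key] = out.get(key, 0) + (-1) ** inversions * va * vb
        return G(out)

    def integrate(self, i):
        """Integral over generator i, identical to the left derivative."""
        out = {}
        for k, v in self.terms.items():
            if i in k:
                pos = k.index(i)                  # moves needed to bring eta_i to the left
                key = k[:pos] + k[pos + 1:]
                out[key] = out.get(key, 0) + (-1) ** pos * v
        return G(out)

    def scalar(self):
        return self.terms.get((), 0.0)


def grassmann_gaussian(A, insert=None):
    """Integral of [eta_k eta_l^*] exp(-eta^dagger A eta) over prod_i d eta_i^* d eta_i."""
    n = A.shape[0]
    star = lambda i: 2 * i        # label of eta_i^*
    plain = lambda i: 2 * i + 1   # label of eta_i
    X = G()
    for i in range(n):
        for j in range(n):
            X = X + (G.gen(star(i)) * G.gen(plain(j))).scale(A[i, j])
    term, expo = G.const(1.0), G.const(1.0)
    for order in range(1, n + 1):                 # the series stops at order n
        term = (term * X).scale(-1.0 / order)
        expo = expo + term
    integrand = expo
    if insert is not None:
        k, l = insert
        integrand = G.gen(plain(k)) * G.gen(star(l)) * expo
    for i in reversed(range(n)):                  # innermost measure factor first
        integrand = integrand.integrate(plain(i)).integrate(star(i))
    return integrand.scalar()


rng = np.random.default_rng(0)
M = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
A = M + M.conj().T + 4.0 * np.eye(3)              # hermitian test matrix

sign_check = (G.gen(1) * G.gen(2)).integrate(2).integrate(1).scalar()   # Eq. (11.90)
I_G = grassmann_gaussian(A)                                             # Eq. (11.102)
I_2 = np.array([[grassmann_gaussian(A, insert=(k, l)) for l in range(3)]
                for k in range(3)])                                     # Eq. (11.112)
```

Here `sign_check` reproduces the −1 of Eq. (11.90), `I_G` can be compared with
`np.linalg.det(A)`, and the array `I_2` with `np.linalg.det(A) * np.linalg.inv(A)`.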

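Gaussian Integral of Real Grassmann Variables  We can obtain similar expressions
for real-variable Gaussian integrals with an antisymmetric matrix A:

$$\int\prod_{i=1}^n d\eta_i\,e^{-\frac{1}{2}\eta^TA\eta}
 = \begin{cases}\sqrt{\det A} & n \text{ even}\\ 0 & n \text{ odd}\end{cases} \tag{11.113}$$

The proof is more complicated than that for complex variables because the matrix
is antisymmetric in this case. As real-variable integrals are used less than
complex-variable integrals we leave the proof as an exercise.

Problem 11.2

Let $\eta$ be an n-dimensional real Grassmann vector. Expand the Gaussian integral in
a Taylor series:

$$e^{-\frac{1}{2}\eta^TA\eta} = 1 - \frac{1}{2}\eta^TA\eta + \frac{1}{2!}\left(-\frac{1}{2}\eta^TA\eta\right)^2
 + \cdots + \frac{1}{j!}\left(-\frac{1}{2}\eta^TA\eta\right)^j + \cdots \tag{11.114}$$

Since the jth term contains 2j G-variables, the series does not go beyond $j > n/2$.
Show that

(1) For an n-dimensional integral $I_n$

$$I_2 = \int d\eta_1d\eta_2\,e^{-\frac{1}{2}\eta^TA\eta} = \frac{1}{2}(A_{12} - A_{21}) = A_{12} = \sqrt{\det A} \tag{11.115a}$$

$$I_3 = 0 \tag{11.115b}$$

$$\begin{aligned}
I_4 &= \int d\eta_1\cdots d\eta_4\,e^{-\frac{1}{2}\eta^TA\eta}\\
&= \frac{1}{2!}\left(\frac{1}{2}\right)^2\sum_{i,j,k,l=1}^4 A_{ij}A_{kl}\int d\eta_1\cdots d\eta_4\,\eta_i\eta_j\eta_k\eta_l\\
&= (A_{12}A_{34} - A_{13}A_{24} + A_{14}A_{23}) = \sqrt{\det A}
\end{aligned} \tag{11.115c}$$

(2) Prove Eq. (11.113).*

Problem 11.3

Prove that a similar equation to Eq. (11.112) holds for ordinary numbers, namely
for an N-dimensional real vector $x = (x_1, \cdots, x_N)$ and a real symmetric and positive
definite N × N matrix A

$$\int\prod_{i=1}^N dx_i\;x_{k_1}\cdots x_{k_n}\,e^{-\frac{1}{2}x^TAx}
 = \begin{cases}\dfrac{(2\pi)^{N/2}}{\sqrt{\det A}}\displaystyle\sum_{\text{pairings}}
 (A^{-1})_{k_{p_1}k_{p_2}}\cdots(A^{-1})_{k_{p_{n-1}}k_{p_n}} & n = \text{even}\\[2ex]
 0 & n = \text{odd}\end{cases} \tag{11.116}$$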



11.4.2 

Dirac Propagator 

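We are now ready to express the anticommuting Dirac field in terms of Grassmann
variables. We may define a Grassmann field $\psi(x)$ in terms of any set of orthonormal
basis functions:

$$\psi(x) = \sum_j\psi_j\,\phi_j(x) \tag{11.117}$$

The basis functions $\phi_j(x)$ are ordinary c-number functions, but the coefficients
$\psi_j$ are Grassmann numbers. Then the Dirac field can be expressed by taking
$\phi_j(x)$ to be the four-component Dirac spinors $u_r(p)e^{-ip\cdot x}$ and
$v_r(p)e^{ip\cdot x}$. Using the Grassmann-Dirac field, the 2-point correlation function
for the Dirac field is

$$\langle 0|T[\hat\psi(x)\hat{\bar\psi}(y)]|0\rangle
 = \frac{\int\mathcal{D}\bar\psi\,\mathcal{D}\psi\;\psi(x)\bar\psi(y)\,
 \exp\left[i\int d^4x\,\bar\psi(i\not\!\partial - m)\psi\right]}
 {\int\mathcal{D}\bar\psi\,\mathcal{D}\psi\,\exp\left[i\int d^4x\,\bar\psi(i\not\!\partial - m)\psi\right]} \tag{11.118}$$

It is also to be understood that the integral can be analytically continued in the
complex plane of the time variable to ensure the good behavior of the integral or,
equivalently, that a small imaginary part $i\varepsilon$ is implicit to produce the right
propagator. Comparing Eq. (11.118) with Eq. (11.112), we obtain

$$\begin{aligned}
\langle 0|T[\hat\psi(x)\hat{\bar\psi}(y)]|0\rangle &= iS_F(x - y)
 = \left[-i(i\not\!\partial - m)\right]^{-1}\delta^4(x - y)\\
&= \int\frac{d^4p}{(2\pi)^4}\,e^{-ip\cdot(x - y)}\,\frac{i}{\not\!p - m + i\varepsilon}\\
&= \int\frac{d^4p}{(2\pi)^4}\,e^{-ip\cdot(x - y)}\,\frac{i(\not\!p + m)}{p^2 - m^2 + i\varepsilon}
\end{aligned} \tag{11.119}$$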
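The last step in the propagator, rewriting the inverse of $\not\!p - m$ as
$(\not\!p + m)/(p^2 - m^2)$, rests on the Clifford algebra of the γ matrices. A quick
numerical check is sketched below; it is our own addition and assumes the Dirac
(standard) representation of the γ matrices, the metric (+,−,−,−), an arbitrary off-shell
test momentum `p`, and $\varepsilon \to 0$. The helper name `slash` is hypothetical.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
Z2 = np.zeros((2, 2), dtype=complex)
sigma = [
    np.array([[0, 1], [1, 0]], dtype=complex),
    np.array([[0, -1j], [1j, 0]], dtype=complex),
    np.array([[1, 0], [0, -1]], dtype=complex),
]

# Dirac representation (an assumed convention): gamma^0 and gamma^i
gammas = [np.block([[I2, Z2], [Z2, -I2]])]
gammas += [np.block([[Z2, s], [-s, Z2]]) for s in sigma]
metric = np.diag([1.0, -1.0, -1.0, -1.0])


def slash(p_up):
    """Return gamma^mu p_mu for a contravariant four-vector p^mu."""
    p_dn = metric @ p_up
    return sum(p_dn[mu] * gammas[mu] for mu in range(4))


m = 0.5                                  # mass (illustrative value)
p = np.array([1.2, 0.3, -0.4, 0.8])      # arbitrary off-shell momentum p^mu
p2 = float(p @ metric @ p)               # p^2

# Clifford algebra: gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu}
clifford_ok = all(
    np.allclose(gammas[a] @ gammas[b] + gammas[b] @ gammas[a],
                2.0 * metric[a, b] * np.eye(4))
    for a in range(4) for b in range(4)
)

# Inverse of (p-slash - m) compared with (p-slash + m) / (p^2 - m^2)
lhs = np.linalg.inv(slash(p) - m * np.eye(4))
rhs = (slash(p) + m * np.eye(4)) / (p2 - m**2)
print(clifford_ok, np.allclose(lhs, rhs))
```

The check works for any momentum with $p^2 \neq m^2$; on the mass shell the matrix is
singular, which is where the $i\varepsilon$ prescription becomes essential.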



11.4.3 

Generating Functional of the Dirac Field 

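Using the Feynman propagator, the generating functional of the Dirac field is written as

$$\begin{aligned}
Z_0^{\rm Dirac}[\eta, \bar\eta] &= N\int\mathcal{D}\bar\psi\,\mathcal{D}\psi\,
 \exp\left[i\int d^4x\left\{\bar\psi(i\not\!\partial - m)\psi + \bar\eta\psi + \bar\psi\eta\right\}\right]\\
&= \exp\left[-i\int d^4x\,d^4y\,\bar\eta(x)S_F(x - y)\eta(y)\right]
\end{aligned} \tag{11.120}$$

The derivation of the second line from the first is completely in parallel with that
of the scalar field [see Eq. (11.22)]. Then the n-point correlation function can be
derived by successive differentiation of the generating functional. We have to be
careful only about the sign change depending on the position of the Grassmann
variables. The operation $-i\,\delta/\delta\bar\eta$ pulls down $\psi$, but to pull down
$\bar\psi$, $i\,\delta/\delta\eta$ is necessary because it has to be anticommuted through
$\bar\psi$ before it can differentiate $\eta$. If we apply two derivatives to Z to pull
down $\psi(x)\bar\psi(y)$:

$$\begin{aligned}
&\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)
 \int\mathcal{D}\bar\psi\,\mathcal{D}\psi\,
 \exp\left[i\int d^4z\left\{\bar\psi(i\not\!\partial - m)\psi + \bar\eta\psi + \bar\psi\eta\right\}\right]\\
&\quad = \left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)\int\mathcal{D}\bar\psi\,\mathcal{D}\psi\;\psi(x)\,
 \exp\left[i\int d^4z\left\{\bar\psi(i\not\!\partial - m)\psi + \bar\eta\psi + \bar\psi\eta\right\}\right]\\
&\quad = -\int\mathcal{D}\bar\psi\,\mathcal{D}\psi\;\psi(x)\bar\psi(y)\,
 \exp\left[i\int d^4z\left\{\bar\psi(i\not\!\partial - m)\psi + \bar\eta\psi + \bar\psi\eta\right\}\right]
\end{aligned} \tag{11.121}$$

Note the minus sign was added because $\delta/\delta\eta$ has to go through $\psi$, which was
pulled down by the first derivative. If the ordering of the operators and of the
derivatives is the same, no extra sign appears, namely

$$\left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_0[\eta, \bar\eta]}{\delta\eta(y)\,\delta\bar\eta(x)}\right|_{\eta = \bar\eta = 0}
 = \langle 0|T[\hat\psi(x)\hat{\bar\psi}(y)]|0\rangle \tag{11.122}$$

This formula can be extended to a general 2n-point function:

$$\begin{aligned}
&\left(\frac{1}{i}\right)^{2n}\left.\frac{\delta^{2n}Z_0[\eta, \bar\eta]}
 {\delta\eta(y_1)\cdots\delta\eta(y_n)\,\delta\bar\eta(x_1)\cdots\delta\bar\eta(x_n)}\right|_{\eta = \bar\eta = 0}\\
&\quad = \langle 0|T[\hat\psi(x_1)\cdots\hat\psi(x_n)\hat{\bar\psi}(y_1)\cdots\hat{\bar\psi}(y_n)]|0\rangle
\end{aligned} \tag{11.123}$$

To prove that we have the right phase in the above equation, we first show that if
the derivatives are ordered in exactly the same way as the operator products, the
relation between the two is expressed as

$$\begin{aligned}
&(-1)^n\left(\frac{1}{i}\right)^{2n}\left.\frac{\delta^{2n}Z_0[\eta, \bar\eta]}
 {\delta\eta(y_n)\cdots\delta\eta(y_1)\,\delta\bar\eta(x_n)\cdots\delta\bar\eta(x_1)}\right|_{\eta = \bar\eta = 0}\\
&\quad = \langle 0|T[\hat{\bar\psi}(y_n)\cdots\hat{\bar\psi}(y_1)\hat\psi(x_n)\cdots\hat\psi(x_1)]|0\rangle
\end{aligned} \tag{11.124}$$

The proof goes as follows. The first derivative $\delta/\delta\bar\eta(x_1)$ pulls
$\psi(x_1)$ down from Z and produces no sign change. The second derivative has to be
anticommuted with $\psi(x_1)$ before pulling down $\psi(x_2)$ from Z, producing one extra
−1. But it is canceled by moving the pulled-down $\psi(x_2)$ to the left of $\psi(x_1)$.
Similarly, the ith derivative has to be anticommuted i − 1 times before it pulls
$\psi(x_i)$ down from Z, but the generated sign is canceled exactly by moving the pulled
down $\psi$ back to the ith position. The argument is similar for $\delta/\delta\eta$ except
for an extra −1 to go to the right of $\bar\psi$ in the exponent before the derivative is
applied to $\eta$.

Rearranging the ordering of the operators to move all $\bar\psi(y_i)$ to the right of
$\psi(x_j)$ produces the sign $(-1)^{n^2}$. Hence the total number of −1's becomes
$n^2 + n$ and $(-1)^{n^2 + n} = 1$. Renaming $x_i$ and $y_i$ in ascending order recovers
Eq. (11.123).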



11.5 

Reduction Formula 

11.5.1 
Scalar Fields 

Here we introduce the LSZ (Lehmann-Symanzik-Zimmermann) reduction formula [260],
which relates the scattering matrix to n-point vacuum correlation functions. We first
consider the case of scalar particles. It is straightforward to generalize to other types
of particles. We assume the weak asymptotic condition of Eq. (6.26) with Z set to be 1,
namely

$$\lim_{t\to\mp\infty}\langle f|\hat\phi(x)|i\rangle = \langle f|\hat\phi_{\rm in/out}(x)|i\rangle \tag{11.125}$$

The "in" and "out" states are free states, i.e. on the mass shell, and can be described
by the free field. For scalar fields, the free-field operator can be expressed as

$$\hat\phi_{\rm in/out}(x) = \int\frac{d^3q}{(2\pi)^3\sqrt{2\omega_q}}
 \left[a_{\rm in/out}(q)\,e^{-iq\cdot x} + a^\dagger_{\rm in/out}(q)\,e^{iq\cdot x}\right] \tag{11.126}$$

The scattering matrix is given as

$$S_{fi} = \langle{\rm out}: f|i: {\rm in}\rangle = \langle{\rm in}: f|S|i: {\rm in}\rangle
 = \langle{\rm out}: f|S|i: {\rm out}\rangle \tag{11.127}$$

Then we can define the in (out) state vector of definite momentum as

$$\begin{aligned}
|q\ {\rm in/out}\rangle &= \sqrt{2\omega_q}\,a^\dagger_{\rm in/out}(q)|\Omega\rangle
 = -i\sqrt{2\omega_q}\int d^3x\,e_q(x)\overleftrightarrow{\partial_t}\hat\phi_{\rm in/out}(x)|\Omega\rangle\\
&= \lim_{t\to\mp\infty}\left[-i\int d^3x\,f_q(x)\overleftrightarrow{\partial_t}\hat\phi(x)|\Omega\rangle\right]
\end{aligned} \tag{11.128}$$

where

$$e_q = \frac{e^{-iq\cdot x}}{\sqrt{2\omega_q}}, \qquad f_q = e^{-iq\cdot x}, \qquad
 A\overleftrightarrow{\partial_t}B = A\,\partial_tB - (\partial_tA)B \tag{11.129}$$

Here, although not explicitly written, we assume that the operators appear sandwiched
between the free states, as they should be consistent with the condition
Eq. (11.125). Next, we wish to convert the three-dimensional integral into a
Lorentz-invariant four-dimensional integral. To do that we use the identity

$$\left(\lim_{t\to\infty} - \lim_{t\to-\infty}\right)\int d^3x\,A(x) = \int_{-\infty}^{\infty}dt\,\partial_t\int d^3x\,A(x) \tag{11.130}$$

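Consider, for example, an S-matrix element for the scattering of m particles with
momenta $(q_1, q_2, \cdots, q_m)$ to n particles of momenta $(p_1, \cdots, p_n)$. We want to
extract the free field $\hat\phi_{\rm in}(x)$ from it. First note

$$\langle{\rm out}: \beta|\alpha: {\rm in}\rangle
 = \langle{\rm out}: \beta|\sqrt{2\omega_q}\,a^\dagger_{\rm in}(q)|\alpha - q: {\rm in}\rangle \tag{11.131}$$

where we have used $\langle\beta|$, $|\alpha\rangle$ and $|\alpha - q\rangle$ to denote states
containing n, m and m − 1 particles, respectively. In $|\alpha - q\rangle$ a particle having
momentum q is taken out. Then using Eqs. (11.128) and (11.130), Eq. (11.131) becomes

$$\begin{aligned}
\langle{\rm out}: \beta|\alpha: {\rm in}\rangle
 &= \lim_{t\to-\infty}\left[-i\int d^3x\,f_q(x)\overleftrightarrow{\partial_t}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\right]\\
&= i\int d^4x\,\partial_t\left[f_q(x)\overleftrightarrow{\partial_t}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\right]\\
&\quad - \lim_{t\to\infty}\left[i\int d^3x\,f_q(x)\overleftrightarrow{\partial_t}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\right]
\end{aligned} \tag{11.132}$$

The second term is just a creation operator of the "out" state, or an annihilation
operator if it acts on the left, therefore

$$\begin{aligned}
\langle{\rm out}: \beta|a^\dagger_{\rm out}(q)\sqrt{2\omega_q}
 &= \langle{\rm out}: \beta - p_j, p_j|a^\dagger_{\rm out}(q)\sqrt{2\omega_q}\\
&= \langle{\rm out}: \beta - p_j|a_{\rm out}(p_j)\,a^\dagger_{\rm out}(q)\sqrt{2\omega_{p_j}}\sqrt{2\omega_q}\\
&= \sqrt{2\omega_{p_j}2\omega_q}\,\langle{\rm out}: \beta - p_j|
 \left\{[a_{\rm out}(p_j), a^\dagger_{\rm out}(q)] + a^\dagger_{\rm out}(q)\,a_{\rm out}(p_j)\right\}\\
&= 2\omega_{p_j}(2\pi)^3\delta^3(p_j - q)\,\langle{\rm out}: \beta - q|\\
&\quad + \sqrt{2\omega_{p_j}2\omega_q}\,\langle{\rm out}: \beta - p_j|a^\dagger_{\rm out}(q)\,a_{\rm out}(p_j)
\end{aligned} \tag{11.133}$$

The process can be repeated until $a^\dagger_{\rm out}(q)$ goes to the left end and destroys
the vacuum. The term vanishes if $\langle\beta|$ contains no particles with momentum q. This
term is called a "disconnected diagram", because one particle emerges unaffected
by the scattering process and is hence disconnected from the rest of the particles.
The first term on the rhs of Eq. (11.132) gives

$$\begin{aligned}
&i\int d^4x\,\partial_t\left[f_q(x)\overleftrightarrow{\partial_t}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\right]\\
&\quad = -i\int d^4x\left\{\left(\partial_t^2f_q(x)\right) - f_q(x)\,\partial_t^2\right\}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\\
&\quad = i\int d^4x\left\{\left[(-\nabla^2 + m^2)f_q(x)\right] + f_q(x)\,\partial_t^2\right\}
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle\\
&\quad = i\int d^4x\,f_q(x)\,(\partial_t^2 - \nabla^2 + m^2)\,
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle
\end{aligned} \tag{11.134}$$

where in deriving the third line we have used the fact that the plane wave function
$f_q(x)$ satisfies the Klein-Gordon equation and the last line was obtained by partial
integration. We now have a Lorentz-invariant expression. Collecting terms, we find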

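$$\begin{aligned}
\langle{\rm out}: \beta|\alpha: {\rm in}\rangle &= (\text{disconnected diagram})\\
&\quad + i\int d^4x\,f_q(x)\,(\partial_\mu\partial^\mu + m^2)\,
 \langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle
\end{aligned} \tag{11.135}$$

We have extracted one particle from the abstract S-matrix and converted it into a
matrix element of the field operator $\hat\phi$. Next, we extract the second particle with
momentum p from the "out" state. Disregarding the disconnected diagram, we get

$$\begin{aligned}
\langle{\rm out}: \beta|\hat\phi(x)|\alpha - q: {\rm in}\rangle
 &= \langle{\rm out}: \beta - p|a_{\rm out}(p)\sqrt{2\omega_p}\,\hat\phi(x)|\alpha - q: {\rm in}\rangle\\
&= \langle{\rm out}: \beta - p|\hat\phi(x)\sqrt{2\omega_p}\,a_{\rm in}(p)|\alpha - q: {\rm in}\rangle\\
&\quad + \langle{\rm out}: \beta - p|\left\{a_{\rm out}(p)\sqrt{2\omega_p}\,\hat\phi(x)
 - \hat\phi(x)\sqrt{2\omega_p}\,a_{\rm in}(p)\right\}|\alpha - q: {\rm in}\rangle\\
&= \langle{\rm out}: \beta - p|\hat\phi(x)|\alpha - q - p: {\rm in}\rangle\\
&\quad + \lim_{t'\to\infty}\left[i\int d^3x'\,f_p^*(x')\overleftrightarrow{\partial_{t'}}
 \langle{\rm out}: \beta - p|\hat\phi(x')\hat\phi(x)|\alpha - q: {\rm in}\rangle\right]\\
&\quad - \lim_{t'\to-\infty}\left[i\int d^3x'\,f_p^*(x')\overleftrightarrow{\partial_{t'}}
 \langle{\rm out}: \beta - p|\hat\phi(x)\hat\phi(x')|\alpha - q: {\rm in}\rangle\right]
\end{aligned} \tag{11.136a}$$

Disregarding again the disconnected diagram (the first term), the second and the
third terms can be combined to give

$$\begin{aligned}
&i\int d^4x'\,\partial_{t'}\left[f_p^*(x')\overleftrightarrow{\partial_{t'}}
 \langle{\rm out}: \beta - p|T[\hat\phi(x')\hat\phi(x)]|\alpha - q: {\rm in}\rangle\right]\\
&\quad = i\int d^4x'\,f_p^*(x')\,(\partial'_\mu\partial'^\mu + m^2)\,
 \langle{\rm out}: \beta - p|T[\hat\phi(x')\hat\phi(x)]|\alpha - q: {\rm in}\rangle
\end{aligned} \tag{11.136b}$$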

where T[- • • ] is a time-ordered product. The process can be repeated on both sides 
until we extract all the particles. We can always choose the "in" and "out" momen- 
ta so that there are no disconnected diagrams. Then the final expression for the 
g matrix for m incoming particles and n outgoing particles is given by 



(0\S\a) = i m+n j fl d4x 'j fl d 4 x k f; j (x' J )(d/d?' + m 2 )j 

x (Q\T[4>(xi)--4(x' n )4>(x i )--4(x m )\\Q)(t^ + m 2 ) k f qk (x k ) 

(11.137) 

where we have arranged the incoming part of the operation to the right of the 
correlation function, although how it is ordered is irrelevant for the case of boson 
fields. This is called the LSZ reduction formula to convert a S-matrix to a vacuum 
expectation value of an m + n-point correlation function. Note, we have derived the 
above formula in the Heisenberg representation. The field rp is that of interacting 
particles, which contains all the information on the interaction and the 
state is also that of the full, interacting theory. 

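As the n-point time-ordered correlation function, in turn, can be expressed as a
functional derivative [see Eq. (11.37)], the S-matrix expressed in terms of generating
functionals is given by

$$\begin{aligned}
\langle \beta|S|\alpha\rangle &= \int \prod_{j=1}^{n} d^4x'_j \prod_{k=1}^{m} d^4x_k\; f^*_{p_j}(x'_j)\,(\partial'_\mu\partial'^\mu + m^2)_j \\
&\quad\times \left.\frac{\delta^{m+n} Z[J]}{\delta J(x'_1)\cdots\delta J(x'_n)\,\delta J(x_1)\cdots\delta J(x_m)}\right|_{J=0} (\overleftarrow{\partial_\mu\partial^\mu} + m^2)_k\, f_{q_k}(x_k)
\end{aligned} \tag{11.138}$$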
11.5.2
Electromagnetic Field 

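Since the free electromagnetic field (in the Lorentz gauge) obeys the Klein-Gordon
equation with mass = 0 and is expressed as

$$A^\mu(x) = \int \frac{d^3k}{(2\pi)^3\sqrt{2\omega}} \sum_\lambda \left[\varepsilon^\mu(k,\lambda)\,a(k,\lambda)\,e^{-ik\cdot x} + \varepsilon^{\mu *}(k,\lambda)\,a^\dagger(k,\lambda)\,e^{ik\cdot x}\right] \tag{11.139}$$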

the reduction formula of the electromagnetic field is almost the same as that of the 
scalar field, except for the addition of the polarization vector. Namely, 

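$$\begin{aligned}
\langle \beta|S|\alpha\rangle &= i^{m+n}\int \prod_{j=1}^{n} d^4x'_j \prod_{k=1}^{m} d^4x_k\; f^*_{p_j}(x'_j)\,\varepsilon^*_{\rho_j}(p_j,\lambda_j)\,(\partial'_\mu\partial'^\mu)_j \\
&\quad\times \langle\Omega|T[A^{\rho_1}(x'_1)\cdots A^{\rho_n}(x'_n)A^{\sigma_1}(x_1)\cdots A^{\sigma_m}(x_m)]|\Omega\rangle \\
&\quad\times (\overleftarrow{\partial_\mu\partial^\mu})_k\,\varepsilon_{\sigma_k}(q_k,\lambda_k)\,f_{q_k}(x_k)
\end{aligned} \tag{11.140}$$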



We can express the vacuum correlation function in terms of functional derivatives. 
Referring to Eq. (11.37), 

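$$\begin{aligned}
\langle \beta|S|\alpha\rangle &= \int \prod_{j=1}^{n} d^4x'_j \prod_{k=1}^{m} d^4x_k\; f^*_{p_j}(x'_j)\,\varepsilon^*_{\rho_j}(p_j,\lambda_j)\,(\partial'_\mu\partial'^\mu)_j \\
&\quad\times \left.\frac{\delta^{m+n} Z[J]}{\delta J_{\rho_1}(x'_1)\cdots\delta J_{\rho_n}(x'_n)\,\delta J_{\sigma_1}(x_1)\cdots\delta J_{\sigma_m}(x_m)}\right|_{J=0} \\
&\quad\times (\overleftarrow{\partial_\mu\partial^\mu})_k\,\varepsilon_{\sigma_k}(q_k,\lambda_k)\,f_{q_k}(x_k)
\end{aligned} \tag{11.141}$$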

11.5.3 
Dirac Field 

It is straightforward to apply the reduction formula to the Dirac field. The difference
from the scalar field is that we have to be careful about the ordering of the
operators, since the Dirac field is a four-component spinor and anticommuting. As
before, we first postulate the asymptotic condition as

$$\lim_{t\to\mp\infty}\langle f|\psi(x)|i\rangle = \langle f|\psi_{\mathrm{in/out}}(x)|i\rangle \tag{11.142}$$

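where the free Dirac field can be expressed as

$$\psi_{\mathrm{in}}(x) = \int \frac{d^3p}{(2\pi)^3\sqrt{2E_p}} \sum_r \left[a_{\mathrm{in}}(p,r)\,U^r_p(x) + b^\dagger_{\mathrm{in}}(p,r)\,V^r_p(x)\right] \tag{11.143a}$$

$$U^r_p(x) = u_r(p)\,e^{-ip\cdot x}, \qquad V^r_p(x) = v_r(p)\,e^{ip\cdot x} \tag{11.143b}$$

$$(i\not{\partial} - m)\,\psi_{\mathrm{in}}(x) = 0, \qquad \bar\psi_{\mathrm{in}}(x)\,(-i\overleftarrow{\not{\partial}} - m) = 0 \tag{11.143c}$$

$$a_{\mathrm{in}}(p,r) = \frac{1}{\sqrt{2E_p}}\int d^3x\; \bar U^r_p(x)\,\gamma^0\,\psi_{\mathrm{in}}(x) \tag{11.143d}$$

$$b^\dagger_{\mathrm{in}}(p,r) = \frac{1}{\sqrt{2E_p}}\int d^3x\; \bar V^r_p(x)\,\gamma^0\,\psi_{\mathrm{in}}(x) \tag{11.143e}$$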

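The reduction formula is obtained following the same logical flow as for the scalar
field:

$$\begin{aligned}
\langle \mathrm{out}: p\,|\,q : \mathrm{in}\rangle &= \langle \mathrm{out}: p\,|\,\sqrt{2E_q}\,a^\dagger_{\mathrm{in}}(q,r)\,|\,\Omega : \mathrm{in}\rangle \\
&= -\langle \mathrm{out}: p\,|\,[a^\dagger_{\mathrm{out}}(q,r)\sqrt{2E_q} - \sqrt{2E_q}\,a^\dagger_{\mathrm{in}}(q,r)]\,|\,\Omega : \mathrm{in}\rangle \\
&\quad + \langle \mathrm{out}: p\,|\,\sqrt{2E_q}\,a^\dagger_{\mathrm{out}}(q,r)\,|\,\Omega : \mathrm{in}\rangle \\
&= -\Big(\lim_{t\to\infty} - \lim_{t\to-\infty}\Big)\int d^3x\,\langle \mathrm{out}: p\,|\,\bar\psi(x)\,|\,\Omega : \mathrm{in}\rangle\,\gamma^0 U^r_q(x) + \text{disconnected part} \\
&= i\int d^4x\,\langle \mathrm{out}: p\,|\,i\partial_0\{\bar\psi(x)\gamma^0 U^r_q(x)\}\,|\,\Omega : \mathrm{in}\rangle + (\cdots) \\
&= i\int d^4x\,\langle \mathrm{out}: p\,|\,\{\bar\psi(x)\,i\overleftarrow{\partial_0}\gamma^0\,U^r_q(x) + \bar\psi(x)\,i\overrightarrow{\partial_0}\gamma^0\,U^r_q(x)\}\,|\,\Omega : \mathrm{in}\rangle + (\cdots)
\end{aligned} \tag{11.144}$$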

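We convert the time derivative of $U_q(x)$ to a space derivative using the equation of
motion

$$\begin{aligned}
\int d^4x\,\bar\psi(x)\,i\overrightarrow{\partial_0}\gamma^0\,U^r_q(x) &= \int d^4x\,\bar\psi(x)\,\{-i\boldsymbol{\gamma}\cdot\overrightarrow{\nabla} + m\}\,U^r_q(x) \\
&= \int d^4x\,\bar\psi(x)\,\{i\boldsymbol{\gamma}\cdot\overleftarrow{\nabla} + m\}\,U^r_q(x)
\end{aligned} \tag{11.145}$$

where the last equation is obtained by partial integration. Then the first term in the
last equality of Eq. (11.144) is combined to become

$$\langle \mathrm{out}: p\,|\,q : \mathrm{in}\rangle = i\int d^4x\,\langle \mathrm{out}: p\,|\,\bar\psi(x)\,|\,\Omega : \mathrm{in}\rangle\,(i\overleftarrow{\not{\partial}} + m)\,U^r_q(x) \tag{11.146}$$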
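Similar calculations can be carried out and give

$$\begin{aligned}
\sqrt{2E_q}\,\langle \mathrm{out}: f\,|\,a^\dagger_{\mathrm{in}}(q)\,|\,i : \mathrm{in}\rangle &= (-i)\int d^4x\,\langle f|\bar\psi(x)|i\rangle\,(-i\overleftarrow{\not{\partial}} - m)_x\,U_q(x) \\
\sqrt{2E_p}\,\langle \mathrm{out}: f\,|\,a_{\mathrm{out}}(p)\,|\,i : \mathrm{in}\rangle &= (-i)\int d^4y\,\bar U_p(y)\,(i\not{\partial} - m)_y\,\langle f|\psi(y)|i\rangle \\
\sqrt{2E_{q'}}\,\langle \mathrm{out}: f\,|\,b^\dagger_{\mathrm{in}}(q')\,|\,i : \mathrm{in}\rangle &= i\int d^4x'\,\bar V_{q'}(x')\,(i\not{\partial} - m)_{x'}\,\langle f|\psi(x')|i\rangle \\
\sqrt{2E_{p'}}\,\langle \mathrm{out}: f\,|\,b_{\mathrm{out}}(p')\,|\,i : \mathrm{in}\rangle &= i\int d^4y'\,\langle f|\bar\psi(y')|i\rangle\,(-i\overleftarrow{\not{\partial}} - m)_{y'}\,V_{p'}(y')
\end{aligned} \tag{11.147}$$

For the one-particle transition $q \to p$, we have

$$\begin{aligned}
\langle \mathrm{out}: p\,|\,q : \mathrm{in}\rangle &= (-i)^2\int d^4x\,d^4y\;\bar U_p(y)\,(i\not{\partial} - m)_y\,\langle\Omega|T[\psi(y)\bar\psi(x)]|\Omega\rangle \\
&\quad\times (-i\overleftarrow{\not{\partial}} - m)_x\,U_q(x)
\end{aligned} \tag{11.148}$$
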

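We now have to extend the reduction formula to many-particle systems. As the
ordering of the operators is important, we begin with the transition amplitude. In
the transition amplitude, particles are arranged in such a way that the subscripts
increase from left to right and antiparticles are placed to the right of particles. Namely,
denoting the momenta by

$$q_1 + \cdots + q_m + \bar q_1 + \cdots + \bar q_{m'} \;\to\; p_1 + \cdots + p_n + \bar p_1 + \cdots + \bar p_{n'},$$

the transition amplitude can be written as

$$\begin{aligned}
\langle \mathrm{out}: \beta\,|\,\alpha : \mathrm{in}\rangle &= \langle \mathrm{out}: p_1,\cdots,p_n;\,\bar p_1,\cdots,\bar p_{n'}\,|\,q_1,\cdots,q_m;\,\bar q_1,\cdots,\bar q_{m'} : \mathrm{in}\rangle \\
&= (i)^{m'+n'}(-i)^{m+n}\int \prod_{i=1}^{m} d^4x_i \prod_{i'=1}^{m'} d^4x'_{i'} \prod_{j=1}^{n} d^4y_j \prod_{j'=1}^{n'} d^4y'_{j'} \\
&\quad\times \bar U_{p_j}(y_j)\,(i\not{\partial} - m)_{y_j}\;\bar V_{\bar q_{i'}}(x'_{i'})\,(i\not{\partial} - m)_{x'_{i'}} \\
&\quad\times \langle\Omega|T[\psi(y_1)\cdots\psi(y_n)\,\bar\psi(y'_1)\cdots\bar\psi(y'_{n'})\,\bar\psi(x_1)\cdots\bar\psi(x_m)\,\psi(x'_1)\cdots\psi(x'_{m'})]|\Omega\rangle \\
&\quad\times (-i\overleftarrow{\not{\partial}} - m)_{x_i}\,U_{q_i}(x_i)\;(-i\overleftarrow{\not{\partial}} - m)_{y'_{j'}}\,V_{\bar p_{j'}}(y'_{j'})
\end{aligned} \tag{11.149}$$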
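Here, the ordering of the operators matches that in $\langle \mathrm{out}: \beta\,|\,\alpha : \mathrm{in}\rangle$ exactly.
Rearrangement of the operators produces additional sign changes, of course. To express
the above formula in terms of functional derivatives, the vacuum expectation value
of the operator T-product can be replaced by the functional derivatives

$$\psi(x) \to \frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}, \qquad \bar\psi(x) \to -\frac{1}{i}\frac{\delta}{\delta\eta(x)} \tag{11.150}$$

provided we keep the ordering of the functional derivatives exactly as that of the
operators in the T-product [see Eq. (11.124)]. In the following, we only give the case
for $n = m$, $n' = m' = 0$:

$$\begin{aligned}
\langle \mathrm{out}: p_1\cdots p_n\,|\,q_1\cdots q_n : \mathrm{in}\rangle &= \int \prod_{i=1}^{n} d^4x_i \prod_{j=1}^{n} d^4y_j\;\bar U_{p_j}(y_j)\,(i\not{\partial} - m)_{y_j} \\
&\quad\times \left.\frac{\delta^{2n} Z_0[\bar\eta,\eta]}{\delta\eta(x_n)\cdots\delta\eta(x_1)\,\delta\bar\eta(y_1)\cdots\delta\bar\eta(y_n)}\right|_{\eta=\bar\eta=0} (-i\overleftarrow{\not{\partial}} - m)_{x_i}\,U_{q_i}(x_i)
\end{aligned} \tag{11.151}$$

where the phases are exactly canceled first by that of the functional derivatives and
second by moving the $\delta/\delta\bar\eta(y_j)$'s to the right of the $\delta/\delta\eta(x_i)$'s.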



11.6 
QED 

We are now ready to apply the method of the path integral to actual calculation
of QED processes. We shall derive the scattering amplitude for reactions such as
Compton and e-μ scattering and show that they produce the same results as
calculated using the canonical quantization method.

11.6.1 

We now want to write down the scattering matrix describing the interactions of 
electrons and photons. In the previous section, we learned to express the scattering 
matrix in the form of the LSZ reduction formula. As it is expressed in terms of 
the functional derivatives of the generating functional of the interacting fields, the 
remaining task is to derive the generating functional of interacting fields involving 
the Dirac and electromagnetic fields. 

In QED, which describes phenomena involving the electron (a Dirac field) interacting
with the electromagnetic field, the whole Lagrangian $\mathcal{L}_{\mathrm{QED}}$ can be obtained
from the free Lagrangian of the electron and the electromagnetic field by replacing
the derivative $\partial_\mu$ by the covariant derivative $D_\mu = \partial_\mu + iqA_\mu$, a procedure called
"gauging".



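First we write the gauge-invariant full Lagrangian for the electron interacting
with the electromagnetic field (photon).

$$\mathcal{L}_{\mathrm{free}} \to \mathcal{L}_{\mathrm{QED}} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \bar\psi\,(i\not{D} - m)\,\psi \tag{11.152a}$$

$$\not{D} = \gamma^\mu(\partial_\mu + iqA_\mu), \qquad (q = -e)$$

$$\mathcal{L}_{\mathrm{QED}} = \mathcal{L}_0 + \mathcal{L}_{\mathrm{int}} \tag{11.152b}$$

$$\mathcal{L}_0 = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \bar\psi\,(i\not{\partial} - m)\,\psi, \qquad \mathcal{L}_{\mathrm{int}} = -q\,\bar\psi\gamma^\mu\psi\,A_\mu$$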

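The action of the generating functional of the interacting electrons and photons
should include, in addition to $\mathcal{L}$ of Eq. (11.152b), a gauge-fixing term

$$-\frac{1}{2\lambda}(\partial_\mu A^\mu)^2 \tag{11.153}$$

and source terms, and hence is not gauge invariant. We adopt the Feynman gauge
and set $\lambda = 1$. Then we can write down the generating functional:

$$Z[\bar\eta,\eta,J] = N\int \mathcal{D}\bar\psi\,\mathcal{D}\psi\,\mathcal{D}A\,\exp\left[i\int d^4x\left\{\mathcal{L}_{\mathrm{QED}} - \frac{1}{2}(\partial_\mu A^\mu)^2 + J_\mu A^\mu + \bar\eta\psi + \bar\psi\eta\right\}\right] \tag{11.154}$$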

where N is a normalization constant to make $Z[0, 0, 0] = 1$. After the standard
manipulation to integrate the bilinear terms [refer to Eqs. (11.66) and (11.120)], the
generating functional $Z_0[\bar\eta,\eta,J]$ for the free fields, namely those with $\mathcal{L}_{\mathrm{int}} = 0$,
can be expressed as

$$Z_0[\bar\eta,\eta,J] = \exp\left[-i\int d^4x\,d^4y\left\{\frac{1}{2}J^\mu(x)D_{F\mu\nu}(x - y)J^\nu(y) + \bar\eta(x)S_F(x - y)\eta(y)\right\}\right] \tag{11.155}$$

We learned in Sect. 11.2.5 that the generating functional of the interacting fields
can be obtained by replacing the field in the interaction Lagrangian by its functional
derivatives [see Eq. (11.53)]. In QED, we replace the fields in the interaction
Lagrangian by

$$\psi \to \frac{1}{i}\frac{\delta}{\delta\bar\eta}, \qquad \bar\psi \to -\frac{1}{i}\frac{\delta}{\delta\eta}, \qquad A_\mu \to \frac{1}{i}\frac{\delta}{\delta J^\mu} \tag{11.156}$$
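Therefore, the generating functional for QED is given by

$$\begin{aligned}
Z[\bar\eta,\eta,J] &= N\exp\left[i\int d^4x\,\mathcal{L}_{\mathrm{int}}\!\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta},\,-\frac{1}{i}\frac{\delta}{\delta\eta},\,\frac{1}{i}\frac{\delta}{\delta J^\mu}\right)\right] Z_0[\bar\eta,\eta,J] \\
&= N\exp\left[\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right)\right] Z_0[\bar\eta,\eta,J]
\end{aligned} \tag{11.157}$$

We retain the minus sign in front of the coupling constant because the action is
$iS = i\int d^4x\,\mathcal{L}_{\mathrm{int}} = \int d^4x\,\{-iV(x)\}$ and the perturbation expansion is carried out
in powers of minus the coupling constant, i.e. in $(-iq)^n$.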

11.6.2 

Perturbative Expansion 



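The exponential of the interaction part in Eq. (11.157) can be Taylor-expanded in powers of
the coupling strength $|q|$:

$$Z[\bar\eta,\eta,J] = N\sum_{n=0}^{\infty}\frac{1}{n!}\left[\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right)\right]^n Z_0[\bar\eta,\eta,J] \tag{11.158}$$

Note, $\eta$ and $\bar\eta$ are actually four-component spinors and the above expression is an
abbreviation of

$$\frac{\delta}{\delta\eta}\,\gamma^\mu\,\frac{\delta}{\delta\bar\eta} = \sum_{l,m}\frac{\delta}{\delta\eta_l}\,(\gamma^\mu)_{lm}\,\frac{\delta}{\delta\bar\eta_m} \tag{11.159}$$
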

In the following we calculate the first- and second-order terms explicitly and ex- 
press these results graphically. The Feynman rules that establish the connection 
between the algebraic and graphical representations are very useful in understand- 
ing the so far very formal treatment of the functional derivatives and are of great 
help in performing actual calculations. 



11.6.3 

First-Order Interaction 



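The first-order interaction term is given by

$$Z_1 = N\left\{\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right)\right\} Z_0[\bar\eta,\eta,J] \tag{11.160a}$$

$$\begin{aligned}
&= N\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta_l(x)}\right)(-iq\gamma^\mu)_{lm}\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta_m(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right) \\
&\quad\times \exp\left[-i\int d^4z_2\,d^4z_3\left\{\frac{1}{2}J^\rho(z_2)D_{F\rho\sigma}(z_2 - z_3)J^\sigma(z_3) + \bar\eta_s(z_2)\{S_F(z_2 - z_3)\}_{st}\,\eta_t(z_3)\right\}\right]
\end{aligned} \tag{11.160b}$$

$$\begin{aligned}
&= N\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta_l(x)}\right)(-iq\gamma^\mu)_{lm} \\
&\quad\times \left\{\int d^4z_1\,d^4z_3\,D_{F\mu\sigma}(x - z_1)J^\sigma(z_1)\,\{S_F(x - z_3)\}_{mn}\,\eta_n(z_3)\right\} \\
&\quad\times \exp\left[-i\int d^4w_2\,d^4w_3\left\{\frac{1}{2}J^\rho(w_2)D_{F\rho\sigma}(w_2 - w_3)J^\sigma(w_3) + \bar\eta_s(w_2)\{S_F(w_2 - w_3)\}_{st}\,\eta_t(w_3)\right\}\right] \\
&= \Big[\,iN\int d^4x\,d^4z_1\,D_{F\mu\sigma}(x - z_1)J^\sigma(z_1)\,(-iq\gamma^\mu)_{lm}\{S_F(x - x)\}_{ml} \\
&\quad + N\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,D_{F\mu\sigma}(x - z_1)J^\sigma(z_1) \\
&\qquad\times (-iq\gamma^\mu)_{lm}\{S_F(x - z_3)\}_{mn}\,\eta_n(z_3)\,\bar\eta_k(z_2)\{S_F(z_2 - x)\}_{kl}\Big]\,Z_0[\bar\eta,\eta,J] \\
&= \Big[\,iN\int d^4x\,d^4z_1\,\mathrm{Tr}[(-iq\gamma^\mu)S_F(x - x)]\,D_{F\mu\nu}(x - z_1)J^\nu(z_1) \\
&\quad + (-1)N\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,D_{F\mu\nu}(x - z_1)J^\nu(z_1) \\
&\qquad\times \bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\Big]\,Z_0[\bar\eta,\eta,J]
\end{aligned} \tag{11.160c}$$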

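The first term vanishes because

$$\mathrm{Tr}[\gamma^\mu S_F(0)] = \int\frac{d^4p}{(2\pi)^4}\,\frac{\mathrm{Tr}[\gamma^\mu(\not{p} + m)]}{p^2 - m^2 + i\varepsilon} = \int\frac{d^4p}{(2\pi)^4}\,\frac{4p^\mu}{p^2 - m^2 + i\varepsilon} = 0 \tag{11.161}$$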
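
The vanishing of the tadpole term rests on the trace identity Tr[γ^μ(p̸ + m)] = 4p^μ, which makes the integrand odd in p. The following sketch checks that identity numerically, together with (p̸ − m)(p̸ + m) = (p² − m²)·1, the relation behind writing the momentum-space propagator as (p̸ + m)/(p² − m²). It is only an illustration: the Dirac representation of the gamma matrices, the metric signature (+, −, −, −), the test momentum and all helper names are assumptions of this sketch, not notation taken from the text.

```python
# Pure-Python check of two Dirac-algebra identities (illustrative sketch).
# Assumptions: Dirac representation, metric signature (+, -, -, -).

def zeros():
    return [[0j] * 4 for _ in range(4)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]

def trace(a):
    return sum(a[i][i] for i in range(4))

def neg(a):
    return [[-x for x in row] for row in a]

def block(tl, tr, bl, br):
    """Assemble a 4x4 complex matrix from four 2x2 blocks."""
    m = zeros()
    for i in range(2):
        for j in range(2):
            m[i][j] = complex(tl[i][j])
            m[i][j + 2] = complex(tr[i][j])
            m[i + 2][j] = complex(bl[i][j])
            m[i + 2][j + 2] = complex(br[i][j])
    return m

I2 = [[1, 0], [0, 1]]
O2 = [[0, 0], [0, 0]]
SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

# gamma^0 = diag(1, 1, -1, -1); gamma^i has sigma_i / -sigma_i off-diagonal blocks
GAMMA = [block(I2, O2, O2, neg(I2))] + [block(O2, s, neg(s), O2) for s in SIGMA]
METRIC = [1, -1, -1, -1]

def slash(p):
    """p-slash = gamma^mu p_mu for contravariant components p = (E, px, py, pz)."""
    m = zeros()
    for mu in range(4):
        for i in range(4):
            for j in range(4):
                m[i][j] += GAMMA[mu][i][j] * METRIC[mu] * p[mu]
    return m

def plus_mass(a, mass):
    """Return a + mass * identity."""
    return [[a[i][j] + (mass if i == j else 0) for j in range(4)]
            for i in range(4)]

p = (2.0, 0.3, -0.4, 1.2)   # arbitrary (off-shell) test momentum
m = 0.5                     # arbitrary test mass

numerator = plus_mass(slash(p), m)                       # (p-slash + m)
traces = [trace(matmul(GAMMA[mu], numerator)) for mu in range(4)]
p_squared = sum(METRIC[mu] * p[mu] ** 2 for mu in range(4))
product = matmul(plus_mass(slash(p), -m), numerator)     # (p-slash - m)(p-slash + m)

for mu in range(4):
    print(mu, traces[mu], 4 * p[mu])
```

Because the trace is linear in p^μ while the denominator depends only on p², the integrand changes sign under p → −p and the symmetric integral is zero.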
Note the extra (−1) phase in the second term of the last line, which was generated
by moving $\bar\eta$ to the left of $\eta$. This does not happen for boson fields. The final
expression for the first-order generating functional is

$$\begin{aligned}
Z_1[\bar\eta,\eta,J] &= -N\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,D_{F\mu\nu}(x - z_1)J^\nu(z_1) \\
&\quad\times \bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\,Z_0[\bar\eta,\eta,J]
\end{aligned} \tag{11.162}$$

The expression $Z_1/Z_0$ is pictorially illustrated in Figure 11.1.

It represents several different processes. The reduction formula decides which
one to pick up. In Figure 11.1a, an electron is created by a source denoted by a
cross at $z_3$, goes upward in time, interacts at x with the electromagnetic field and is
destroyed by a source (or sink) at $z_2$. Here, a photon is created at $z_1$ and is absorbed
by the electron. But in (b), the photon is emitted by the electron at x and destroyed
by the source (sink). The electromagnetic field is real and hence the photon is its
own antiparticle. This is why the same source can act both as a source to create a
photon and as a sink (negative source) to destroy it. In Figure 11.1c, the sense of
time direction is reversed to denote a positron (electron's antiparticle) created at $z_3$
and destroyed at $z_2$; otherwise it is the same as (a).

The pictorial description of the expression is as follows:

1. Crosses represent sources or sinks ($\eta$, $\bar\eta$, $J_\nu$).

2. Dots represent interactions. "$-iq\gamma^\mu$" denotes an interaction in which a Dirac
spinor comes in and goes out, at the same time emitting or absorbing a photon.

3. Lines with an arrow represent the electron propagator, $S_F(x - z_3)$ going from
$z_3$ to x and $S_F(z_2 - x)$ from x to $z_2$. The arrow is directed downward if the
line represents an antiparticle (positron). The spin state of the electron was
suppressed but can be attached if necessary.

4. Wavy lines represent the photon propagator $D_{F\mu\nu}(x - z_1)$.



Figure 11.1 (a) An electron is created at the lower corner marked by a cross ($z_3$), interacts at the
vertex (x) and is destroyed at the upper corner ($z_2$). A photon is created by a source at $z_1$
and absorbed by the electron at the vertex (a) or emitted by the electron and destroyed by
the source (b). (c) The same as (a) except a positron is created at $z_3$ and destroyed at $z_2$.
(d) Electron-positron pair creation by a photon. (e) Pair annihilation to a photon.



11.6.4 

Mott Scattering 

As a preparatory step to quantum field theory of the scattering matrix, we first describe
a semiclassical treatment of an electron scattered by a fixed Coulomb potential.
This applies to the scattering of electrons by heavy nuclei. The nucleus is very heavy 
and its recoil can be neglected in low-energy scattering. Then the electric field cre- 
ated by the nucleus can be treated classically and only the electrons are treated 
using the quantized field. The electromagnetic field is not replaced by a functional 
derivative but is a classical field; it is represented by a Coulomb potential 

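$$A^\mu = (\phi, 0, 0, 0), \qquad \phi = \frac{Zq'}{4\pi r} \tag{11.163}$$

where the electric charge of the nucleus is taken to be $Zq'$ and that of the electron
$q = -e$. The generating functional becomes

$$\begin{aligned}
Z_1 &= N\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)A_\mu(x) \\
&\quad\times \exp\left[-i\int d^4z_2\,d^4z_3\,\bar\eta(z_2)S_F(z_2 - z_3)\eta(z_3)\right]
\end{aligned} \tag{11.164a}$$

$$\begin{aligned}
&= N\Big[-i\int d^4x\,\mathrm{Tr}[(-iq\gamma^\mu)S_F(x - x)]\,A_\mu(x) \\
&\quad + \int d^4x\,d^4z_2\,d^4z_3\,\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\,A_\mu(x)\Big] \\
&\quad\times \exp\left[-i\int d^4w_2\,d^4w_3\,\bar\eta(w_2)S_F(w_2 - w_3)\eta(w_3)\right]
\end{aligned} \tag{11.164b}$$
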

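The first term does not contribute. The reduction formula Eq. (11.148) for an electron
of momentum p coming in and one of p' going out becomes

$$\begin{aligned}
S_{fi} &= \langle \mathrm{out}: p'\,|\,p : \mathrm{in}\rangle \\
&= (-i)^2\int d^4w_2\,d^4w_3\,\bar U_{p'}(w_2)\,(i\not{\partial} - m)_{w_2}\,\langle\Omega|T[\psi(w_2)\bar\psi(w_3)]|\Omega\rangle\,(-i\overleftarrow{\not{\partial}} - m)_{w_3}\,U_p(w_3) \\
&= \int d^4x\,d^4w_2\,d^4w_3\,\bar U_{p'}(w_2)\,(i\not{\partial} - m)_{w_2} \\
&\quad\times S_F(w_2 - x)(-iq\gamma^\mu)S_F(x - w_3)\,(-i\overleftarrow{\not{\partial}} - m)_{w_3}\,U_p(w_3)\,A_\mu(x)
\end{aligned} \tag{11.165}$$

From the properties of the Feynman propagators and the Dirac wave functions

$$\begin{aligned}
(i\not{\partial} - m)_{w_2}\,S_F(w_2 - x) &= \delta^4(w_2 - x) \\
S_F(x - w_3)\,(-i\overleftarrow{\not{\partial}} - m)_{w_3} &= \delta^4(x - w_3) \\
U_p(x) = u_r(p)\,e^{-ip\cdot x}, &\qquad \bar U_{p'}(x) = \bar u_s(p')\,e^{ip'\cdot x}
\end{aligned} \tag{11.166}$$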
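the scattering amplitude becomes

$$\begin{aligned}
S_{fi} &= \int d^4x\,e^{-i(p - p')\cdot x}\,\bar u_s(p')(-iq\gamma^\mu)u_r(p)\,A_\mu(x) \\
&= -2\pi i\,\delta(E' - E)\int d^3x\,e^{i(\mathbf{p} - \mathbf{p}')\cdot\mathbf{x}}\left(\frac{Zqq'}{4\pi r}\right)\bar u_s(p')\gamma^0 u_r(p) \\
|\mathbf{q}|^2 &= |\mathbf{p} - \mathbf{p}'|^2 = 4|\mathbf{p}|^2\sin^2\frac{\theta}{2}
\end{aligned} \tag{11.167}$$

where $\alpha = e^2/4\pi \simeq 1/137$ is the fine structure constant (so that $qq'/4\pi = -\alpha$ for $q' = e$)
and $\theta$ is the scattering angle. Equation (11.167) has reproduced the Mott scattering cross section formula
Eq. (6.136).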
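
As a numerical illustration of the kinematics above, the sketch below evaluates the momentum transfer |q|² = 4|p|² sin²(θ/2) and the Mott cross section in its standard textbook form, dσ/dΩ = 4Z²α²E²/|q|⁴ · (1 − β² sin²(θ/2)). Eq. (6.136) is not reproduced in this section, so we assume it is this standard form. The function names, the choice of MeV units and the sample inputs are hypothetical.

```python
import math

ALPHA = 1 / 137.035999          # fine structure constant
ELECTRON_MASS_MEV = 0.51099895  # electron mass in MeV

def momentum_transfer_squared(p, theta):
    """|q|^2 = 4 |p|^2 sin^2(theta/2) for elastic scattering off a fixed centre."""
    return 4.0 * p * p * math.sin(theta / 2.0) ** 2

def mott_cross_section(Z, p, theta, mass=ELECTRON_MASS_MEV):
    """Standard Mott formula, dsigma/dOmega in natural units (MeV^-2).

    Assumed form: 4 Z^2 alpha^2 E^2 / |q|^4 * (1 - beta^2 sin^2(theta/2)).
    """
    energy = math.hypot(p, mass)
    beta = p / energy
    s2 = math.sin(theta / 2.0) ** 2
    q2 = momentum_transfer_squared(p, theta)
    return 4.0 * (Z * ALPHA) ** 2 * energy ** 2 / q2 ** 2 * (1.0 - beta ** 2 * s2)

def rutherford_cross_section(Z, p, theta, mass=ELECTRON_MASS_MEV):
    """Spinless limit: Z^2 alpha^2 / (4 p^2 beta^2 sin^4(theta/2))."""
    energy = math.hypot(p, mass)
    beta = p / energy
    s2 = math.sin(theta / 2.0) ** 2
    return (Z * ALPHA) ** 2 / (4.0 * p * p * beta * beta * s2 * s2)

if __name__ == "__main__":
    Z, p, theta = 79, 10.0, math.radians(60.0)   # hypothetical sample values
    ratio = mott_cross_section(Z, p, theta) / rutherford_cross_section(Z, p, theta)
    print("|q|^2 [MeV^2]:", momentum_transfer_squared(p, theta))
    print("Mott / Rutherford:", ratio)
```

The ratio of the two functions is the spin factor 1 − β² sin²(θ/2), which tends to 1 for slow electrons and suppresses backward scattering for relativistic ones.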

11.6.5 

Second-Order Interaction 

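Referring to Eq. (11.162), the second-order interaction is given by

$$\begin{aligned}
Z_2 &= \frac{N}{2!}\left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\} \\
&\quad\times \left\{\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right)\right\} Z_0[\bar\eta,\eta,J] \\
&= \frac{1}{2}\left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\} \\
&\quad\times \Big[-\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - z_3)\eta(z_3)\,D_{F\mu\nu}(x - z_1)J^\nu(z_1)\,Z_0[\bar\eta,\eta,J]\Big] \\
&= \frac{1}{2}\left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\} \\
&\quad\times \Big[\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,C[\bar\eta,\eta,J]\,Z_0[\bar\eta,\eta,J]\Big]
\end{aligned} \tag{11.168}$$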

where we have separated the part of the fields already pulled out of $Z_0[\bar\eta,\eta,J]$
by the first interaction derivatives and abbreviated it as $C[\bar\eta,\eta,J]$. Then the effect
of the second interaction derivatives on $Z_0$ is to pull an electron or photon line
with external source out of $Z_0$ and connect it to the second vertex at y. Namely,
it adds an incoming or outgoing particle to the vertex at y. On the other hand, if
the derivatives act on $C[\bar\eta,\eta,J]$, they connect either an electron line or photon line
from the vertex x to the vertex y. Namely, it draws an internal line between the two
vertices. Depending on which of the terms ($C[\bar\eta,\eta,J]$ or $Z_0[\bar\eta,\eta,J]$) are acted on
by the three functional derivatives, there are six distinct processes, which will be
explained one by one.



Disconnected Diagrams When all three derivatives act on $Z_0[\bar\eta,\eta,J]$, they pull
down another trio of external lines and there are no internal lines to connect x
and y. The contribution is

$$\begin{aligned}
Z_{21} &= \frac{1}{2}\Big[-\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - z_3)\eta(z_3)\,D_{F\mu\nu}(x - z_1)J^\nu(z_1)\Big] \\
&\quad\times \Big[-\int d^4y\,d^4w_1\,d^4w_2\,d^4w_3\,\bar\eta(w_2)S_F(w_2 - y)(-iq\gamma^\rho) \\
&\qquad\times S_F(y - w_3)\eta(w_3)\,D_{F\rho\sigma}(y - w_1)J^\sigma(w_1)\Big]\,Z_0[\bar\eta,\eta,J] \\
&= \frac{1}{2}\left(\frac{Z_1}{Z_0}\right)^2 Z_0
\end{aligned} \tag{11.169}$$

where $Z_1$ is defined by Eq. (11.162). This is two separate processes generated by
two independent "trios of external sources", each one identical to the first-order
interaction, and is depicted in Fig. 11.2.



Figure 11.2 Among the second-order processes, these disconnected diagrams (a, b)
describe two independent processes, each one
identical to the first-order interaction in Fig. 11.1.



This is the so-called disconnected diagram and describes two independent processes,
each of which has already appeared in the lower order perturbation. We
can eliminate the disconnected diagrams by using $W[\bar\eta,\eta,J] = -i\ln Z[\bar\eta,\eta,J]$
instead of $Z[\bar\eta,\eta,J]$, which will be explained later in Sect. 11.6.7.

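Four External Electron Lines This is the case when both $\delta/\delta\bar\eta$ and $\delta/\delta\eta$ act on
$Z_0$ and pull down two electron lines while $\delta/\delta J$ acts on C and connects the two
vertices by a photon line. This contributes to electron-electron or Bhabha ($e^-$-$e^+$)
scattering (see Fig. 11.3):

$$\begin{aligned}
Z_{22} &= \frac{1}{2}\left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\} \\
&\quad\times \Big[-\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,\{\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - z_3)\eta(z_3)\,D_{F\mu\nu}(x - z_1)J^\nu(z_1)\}\,Z_0[\bar\eta,\eta,J]\Big] \\
&= \frac{i}{2}N\int d^4x\,d^4y\,d^4z_2\,d^4z_3\,d^4w_2\,d^4w_3 \\
&\quad\times \{\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\}\,D_{F\mu\rho}(x - y) \\
&\quad\times \{\bar\eta(w_2)S_F(w_2 - y)(-iq\gamma^\rho)S_F(y - w_3)\eta(w_3)\}\,Z_0
\end{aligned} \tag{11.170}$$

Figure 11.3 Four-point functions: Diagrams that contribute to (a) electron-electron and (b)
Bhabha ($e^-$-$e^+$) scattering.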

The S-matrix of the process will be discussed later. 

Compton-Type Process Here $\delta Z_0/\delta J$ and $\delta Z_0/\delta\bar\eta$ pull down one more photon
line and an electron line while the remaining $\delta C/\delta\eta$ connects the two vertices by
an electron line. This gives the first term in the next equation and can represent
Compton scattering, pair annihilation into two gammas or pair creation by two
photons. Another one where the role of $\eta$ and $\bar\eta$ is exchanged gives the second
term:

$$\begin{aligned}
Z_{23} &= \frac{1}{2}\left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\} \\
&\quad\times \Big[-\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - z_3)\eta(z_3)\,D_{F\mu\nu}(x - z_1)J^\nu(z_1)\,Z_0[\bar\eta,\eta,J]\Big] \\
&= \frac{i}{2}N\int d^4x\,d^4y\,d^4z_1\,d^4z_3\,d^4w_1\,d^4w_3\,\{J^\sigma(w_1)D_{F\sigma\rho}(w_1 - y) \\
&\quad\times D_{F\mu\nu}(x - z_1)J^\nu(z_1)\}\,\Big\{\bar\eta(z_3)S_F(z_3 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - y)(-iq\gamma^\rho)S_F(y - w_3)\eta(w_3) \\
&\qquad + \bar\eta(w_3)S_F(w_3 - y) \\
&\qquad\times (-iq\gamma^\rho)S_F(y - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\Big\}\,Z_0
\end{aligned} \tag{11.171}$$

The second term is identical to the first and compensates the factor two. This is
seen if we exchange the variables $x \leftrightarrow y$, $z_3 \leftrightarrow w_3$, $z_1 \leftrightarrow w_1$ and $\mu \leftrightarrow \rho$. Note, the
diagrams (a-d) in Fig. 11.4 are described using the notation of the first term, but
every diagram is included in both terms as every variable is integrated and can be
renamed. The S-matrix of the process will be treated later.

Figure 11.4 Four-point functions: ... Compton scattering, (c) pair annihilation ...

Vacuum Energy When all three derivatives act on $C[\bar\eta,\eta,J]$, they connect all the
external lines with each other and no external lines remain (Fig. 11.5a). This is a
vacuum-to-vacuum transition. The expression has the form

$$\begin{aligned}
Z_{24} &= \frac{i}{2}N\int d^4x\,d^4y\,\mathrm{Tr}[(-iq\gamma^\rho)S_F(y - x)(-iq\gamma^\mu) \\
&\quad\times S_F(x - y)]\,D_{F\rho\mu}(x - y)\,Z_0[\bar\eta,\eta,J]
\end{aligned} \tag{11.172}$$

It is absorbed by the normalization constant N to give $Z_2[0, 0, 0] = 0$.



Photon Self-Energy When only the photon derivative acts on $Z_0$, and the others
on C, the two electron lines already pulled down into C are connected to each other
to make a loop. This is called a photon self-energy diagram (Fig. 11.5b). The
expression becomes

$$\begin{aligned}
Z_{25} &= -\frac{1}{2}\int d^4x\,d^4y\,d^4z\,d^4w\,J^\sigma(w)D_{F\sigma\rho}(w - y) \\
&\quad\times \mathrm{Tr}[(-iq\gamma^\rho)S_F(y - x)(-iq\gamma^\mu)S_F(x - y)] \\
&\quad\times D_{F\mu\nu}(x - z)J^\nu(z)\,Z_0
\end{aligned} \tag{11.173}$$

This gives a correction to the photon propagator. Note the extra minus sign for the
loop. Whenever there are fermion loops, an extra sign (−1) is added, which comes
from reordering the fermion fields. This appears in the higher order correction in
the vacuum expectation value of the two-point time-ordered correlation function.
The whole propagator consists of the functional derivatives of $Z_0$ and $Z_{25}$ and it is given
by

Figure 11.5 (a) Vacuum-to-vacuum transition, (b) photon self-energy, (c) electron self-energy.

$$\begin{aligned}
\langle\Omega|T[A_\mu(x)A_\nu(y)]|\Omega\rangle &= \left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_0}{\delta J^\mu(x)\,\delta J^\nu(y)}\right|_{\eta=\bar\eta=J=0} + \left(\frac{1}{i}\right)^2\left.\frac{\delta^2 Z_{25}}{\delta J^\mu(x)\,\delta J^\nu(y)}\right|_{\eta=\bar\eta=J=0} \\
&= iD_{F\mu\nu}(x - y) + \int d^4w\,d^4z\,D_{F\mu\rho}(x - w) \\
&\quad\times \mathrm{Tr}[(-iq\gamma^\rho)S_F(w - z)(-iq\gamma^\sigma)S_F(z - w)]\,D_{F\sigma\nu}(z - y)
\end{aligned} \tag{11.174}$$

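The factor 1/2 was canceled by taking second derivatives of the quadratic $J^\sigma J^\nu$
term. Using the momentum representation of the propagators Eqs. (11.65) and
(11.119) and integrating over w and z, the second term becomes

$$\begin{aligned}
&\int\frac{d^4k}{(2\pi)^4}\,e^{-ik\cdot(x - y)}\,\frac{-g_{\mu\rho}}{k^2 + i\varepsilon}\int\frac{d^4p}{(2\pi)^4} \\
&\quad\times \mathrm{Tr}\left[(-iq\gamma^\rho)\frac{1}{\not{p} - m + i\varepsilon}(-iq\gamma^\sigma)\frac{1}{\not{p} - \not{k} - m + i\varepsilon}\right]\frac{-g_{\sigma\nu}}{k^2 + i\varepsilon}
\end{aligned} \tag{11.175}$$

Then the momentum representation of the photon propagator including the
second-order correction is expressed as

$$\begin{aligned}
iD_{F\mu\nu}(k) &= \frac{-ig_{\mu\nu}}{k^2 + i\varepsilon} + \frac{-ig_{\mu\rho}}{k^2 + i\varepsilon}\,(-i\Pi^{\rho\sigma})\,\frac{-ig_{\sigma\nu}}{k^2 + i\varepsilon} \\
-i\Pi^{\rho\sigma} &= -\int\frac{d^4p}{(2\pi)^4}\,\mathrm{Tr}\left[(-iq\gamma^\rho)\frac{i}{\not{p} - m + i\varepsilon}(-iq\gamma^\sigma)\frac{i}{\not{p} - \not{k} - m + i\varepsilon}\right]
\end{aligned} \tag{11.176}$$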

Thus we have reproduced the radiative correction to the photon propagator 
Eq. (8.49b). 

Electron Self-Energy When only one of the electron derivatives acts on $Z_0$ and the
others on C, this pulls down an electron line from the source and connects it to the
second vertex through both a photon and another electron line, i.e. this is the
self-energy correction to the electron propagator (see Fig. 11.5c).

Problem 11.4 

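Derive the electron self-energy corresponding to Fig. 11.5c and show that its vacuum
expectation value becomes a higher order correction term to $iS_F(x - y)$

$$\begin{aligned}
iS_F(x - y)\big|_{\text{2nd order}} &= \int d^4w\,d^4z\,S_F(x - w)(-iq\gamma^\mu)S_F(w - z)(-iq\gamma^\nu)S_F(z - y)\,D_{F\mu\nu}(w - z) \\
&= \int\frac{d^4p}{(2\pi)^4}\,e^{-ip\cdot(x - y)}\,\frac{i}{\not{p} - m + i\varepsilon}\,\{-i\Sigma(p)\}\,\frac{i}{\not{p} - m + i\varepsilon} \\
-i\Sigma(p) &= \int\frac{d^4k}{(2\pi)^4}\,\frac{-ig_{\mu\nu}}{k^2 + i\varepsilon}\,(-iq\gamma^\mu)\,\frac{i}{\not{p} - \not{k} - m + i\varepsilon}\,(-iq\gamma^\nu)
\end{aligned} \tag{11.177}$$

which reproduces Eq. (8.66).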

11.6.6 
Scattering Matrix 

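Electron-Muon Scattering To treat the interaction of the muon and the electron,
we only need to add the muon Lagrangian to the QED Lagrangian. The former is
identical to that of the electron except the particle species must be distinguished.
The interaction Lagrangian becomes

$$\mathcal{L}_{\mathrm{int}} = \mathcal{L}_{\mathrm{int},e} + \mathcal{L}_{\mathrm{int},\mu} = -q\,\bar\psi_e\gamma^\mu\psi_e A_\mu - q\,\bar\psi_\mu\gamma^\mu\psi_\mu A_\mu \tag{11.178}$$

where the electric charge of the electron and muon is $q = -e$. The second-order
generating functional $Z_2[\bar\eta,\eta,J]$ contains

$$\begin{aligned}
\frac{1}{2}\left[i\int d^4x\,\mathcal{L}_{\mathrm{int}}\right]^2 &= \frac{1}{2}\left[i\int d^4x\,(\mathcal{L}_{\mathrm{int},e} + \mathcal{L}_{\mathrm{int},\mu})\right]^2 \\
&= \frac{1}{2}\left(i\int d^4x\,\mathcal{L}_{\mathrm{int},e}\right)^2 + \frac{1}{2}\left(i\int d^4x\,\mathcal{L}_{\mathrm{int},\mu}\right)^2 + i\int d^4x\,\mathcal{L}_{\mathrm{int},e}\;i\int d^4y\,\mathcal{L}_{\mathrm{int},\mu}
\end{aligned} \tag{11.179}$$

Therefore, the generating functional of e-μ scattering is readily obtained from
Eq. (11.170):

$$\begin{aligned}
Z_{e\mu} &= \left\{\int d^4y\left(-\frac{1}{i}\frac{\delta}{\delta\eta(y)}\right)(-iq\gamma^\rho)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(y)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\rho(y)}\right)\right\}_\mu \\
&\quad\times \Big[-\int d^4x\,d^4z_1\,d^4z_2\,d^4z_3\,\{\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu) \\
&\qquad\times S_F(x - z_3)\eta(z_3)\}_e\,D_{F\mu\nu}(x - z_1)J^\nu(z_1)\,Z_0[\bar\eta,\eta,J]\Big] \\
&= i\int d^4x\,d^4y\,d^4z_2\,d^4z_3\,d^4w_2\,d^4w_3 \\
&\quad\times \{\bar\eta(z_2)S_F(z_2 - x)(-iq\gamma^\mu)S_F(x - z_3)\eta(z_3)\}_e\,D_{F\mu\rho}(x - y) \\
&\quad\times \{\bar\eta(w_2)S_F(w_2 - y)(-iq\gamma^\rho)S_F(y - w_3)\eta(w_3)\}_\mu\,Z_0
\end{aligned} \tag{11.180}$$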
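where we have distinguished the electron and the muon. To calculate the transition
amplitude of the electron-muon scattering, we assign the following kinematic
variables:

$$e(p_1) + \mu(p_2) \to e(p_3) + \mu(p_4) \tag{11.181}$$

Application of the LSZ reduction formula Eq. (11.151) to e-μ scattering gives

$$\begin{aligned}
\langle \mathrm{out}: p_3\,p_4\,|\,p_1\,p_2 : \mathrm{in}\rangle &= (-i)^4\int d^4x_1\,d^4x_2\,d^4x_3\,d^4x_4 \\
&\quad\times \bar U_{p_3}(x_3)\,(i\not{\partial} - m_e)_{x_3}\;\bar U_{p_4}(x_4)\,(i\not{\partial} - m_\mu)_{x_4} \\
&\quad\times \left(\frac{1}{i}\right)^4\left.\frac{\delta^4 Z_{e\mu}[\bar\eta,\eta,J]}{\delta\eta^\mu(x_2)\,\delta\eta^e(x_1)\,\delta\bar\eta^e(x_3)\,\delta\bar\eta^\mu(x_4)}\right|_{\eta=\bar\eta=J=0} \\
&\quad\times (-i\overleftarrow{\not{\partial}} - m_e)_{x_1}\,U_{p_1}(x_1)\;(-i\overleftarrow{\not{\partial}} - m_\mu)_{x_2}\,U_{p_2}(x_2)
\end{aligned} \tag{11.182}$$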

The effect of the four derivatives in the reduction formula on the generating
functional $Z_{e\mu}$ in Eq. (11.180) is given by

$$\begin{aligned}
&\left.\frac{\delta^4 Z_{e\mu}[\bar\eta,\eta,J]}{\delta\eta^\mu(x_2)\,\delta\eta^e(x_1)\,\delta\bar\eta^e(x_3)\,\delta\bar\eta^\mu(x_4)}\right|_{\eta=\bar\eta=J=0} \\
&\quad = i\int d^4x\,d^4y\,\big[\{S_F(x_3 - x)(-iq\gamma^\mu)S_F(x - x_1)\}_e \\
&\qquad\times D_{F\mu\rho}(x - y)\,\{S_F(x_4 - y)(-iq\gamma^\rho)S_F(y - x_2)\}_\mu\big]
\end{aligned} \tag{11.183}$$

Functional derivatives acting on $Z_0$ do not contribute since they contain the source
functions. As the effect of the Dirac operators in the reduction formula is only to
convert the outermost propagators to the corresponding wave functions, the final
expression for the scattering amplitude becomes

$$\begin{aligned}
\langle \mathrm{out}: p_3, p_4\,|\,p_1, p_2 : \mathrm{in}\rangle &= i\int d^4x\,d^4y\,\{\bar U_{p_3}(x)(-iq\gamma^\mu)U_{p_1}(x)\}_e \\
&\quad\times D_{F\mu\rho}(x - y)\,\{\bar U_{p_4}(y)(-iq\gamma^\rho)U_{p_2}(y)\}_\mu
\end{aligned} \tag{11.184}$$

The formula is depicted in Fig. 11.6a.
When we insert the expressions

$$U_p(x) = u_r(p)\,e^{-ip\cdot x}, \qquad \bar U_{p'}(x) = \bar u_s(p')\,e^{ip'\cdot x} \tag{11.185a}$$

$$D_{F\mu\nu}(x - y) = \int\frac{d^4k}{(2\pi)^4}\,e^{-ik\cdot(x - y)}\,\frac{-g_{\mu\nu}}{k^2 + i\varepsilon} \tag{11.185b}$$

(where we have recovered the subscript for the spin degree of freedom) in this formula,
the integration of x and y produces two δ-functions, and after k integration
the scattering amplitude becomes

$$\begin{aligned}
\langle \mathrm{out}: p_3, p_4\,|\,p_1, p_2 : \mathrm{in}\rangle &= (2\pi)^4\delta^4(p_1 + p_2 - p_3 - p_4) \\
&\quad\times [\bar u_s(p_3)(-iq\gamma^\mu)u_r(p_1)]_e\,\frac{-ig_{\mu\nu}}{(p_1 - p_3)^2 + i\varepsilon}\,[\bar u_n(p_4)(-iq\gamma^\nu)u_m(p_2)]_\mu
\end{aligned} \tag{11.186}$$

Figure 11.6 (a) Electron-muon scattering and (b,c) Compton scattering.

This reproduces the scattering amplitude Eq. (7.11) obtained using the standard
canonical quantization method.

Problem 11.5 

Derive the Compton scattering amplitude and confirm it agrees with Eq. (7.51).
Hint: As the photon source $J_\mu$ has no directionality, it could serve either as a source
or a sink. There are two ways to make the photon derivatives, producing two distinct
scattering processes, as depicted in Fig. 11.6b,c.



11.6.7 

Connected Diagrams 

In the expansion of the generating functional, we saw that it contained the disconnected
diagrams that describe two (or more) independent processes. We will show
that the generating functional W produces only connected diagrams. It is related
to Z by

$$Z[J] = e^{iW[J]} \quad\text{or}\quad W[J] = -i\ln Z[J] \tag{11.187}$$

We consider the case of the second-order interaction. The generating functional of
QED was given by Eq. (11.157), which we reproduce here:

$$Z[\bar\eta,\eta,J] = N\exp\left[\int d^4x\left(-\frac{1}{i}\frac{\delta}{\delta\eta(x)}\right)(-iq\gamma^\mu)\left(\frac{1}{i}\frac{\delta}{\delta\bar\eta(x)}\right)\left(\frac{1}{i}\frac{\delta}{\delta J^\mu(x)}\right)\right] Z_0[\bar\eta,\eta,J] \tag{11.188}$$

For ease of calculation we abbreviate the exponent by $iL_I$ and express $Z[J]$ as follows:

$$Z[J] = Z[\bar\eta,\eta,J] = N Z_0\,Z_0^{-1}\,e^{iL_I}\,Z_0[J] \tag{11.189}$$
Taking the logarithm of $Z[J]$, we can Taylor expand to the second order of the
interaction Lagrangian $iL_I$, assuming it is small:

$$\begin{aligned}
\ln Z[J] &= \ln N Z_0 + \ln(Z_0^{-1}e^{iL_I}Z_0) \\
&= \ln N Z_0 + \ln\{1 + Z_0^{-1}(e^{iL_I} - 1)Z_0\} \\
&= \ln N Z_0 + \ln\left\{1 + Z_0^{-1}\left(iL_I + \frac{1}{2!}(iL_I)^2 + \cdots\right)Z_0\right\} \\
&= \ln N Z_0 + Z_0^{-1}\left\{iL_I + \frac{1}{2!}(iL_I)^2\right\}Z_0 - \frac{1}{2}\{Z_0^{-1}(iL_I)Z_0\}^2 + \cdots \\
&= \ln N + S_0 + S_1 + \left(S_2 - \frac{1}{2}S_1^2\right) + \cdots
\end{aligned} \tag{11.190}$$

Recovering the original notation

$$S_0 = \ln Z_0, \qquad S_1 = \frac{Z_1}{Z_0}, \qquad S_2 = \frac{Z_2}{Z_0} \tag{11.191}$$

where $Z_1$ and $Z_2$ are the first- and second-order terms of $Z[J]$ in the power
expansion of the interaction Lagrangian defined in Eqs. (11.162) and (11.168). The
disconnected diagram that appeared in the second-order interaction was given by
$Z_{21} = \frac{1}{2}(Z_1/Z_0)^2 Z_0$ [see Eq. (11.169)], which is exactly canceled in $\ln Z[J]$, as is seen
in the parentheses in the last equation of Eq. (11.190). Higher order disconnected
diagrams can also be shown to be canceled by the corresponding products of lower
order contributions.
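
The cancellation can be seen in a toy model where the functionals are replaced by ordinary numbers. The sketch below treats Z/Z_0 as a power series 1 + g z1 + g² z2 in a bookkeeping coupling g and extracts the coefficients of its logarithm. The numerical values a and b and the function name are hypothetical stand-ins for the connected first- and second-order pieces, chosen only for illustration.

```python
import math

def connected_coefficients(z1, z2):
    """Coefficients s1, s2 of ln(1 + g*z1 + g^2*z2) = g*s1 + g^2*s2 + O(g^3)."""
    return z1, z2 - 0.5 * z1 * z1

# Toy numbers (hypothetical, for illustration only)
a = 0.7    # stands for the first-order term Z_1/Z_0
b = -0.3   # stands for the connected part of Z_2/Z_0

z1 = a
z2 = b + 0.5 * a * a   # connected part plus the disconnected piece (1/2)(Z_1/Z_0)^2

s1, s2 = connected_coefficients(z1, z2)

# Compare with the exact logarithm at small coupling
g = 1e-3
exact = math.log(1.0 + g * z1 + g * g * z2)
truncated = g * s1 + g * g * s2

print("s1 =", s1)
print("s2 =", s2)
print("exact - truncated =", exact - truncated)
```

The second-order coefficient of the logarithm comes out equal to b alone: the piece (1/2) z1² that mimics the disconnected diagram has dropped out, exactly as in the parentheses of Eq. (11.190).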



11.7 

Faddeev-Popov's Ansatz* 

This section is provided to explain why a ghost field is necessary in the covariant 
formalism of non-Abelian gauge theory. The discussion is formal and mathemati- 
cal. The reader is advised to read Chap. 18 first and then come back here. 



11.7.1 

A Simple Example 1)

The concept of Faddeev and Popov's gauge-fixing method is rather hard to grasp. 
However, the method constitutes an important concept for handling symmetry in 
the path integral and becomes an indispensable tool when we extend our discus- 
sion to non-Abelian gauge theory. Therefore we shall make a small detour, spend- 
ing a few extra pages rederiving the gauge-fixing condition that could be phrased 
in one sentence for Abelian gauge theory. Perhaps it helps if we do a simple exer- 
cise that follows their strategy closely to isolate the redundant degrees of freedom. 
Consider an integral 

$$W = \iint dx\,dy\,G(x, y) = \int d\mathbf{r}\,G(\mathbf{r}), \qquad r = \sqrt{x^2 + y^2} \tag{11.192}$$

Here, we assume $G(x, y)$ is a function of r and has rotational symmetry. The nature
of the symmetry is apparent in polar coordinates:

$$W = \int d\theta\int dr\,r\,G(r) \tag{11.193}$$

The integral is invariant under coordinate rotations

$$\mathbf{r} = (r, \theta) \to \mathbf{r}_\phi = (r, \theta + \phi) \tag{11.194}$$

The essential part of the integrand that does not change by rotation is isolated as a
function of r. The symmetry manifests itself in that it does not depend on $\theta$ and
that the $\theta$ integration merely adds an integration volume over the symmetry space,
which is the cause of an infinity in the path integral. Our aim is to embed this
symmetry condition in the original coordinates. To do that we insert

$$1 = \int d\phi\,\delta(\phi - \theta) \tag{11.195}$$

Then Eq. (11.193) becomes

$$W = \int d\phi\int dr\,d\theta\,\delta(\phi - \theta)\,r\,G(r) \tag{11.196}$$

By this insertion, the second integration has been reduced from a two-dimensional
to a one-dimensional one by the delta function, which constrains the two-dimensional
integration to be carried out at a fixed angle $\theta = \phi$.

More generally, a condition $f(\mathbf{r}) = 0$ that fixes the angle at some value of $\phi$ may
be used. $\phi$ can vary as a function of r as long as $f(\mathbf{r}_\phi) = 0$ gives a unique solution
of $\phi$ for a given value of r. As $\int d\phi\,\delta[f(\mathbf{r}_\phi)] \neq 1$, we define a quantity $\Delta_f(\mathbf{r})$ by

$$\Delta_f(\mathbf{r})\int d\phi\,\delta[f(\mathbf{r}_\phi)] = 1 \tag{11.197}$$

1) Here we closely follow the arguments of Cheng and Li [97].
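Then

$$[\Delta_f(\mathbf{r})]^{-1} \equiv \int d\phi\,\delta[f(\mathbf{r}_\phi)] = \int d\phi\left[\frac{\partial f}{\partial\phi}\right]^{-1}\delta[\phi - \theta(r)] \tag{11.198}$$

$\Delta_f(\mathbf{r})$ itself is invariant under the rotation because

$$[\Delta_f(\mathbf{r}_{\phi'})]^{-1} = \int d\phi\,\delta[f(\mathbf{r}_{\phi + \phi'})] = \int d\phi''\,\delta[f(\mathbf{r}_{\phi''})] = [\Delta_f(\mathbf{r})]^{-1} \tag{11.199}$$

Insertion of Eq. (11.197) in Eq. (11.192) gives

$$W = \int d\phi\,W_\phi = \int d\phi\left[\int d\mathbf{r}\,G(\mathbf{r})\,\Delta_f(\mathbf{r})\,\delta[f(\mathbf{r}_\phi)]\right] \tag{11.200}$$

$W_\phi$ is rotationally invariant because

$$W_{\phi'} = \int d\mathbf{r}\,G(\mathbf{r})\,\Delta_f(\mathbf{r})\,\delta[f(\mathbf{r}_{\phi'})] = \int d\mathbf{r}'\,G(\mathbf{r}')\,\Delta_f(\mathbf{r}')\,\delta[f(\mathbf{r}')] = W_\phi \tag{11.201}$$

where $\mathbf{r}' = (r, \theta')$ and we have used the fact that both $G(\mathbf{r})$ and $\Delta_f(\mathbf{r})$ are invariant
under rotation. We have succeeded in isolating the symmetry-independent volume
factor from the symmetry-invariant integral.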
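
The factorization can be checked numerically on a concrete case. The sketch below assumes the illustrative choices G = exp(−r²) and the "gauge condition" f(r) = y, neither of which comes from the text. For f = y the rotated condition r sin(θ + φ) = 0 has two solutions per orbit rather than one, but the definition of Δ_f still absorbs them: each contributes 1/r, so Δ_f = r/2. The gauge-fixed integral is then W_φ = ∫dx G(|x|)|x|/2, and the full integral should equal the orbit volume 2π times W_φ.

```python
import math

def G(x, y):
    """Rotationally symmetric integrand (illustrative choice)."""
    return math.exp(-(x * x + y * y))

def direct_integral(half_width=6.0, n=600):
    """Midpoint-rule estimate of W = ∫∫ dx dy G(x, y) on a large square."""
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        x = -half_width + (i + 0.5) * h
        for j in range(n):
            y = -half_width + (j + 0.5) * h
            total += G(x, y)
    return total * h * h

def gauge_fixed_integral(half_width=6.0, n=6000):
    """W_phi = ∫ dx dy G * Delta_f * delta(y), with Delta_f = r/2 for f = y.

    The delta function sets y = 0, leaving a one-dimensional integral over x.
    """
    h = 2.0 * half_width / n
    total = 0.0
    for i in range(n):
        x = -half_width + (i + 0.5) * h
        total += G(x, 0.0) * abs(x) / 2.0
    return total * h

W = direct_integral()
W_phi = gauge_fixed_integral()
orbit_volume = 2.0 * math.pi   # ∫ dφ over the rotation group

print("direct W            :", W)
print("orbit volume * W_phi:", orbit_volume * W_phi)
```

In the path integral the analogue of the orbit volume is the infinite factor ∫𝒟ω, which is why it is divided out rather than evaluated.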

11.7.2 

Gauge Fixing Revisited

In gauge theories, we consider a local gauge transformation 2)

$$A_\mu \to A^\omega_\mu \tag{11.202a}$$

$$A^\omega_\mu = U(\omega)A_\mu U^{-1}(\omega) - \frac{i}{g}\,U(\omega)\,\partial_\mu U^{-1}(\omega) \tag{11.202b}$$

$$U(\omega) = \exp\left[-i\sum_a \omega_a(x)T_a\right] \tag{11.202c}$$

In the following discussion, we consider only an infinitesimal transformation, but
the conclusion is general. Then the transformation is written as

$$A^{\omega\,\mu}_a = A^\mu_a + f_{abc}\,\omega_b A^\mu_c + \frac{1}{g}\partial^\mu\omega_a \tag{11.203}$$



where a, b, c ($a, b, c = 1 \cdots N_c$) are color indices for the non-Abelian gauge transformation
and the $f_{abc}$ are structure constants of the unitary group representation. For
Abelian groups, $f_{abc} = 0$. For a given set of fields $A^\mu_a(x)$ in the gauge-invariant
Lagrangian, the action is constant on any path of $A^{\omega\,\mu}_a(x)$ that can be obtained by
the gauge transformation $U(\omega)$. This will naturally give an infinite volume because
the path integral is over the whole range of A, including the $A^\omega$'s that are gauge
transformed from A. We want to separate this part of the path integral. The procedure
will be exactly the same as in the simple example we have just discussed. First we
have to fix the gauge at some value. The gauge condition is expressed generically as

$$g_a(A_\mu) = 0, \quad\text{i.e.}\quad g_a(A^\omega_\mu)\big|_{\omega=0} = 0 \tag{11.204}$$


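Some commonly used gauge-fixing conditions may be represented in the form

$$\text{Lorentz gauge:}\quad \partial_\mu A^\mu_a = 0 \tag{11.205a}$$

$$\text{Coulomb gauge:}\quad \nabla\cdot\mathbf{A}_a = 0 \tag{11.205b}$$

$$\text{Axial gauge:}\quad A^3_a = 0 \tag{11.205c}$$

$$\text{Temporal gauge:}\quad A^0_a = 0 \tag{11.205d}$$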

The generating functional of the gauge fields is expressed as

$$Z = \int\mathcal{D}A\,e^{iS[A]} \tag{11.206}$$

Then we insert

$$\Delta[A^\omega]\int\mathcal{D}\omega\,\delta[g(A^\omega)] = 1 \tag{11.207}$$



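which is analogous to Eq. (11.197) and defines $\Delta[A^\omega]$. The generating functional is
transformed to

$$\begin{aligned}
Z &= \int\mathcal{D}A\int\mathcal{D}\omega\,\Delta[A^\omega]\,\delta[g(A^\omega)]\,e^{iS[A]} \\
\mathcal{D}A\,\mathcal{D}\omega &= \prod_x\prod_{a=1}^{N_c}\prod_\mu dA^\mu_a(x)\;\prod_x\prod_{a=1}^{N_c} d\omega_a(x)
\end{aligned} \tag{11.208}$$

Notice, however, that Eq. (11.207) is a functional integral and "the delta functional"
should be considered as an infinite product of δ-functions, one for each space-time
point:

$$\delta[g(A^\omega)] = \prod_x\prod_a \delta\big(g_a(A^\omega(x))\big) \tag{11.209}$$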
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Namely Eq. (11.207) should read 



= (n/^to(A-)])to(^) 



: n includes both color and the discrete space-time point. Then 
MA]' 1 = f\Y\j doj a (x»)d(g a (A"(x»))) 



fl(*).-.*N,(*)] 



The last step defines the functional determinant of the continuous matrix dco a (x)/ 
3gi,(y), with rows labeled by (a, x) and columns by (b, y). The gauge condition 
g a {A(x)) = can be rewritten as g a (A M (%))| m=0 = 0. A[A] is then expressed as 

or A satisfying g(A) = (11.212) 
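The content of Eqs. (11.207) and (11.212) can be checked in a one-variable toy model. The
following Python sketch is only an illustration and is not part of the derivation: the
function `g`, the Gaussian width `eps` used to smear the delta function, and all names are
assumptions chosen for the example. It checks numerically that the product of
$\Delta = |dg/d\omega|$ (evaluated where $g = 0$) and $\int d\omega\,\delta(g(\omega))$ is close to 1.

```python
import math

def g(w):
    """Hypothetical one-variable 'gauge condition'; monotonic, so it has one root."""
    return w**3 + 2.0 * w - 1.0

def dg(w):
    """Derivative dg/dw, the one-dimensional analogue of det(dg/dw)."""
    return 3.0 * w**2 + 2.0

def find_root(lo=0.0, hi=1.0, iterations=60):
    """Bisection for the point where g vanishes (g(lo) < 0 < g(hi))."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def smeared_delta_integral(eps=1e-3, lo=0.0, hi=1.0, steps=100_000):
    """Midpoint-rule estimate of the integral of delta_eps(g(w)) dw,
    with the delta function replaced by a Gaussian of width eps."""
    dw = (hi - lo) / steps
    norm = 1.0 / (eps * math.sqrt(2.0 * math.pi))
    total = 0.0
    for i in range(steps):
        w = lo + (i + 0.5) * dw
        total += norm * math.exp(-0.5 * (g(w) / eps) ** 2) * dw
    return total

w0 = find_root()
delta = abs(dg(w0))                # analogue of Delta[A] in Eq. (11.212)
integral = smeared_delta_integral()
product = delta * integral         # expected to be close to 1, as in Eq. (11.207)
print(f"root = {w0:.6f}, Delta = {delta:.6f}, Delta * integral = {product:.6f}")
```

In the field-theory case the single derivative becomes the functional determinant over
all color indices and space-time points, which is why it cannot in general be treated as
a constant.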

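The functional determinant in Eq. (11.212) can be calculated using Eq. (11.203).
Clearly, $\Delta[A^\omega]$ is gauge invariant because, in Eq. (11.207), if we perform another
gauge transformation $g_a(A^\omega) \to g_a(A^{\omega+\omega'})$, the group nature of the measure
means $\mathcal{D}\omega = \mathcal{D}(\omega+\omega')$ and hence

$$\Delta^{-1}[A^{\omega+\omega'}] = \int\mathcal{D}\omega\,\delta[g(A^{\omega+\omega'})] = \int\mathcal{D}(\omega+\omega')\,\delta[g(A^{\omega+\omega'})] = \int\mathcal{D}\omega''\,\delta[g(A^{\omega''})] = \Delta^{-1}[A^\omega] \tag{11.214}$$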

At this stage, we can look at the generating functional $Z$ in Eq. (11.208) and
change the order of $\mathcal{D}A$ and $\mathcal{D}\omega$. Since $e^{iS[A]}$ and $\Delta[A]$ are gauge invariant,
they can be replaced by $e^{iS[A^\omega]}$ and $\Delta[A^\omega]$. Now we change the integration variable
to $\mathcal{D}A^\omega$, but the new integration variable can be renamed back to $A$, leading to

$$Z = \int\mathcal{D}A\,e^{iS[A]}\int\mathcal{D}\omega\,\Delta[A^\omega]\,\delta[g(A^\omega)] = \int\mathcal{D}\omega\left[\int\mathcal{D}A\,e^{iS[A]}\,\Delta[A]\,\delta[g(A)]\right] \tag{11.215}$$

where we have been able to take out the factor $\int\mathcal{D}\omega$ since the integral in large square
brackets is independent of $\omega$. This is the Faddeev-Popov ansatz [136]. The integration
over $\omega$ simply gives an overall multiplicative constant to $Z$ and can be absorbed
in the normalization constant. Thus we have successfully isolated the integration
over redundant degrees of freedom originating from the gauge symmetry. In order
to obtain a more tractable expression, however, we go one step further to manipulate
Eq. (11.207) before we put it back into the generating functional.
The identity Eq. (11.207) can be generalized to

$$\Delta[A]\int\mathcal{D}\omega\prod_{a=1}^{N}\delta\big[g_a(A^\omega) - h_a(x)\big] = 1 \tag{11.216}$$

where $h_a(x)$ is a set of arbitrary functions. This merely redefines $g_a$ to make the
gauge condition $g_a(A^\omega(x)) = h_a(x)$. As $h_a(x)$ is independent of $A^\mu$, $\Delta[A]$ is
unaffected. Now multiply both sides by an arbitrary functional $H[h] = H[h_1(x), \dots, h_N(x)]$
and integrate over all $h_a(x)$. The result is a more general identity:

$$\Delta[A]\,\frac{\int\mathcal{D}\omega\,\mathcal{D}h\,\prod_{a=1}^{N}\delta\big[g_a(A^\omega) - h_a(x)\big]\,H[h]}{\int\mathcal{D}h\,H[h]} = \Delta[A]\,\frac{\int\mathcal{D}\omega\,H[g(A^\omega)]}{\int\mathcal{D}h\,H[h]} = 1 \tag{11.217}$$



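Now insert this identity instead of Eq. (11.207) in the generating functional $Z$, and
as before change the order of $\mathcal{D}A$ and $\mathcal{D}\omega$. The same arguments we used in
obtaining Eq. (11.215) can isolate $\int\mathcal{D}\omega$ and we obtain

$$Z = \left[\frac{\int\mathcal{D}\omega}{\int\mathcal{D}h\,H[h]}\right]\int\mathcal{D}A\,e^{iS[A]}\,H[g(A)]\,\Delta[A] = N\int\mathcal{D}A\,e^{iS[A]}\,H[g(A)]\,\Delta[A] \tag{11.218}$$

It remains to evaluate $\Delta[A]$ and fix $H[g(A)]$.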

11.7.3
Faddeev-Popov Ghost

Lorentz Gauge

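To proceed further, we need to fix the gauge condition. Let us choose the Lorentz
gauge, namely

$$g_a(A) = \partial_\mu A^\mu_a \tag{11.219}$$

Then for $A$ satisfying $\partial_\mu A^\mu_a = 0$,

$$g_a(A^\omega) = \partial_\mu\left(A^\mu_a + \frac{1}{g}\,\partial^\mu\omega_a + f_{abc}\,\omega_b A^\mu_c\right) = \frac{1}{g}\,\partial_\mu\partial^\mu\omega_a + f_{abc}\,A^\mu_c\,\partial_\mu\omega_b \tag{11.220}$$

$$\left.\frac{\delta g_a(A^\omega(x))}{\delta\omega_b(y)}\right|_{\omega=0} = \left[\frac{1}{g}\,\delta_{ab}\,\partial_\mu\partial^\mu + f_{abc}\,A^\mu_c\,\partial_\mu\right]\delta(x-y) \tag{11.221}$$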



For the Abelian field, $f_{abc} = 0$ and $\Delta[A]$ does not depend on $A$. It can be absorbed
in the normalization constant. However, for the non-Abelian field, $\Delta[A]$ depends on
$A$ and is therefore a dynamical object. It cannot be absorbed by the normalization
constant. A convenient way to handle it is to rewrite it as a path integral in terms of
anticommuting c-numbers. We cannot use the ordinary commuting c-numbers because
the determinant would then appear as its inverse. We refer to Eq. (11.106), which is
reproduced as the next equation:

$$\int\mathcal{D}\bar\eta(x)\,\mathcal{D}\eta(x)\exp\left[-i\int d^4x\,d^4y\,\bar\eta(x)\,A(x,y)\,\eta(y)\right] = \det iA \tag{11.222}$$

Using this formula, we have

$$\Delta[A] = \int\mathcal{D}\bar\eta(x)\,\mathcal{D}\eta(x)\exp\left[i\int d^4x\,\mathcal{L}_{\text{ghost}}\right], \qquad \mathcal{L}_{\text{ghost}} = -\bar\eta_a(x)\left[\delta_{ab}\,\partial_\mu\partial^\mu + g f_{abc}\,A^\mu_c(x)\,\partial_\mu\right]\eta_b(x) \tag{11.223}$$

The factor $1/g$ is absorbed into the normalization of the $\eta$ and $\bar\eta$ fields. By
expressing the determinant as a path integral, we have introduced a new field. It has
peculiar characteristics: it is a scalar field obeying the Klein-Gordon equation but
is anticommuting. It is called the Faddeev-Popov ghost. As it has a color index,
there are as many (i.e. $N_c$) ghost fields as gauge fields. It interacts only with the
gauge field through derivative couplings. Since there is no reason to consider it as
a physical particle, we do not introduce an external source and hence it only appears
in the intermediate states, namely, in closed loops in Feynman diagrams.

We still have the freedom to adopt an arbitrary functional $H[h]$ and we choose it
to be a Gaussian

$$H[h] = \exp\left[-\frac{i}{2\lambda}\int d^4x\,g^2\right] = \exp\left[-\frac{i}{2\lambda}\int d^4x\,(\partial_\mu A^\mu)^2\right] \tag{11.224}$$

It amounts to changing the gauge field Lagrangian

$$\mathcal{L} \to \mathcal{L} - \frac{1}{2\lambda}(\partial_\mu A^\mu_a)^2 \tag{11.225}$$

which is exactly the gauge-fixing term we introduced before to circumvent the
difficulty (see Eq. (11.59)).



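In summary, the final form for $Z$, taking note of Eqs. (11.223) and (11.224), is

$$Z = N\int\mathcal{D}A\,\mathcal{D}\bar\eta\,\mathcal{D}\eta\,\exp\left[i\int d^4x\,\mathcal{L}_{\text{nAbel}}\right] \tag{11.226a}$$

$$\mathcal{L}_{\text{nAbel}} = \mathcal{L}_{\text{free}} + \mathcal{L}_{\text{GF}} + \mathcal{L}_{\text{FPG}} \tag{11.226b}$$

$$\phantom{\mathcal{L}_{\text{nAbel}}} = \mathcal{L}_{\text{free}} - \frac{1}{2\lambda}(\partial_\mu A^\mu_a)^2 - \bar\eta_a M_{ab}\,\eta_b \tag{11.226c}$$

$$M_{ab} = \delta_{ab}\,\partial_\mu\partial^\mu + g f_{abc}\,A^\mu_c\,\partial_\mu \tag{11.226d}$$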

Axial Gauge 

The ghosts have the wrong spin-statistics and appear only in intermediate states. 
Their existence, however, is essential to retain the unitarity of the theory. This is 
analogous to the scalar photon with indefinite norm, i.e. negative probability, that 
appeared in the covariant Lorentz gauge of the Abelian theory. In the non-Abelian 
gauge, for instance in quantum chromodynamics (QCD), similar situations occur 
for scalar and longitudinal gluons. In addition, because of the nonlinear nature 
of the non-Abelian theory, they couple to themselves. The creation of unphysical 
gluon pairs cannot be canceled by themselves. The ghost's role is precisely to com- 
pensate these unwanted contributions and to maintain the unitarity of the theory 
order by order. In the Abelian case, when the Coulomb gauge was adopted only 
physical photons appeared and we did not need unphysical photons. Similarly, we 
may ask if there is a gauge in non-Abelian gauge theory in which the ghosts do not 
appear. Indeed there is: the axial gauge,3) where $A^3_a(x) = 0$. More generally, we
define the axial gauge with a unit vector $t^\mu$ that satisfies the condition

$$t_\mu A^\mu_a = 0, \qquad t_\mu t^\mu = \pm 1 \tag{11.227}$$

Then the gauge-fixing term is

$$g_a(A) = t_\mu A^\mu_a \tag{11.228}$$

and under the gauge transformation Eq. (11.203)

$$\delta g_a[A^\omega] = f_{abc}\,\omega_b\,t_\mu A^\mu_c + \frac{1}{g}\,t_\mu\partial^\mu\omega_a = \frac{1}{g}\,t_\mu\partial^\mu\omega_a \tag{11.229}$$

Hence

$$\frac{\delta g_a}{\delta\omega_b} = \frac{1}{g}\,\delta_{ab}\,t^\mu\partial_\mu \tag{11.230}$$

This does not contain $A^\mu$, and its contribution can be integrated out. We do not
need ghosts in the axial (or temporal) gauge. However, just like the Coulomb gauge,
the Feynman propagator is complicated in these gauges and calculation is more
difficult.

3) By choosing $t_\mu$ to be timelike, the temporal gauge is included also.
Accelerator and Detector Technology



Accelerators 



Particle physics explores the deepest inner structure of matter. The smallest size
we can explore is inversely proportional to the energy we can transfer to the
particles. For this reason, accelerators of the highest possible energy are the major
apparatus one needs. With them we can boost particles, make them collide and de- 
compose and investigate their inner structure by looking at their debris. The role 
of detectors is to identify secondary particles, reconstruct their space-time struc- 
ture by measuring energy-momentum and elucidate their nature. Particle physics 
is a science based on observation and experimentation. Its progress is inextricably 
related to the advent of technologies. The ability to devise an experiment to test 
theoretical hypotheses and to understand connections between natural phenome- 
na and predictions induced from first principles are attributes that researchers in 
the field have to acquire. To interpret experimental data correctly, it is essential to 
grasp the essence of experimental techniques and their underlying principles and 
to recognize their advantages and disadvantages. 

There are numerous examples from the past in which people, confused by false 
data, have taken a roundabout route. Acquiring correct knowledge of experimental 
techniques and cultivating one's own critical power help to assess the observed da- 
ta correctly and enhance one's ability to devise new methods that can lead to new 
ideas. However, the general knowledge that is necessary for the experimental tech- 
niques is mostly based on electromagnetism, condensed matter physics, and the 
physics of molecules, atoms and nuclei. These are, to say the least, detached from 
elementary particle physics. To avoid writing a manual-like textbook on experimen- 
tal methods by going into too much detail, I restrict myself here to describing only 
basic ideas and a few examples of representative accelerators and detectors. In this 
chapter we discuss only experimental tools. Important experiments are introduced 
whenever possible as we pick up each subject throughout the book. 
12.2

Basic Parameters of Accelerators 1)

12.2.1 

Particle Species 

The basic idea of particle acceleration is to apply an electric field to charged
particles. Let the charge of the particle be $q$, the applied voltage $V$ (in volts) and assume
the field is uniform. Then the particle is accelerated to energy

$$E = qV = q\varepsilon d \tag{12.1}$$

where $\varepsilon = dV/dx$ is the gradient of the electric potential and $d$ the length of the
field. When a particle with unit charge ($= 1.6 \times 10^{-19}$ coulomb) is accelerated by
1 volt of electric potential, we say it has acquired an energy of 1 electron volt
(eV for short). The electron volt is a small unit, and we list below larger units and
the physics typical of each energy region:

keV = $10^3$ eV     X-rays, structure of atoms and molecules

MeV = $10^6$ eV     nuclear reactions
                    $m_\pi$ (pion mass) = 139.6 MeV/$c^2$
                    hadron size $\sim \hbar/m_\pi c \sim 1.4 \times 10^{-13}$ cm

GeV = $10^9$ eV     $m_p$ (proton mass) = 0.938 GeV/$c^2$
                    $m_W$ (W boson mass) = 80.26 GeV/$c^2$
                    $(\sqrt{2}G_F)^{-1/2}$ (energy scale of electroweak interaction) = 250 GeV

TeV = $10^{12}$ eV  typical energy of particle physics frontier
                    Note: total energy of LHC 2) = 14 TeV



Particles that can be accelerated are usually limited to stable charged particles,
which are electrons ($e$) and protons ($p$), including their antiparticles. When one
wants to use a beam of unstable particles such as pions ($\pi$), kaons ($K$) and hyperons
($Y$), one irradiates matter (beryllium, iron, etc.) with protons and uses the secondary
particles that are produced. In the case of neutrinos ($\nu$) and muons ($\mu$), tertiary
particles, which are the decay products of secondary particles, are used. The energy
of the secondary or tertiary beams is usually less than half that of the primary and
the intensity is of the order of $10^{-5}$-$10^{-7}$ of the primary. Until the unified theories
were established, strong, electromagnetic and weak interactions were considered 
as independent phenomena and experiments with different particle species were 
considered as belonging to different research fields. Historically, hadron beams 
($p$, $\pi$, $K$) on fixed targets were very powerful in discovering new particles and
resonances. Charting and classifying them (hadron spectroscopy) was an important
discipline of the field. The hyperon beam [247], a secondary hadron beam that
contains a large fraction of hyperons, contributed to measuring the magnetic moment
of hyperons and elucidating their decay properties. Neutrino beam experiments in
the 1970s [145] were used to discover the neutral current and, together with
intense electron beam experiments, clarified the quark-parton structure of hadrons.
Both approaches, spectroscopy and dynamics, were instrumental in establishing
the unified theories. Muon beam experiments [124] extended the knowledge
obtained from the electron beam to the deeper inner structure of the hadron and
enhanced the accuracy obtained by the neutrino experiments, thereby clarifying
the quark-gluon structure, and contributed to establishing QCD (quantum
chromodynamics).

1) [266].

2) The Large Hadron Collider (LHC) is a proton-proton collider at CERN that started operation in

With the advent of the unified theories, however, it became possible to interpret 
various phenomena produced by different particles in a unified way. From this 
viewpoint, the energy is the most important parameter and the particle species 
plays a secondary role.3) The discovery of new particles at the energy frontier
has been accorded the foremost importance in testing the correctness of proposed 
models by a potential breakthrough. In terms of energy, colliding accelerators are 
far more advantageous than those utilizing fixed targets. From the accelerator tech- 
nological point of view, the proton is much easier to accelerate to higher energies 
than the electron. Consequently, hadron colliders (usually $p$ on $p$ or $\bar p$) have an
advantage compared to electron colliders as the energy frontier machine. Howev- 
er, the hadron is a quark composite while the electron is elementary. Therefore 
hadron-associated phenomena have an intrinsic difficulty in their analysis and the- 
oretical interpretation. It is also not clean experimentally, because the hadron is 
a strongly interacting particle and produces a great number of soft gluons, which 
appear as QCD background to a reaction one wants to investigate. The search for 
a new particle in hadron reactions is literally looking for "a needle in a haystack". 
Thus electron machines are usually more advantageous in doing precise and de- 
cisive experiments to test theoretical models, except for those unreachable in the 
electron machine. 

Today, fixed target experiments are carried out mostly in cases where the intensity 
of the beam and the species of the particle are the most important assets, namely 
when high precision in flavor physics is at stake. Examples are CP violation tests, 
rare decay modes in kaons and B-mesons, and neutrino and muon experiments 
to probe lepton flavor mixing. Generally speaking, for collider experiments at the 
energy frontier, a general purpose detector that is a combination of a variety of 
devices covering the largest possible solid angle is required. On the other hand, a 
clear and definite purpose is ascribed to fixed target experiments. They are most 
often accompanied by unique and well thought out measuring devices specific to 
the purpose. 

3) To be fair, since the Standard Model has been established, the origin of particle species is an 
unanswered question. The universe can live with just the first generation ($u$, $d$) quarks and ($\nu_e$, $e$)
leptons and the existence of the second and third families remains an unsolved "flavor problem". 
12.2.2
Energy 

The highest energy is the most fundamental parameter of an accelerator because 
it determines the minimum size of the object being probed. Equally important is 
its luminosity, which determines the number of events one can sample. It may 
be compared with the clearness of an image if the energy is the magnification of 
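a microscope. The minimum scale ($\lambda$) one can probe with an accelerator having
energy $E$ is related to the de Broglie wavelength of the incident particle:

$$\lambda = \frac{\hbar}{p} \simeq \frac{\hbar c}{E} \simeq \frac{2 \times 10^{-16}}{\sqrt{s}/2\,[\mathrm{GeV}]}\ \mathrm{m} \tag{12.2}$$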

$E$ is the energy of the colliding particles in the center of mass system. Throughout
this book, we use "$\sqrt{s} = 2E$" to denote the total center of mass energy. $s$ is
a Mandelstam variable and is defined by the following formula using the
energy-momentum of the two colliding particles (see Sect. 6.4):

$$s = c^2(p_1 + p_2)^2 = (E_1 + E_2)^2 - (\mathbf{p}_1 c + \mathbf{p}_2 c)^2 \tag{12.3}$$

In the collider accelerator, $\mathbf{p}_1 = -\mathbf{p}_2$ and $\sqrt{s} = E_1 + E_2$, the total energy of the
colliding particles.4) For fixed target experiments $\mathbf{p}_2 = 0$ and when $E_1 \gg m_1c^2, m_2c^2$

$$\sqrt{s} = \sqrt{(E_1 + m_2c^2)^2 - (\mathbf{p}_1 c)^2} \simeq \sqrt{2E_1 m_2 c^2} \tag{12.4}$$

This means the equivalent center of mass energy of the fixed target experiments 
increases only proportionally to the square root of the accelerator energy and is 
inefficient compared to the collider accelerators. This is why, with a few exceptions, 
most of the important results in the last 30-40 years have been obtained in collider 
experiments. 
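The square-root scaling of Eq. (12.4) can be made concrete with a short numerical sketch.
The Python code below is an illustration under stated assumptions: the function names are
hypothetical, natural units ($c = 1$, energies in GeV) are used, and the beam energies in
the loop are arbitrary examples. The fixed-target formula used is the exact form
$s = m_1^2 + m_2^2 + 2E_1 m_2$, which follows from Eq. (12.3) with $\mathbf{p}_2 = 0$.

```python
import math

HBARC_GEV_M = 1.973e-16   # hbar*c in GeV*m
M_PROTON = 0.938          # GeV, value quoted in the text

def sqrt_s_fixed_target(beam_energy, beam_mass, target_mass):
    """Center-of-mass energy (GeV) for a beam hitting a target at rest.

    Uses s = m1^2 + m2^2 + 2*E1*m2 (natural units, c = 1).
    """
    s = beam_mass**2 + target_mass**2 + 2.0 * beam_energy * target_mass
    return math.sqrt(s)

def sqrt_s_collider(e1, e2):
    """Center-of-mass energy (GeV) for a head-on collider with p1 = -p2."""
    return e1 + e2

def probing_scale_m(sqrt_s):
    """Rough length scale hbar*c / E in meters, with E = sqrt(s)/2 in GeV."""
    return HBARC_GEV_M / (sqrt_s / 2.0)

for beam in (100.0, 400.0, 1000.0):   # illustrative proton beam energies in GeV
    fixed = sqrt_s_fixed_target(beam, M_PROTON, M_PROTON)
    collider = sqrt_s_collider(beam, beam)
    print(f"E_beam = {beam:6.0f} GeV: fixed target sqrt(s) = {fixed:6.1f} GeV, "
          f"collider sqrt(s) = {collider:6.0f} GeV, "
          f"collider scale = {probing_scale_m(collider):.2e} m")
```

A tenfold increase in beam energy raises the fixed-target $\sqrt{s}$ by only about a factor
of three, whereas the collider $\sqrt{s}$ grows linearly with the beam energy.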

Problem 12.1 

The Tevatron at Fermilab in the USA has $\sqrt{s} = 1.96$ TeV =
(980 + 980) GeV. Derive the center of mass energy when one uses this beam
for a fixed target experiment and prove it is not possible to produce a W boson
($m_W = 80.4$ GeV). Assume a hydrogen target, $m_p = 0.938$ GeV.

Note the Tevatron [343] started as an accelerator with a beam energy of 400 GeV 
for fixed targets in 1973 and W production was one of its major aims. 



Effective Energy of Proton Colliders The proton is a composite of quarks, and the
energy that participates in the reactions is that of the constituent quarks. Assuming
the quark in the proton has a fraction $x$ of the parent proton momentum, the total
energy $\sqrt{\hat s}$ of the colliding quarks is given by

$$\sqrt{\hat s} = \sqrt{(x_1 p_1 c + x_2 p_2 c)^2} \simeq \sqrt{x_1 x_2}\,\sqrt{s} \tag{12.5}$$

The average fractional momentum $\langle x \rangle$ has been measured in deep inelastic scattering
by electrons and muons (see Chap. 17) to be 0.1-0.3, which means
$0.01\,s \lesssim \hat s \lesssim 0.1\,s$. However, the probability of quarks having $x_1 x_2 \sim 1$ is not zero and
reactions with $\sqrt{\hat s} \sim \sqrt{s}$ are not impossible. Whether the reaction is useful or not
depends on the luminosity and the skill of experimenters in sorting out signals
from the background.

4) For asymmetric colliders ($\mathbf{p}_1 \neq -\mathbf{p}_2$) such as B-factories (KEKB and PEP II), $\sqrt{s}$ has to be
calculated using Eq. (12.3). The B-factory is a high luminosity accelerator that operates at
$\sqrt{s} = 2m_B \sim 10$ GeV designed to produce copious B-particles, which contain b-quarks.
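As a rough numerical illustration of Eq. (12.5), the following Python sketch evaluates the
effective quark-quark energy for a few momentum fractions. The function name is
hypothetical, and taking $x_1 = x_2$ is an assumption made only to keep the example short;
the 14 TeV value is the LHC total energy quoted earlier in this section.

```python
import math

def parton_sqrt_s(sqrt_s, x1, x2):
    """Effective energy of two colliding quarks, Eq. (12.5): sqrt(x1*x2) * sqrt(s)."""
    if not (0.0 <= x1 <= 1.0 and 0.0 <= x2 <= 1.0):
        raise ValueError("momentum fractions must lie between 0 and 1")
    return math.sqrt(x1 * x2) * sqrt_s

SQRT_S_LHC = 14.0   # TeV, total energy quoted in the text

for x in (0.1, 0.2, 0.3):   # typical average momentum fractions
    effective = parton_sqrt_s(SQRT_S_LHC, x, x)
    print(f"x1 = x2 = {x:.1f}: effective energy = {effective:.1f} TeV")
```

With typical fractions the effective energy is only a few TeV, well below the nominal
collision energy of the protons.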

12.2.3 
Luminosity 

Events that a detector picks up are mixtures of signals and background. The
luminosity $L$ plays a critical role in extracting clean signals out of unwanted
backgrounds. It is defined as the quantity that gives the event rate when multiplied by
the cross section $\sigma$:

$$N\ (\text{reaction rate/s}) = L\sigma \tag{12.6}$$

The unit of $L$ is "per square centimeter per second" (cm$^{-2}$ s$^{-1}$). The cross section
$\sigma$ is predetermined by nature and is not controllable. Therefore the number of
events, a decisive factor in reducing statistical errors, is directly proportional to the
luminosity. When large background signals exist, one has to filter them out usually 
by applying cuts in various kinematical variables, such as momenta, angles and 
spatial as well as time locations. However it is not possible to make an ideal filter 
that is capable of cutting out all the unwanted events while passing all the signals. 
In most cases, the signals are reduced, too. When the background signals are 
abundant, the events have to go through many filters and the number of surviving 
signals critically depends on the luminosity. The luminosity of a circular collider is 
given by [240, 316] 



$$L = \frac{N_1 N_2 f}{n_b A}$$

$N_1$, $N_2$ : total number of particles in the beam
$f$ : revolution frequency of the particles
$n_b$ : number of bunches in a circle (the beam is not continuous but bunched)
$A$ : the beam's cross-sectional area at the colliding point



: beam amplitude at the interaction point 
The beam emittance is defined as a phase space area in the ($\beta_i$, $\beta_i' = d\beta_i/dz$)-plane
of the particles in the beam, where $z$ is the coordinate in the beam direction.
Namely

$$\varepsilon_i = \int \beta_i'\,d\beta_i \tag{12.7}$$

Luminosities that can be achieved by electron and proton beams are slightly
different. $L_e = 10^{34}$ cm$^{-2}$ s$^{-1}$ has been achieved at B-factories, which is an
approximately $10^2$ increase compared to their predecessors, and $L_p \sim 3 \times 10^{32}$ cm$^{-2}$ s$^{-1}$
has been achieved at the Tevatron. At LHC the designed luminosity is
$L_p = 10^{34}$ cm$^{-2}$ s$^{-1}$ and up to $10^{35}$ cm$^{-2}$ s$^{-1}$ is envisaged in the future. For
fixed-target experiments, the reaction rate $N$ (s$^{-1}$) can be calculated using the number of
incoming particles ($N_{in}$) and target particles ($N_{tgt}$) as

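$$N = N_{in}\,\sigma\,N_{tgt} = N_{in}\,\sigma\,\rho\,\ell\,N_A \tag{12.8}$$

$\rho$ : density of the target in g/cm$^3$
$\ell$ : the length of the target in cm
$N_A$ : Avogadro number $= 6.022 \times 10^{23}$ mol$^{-1}$

Comparing with Eq. (12.6), the luminosity is

$$L = N_{in}\,\rho\,\ell\,N_A \tag{12.9}$$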

As a typical example, taking $\rho = 10$ g/cm$^3$, $\ell = 1$ cm, $N_{in} = 10^7$ s$^{-1}$ (secondary
beam) to $10^{12}$ s$^{-1}$ (primary beam) gives $L = 10^{32}$ to $10^{37}$ cm$^{-2}$ s$^{-1}$. Until the end of the
20th century, the luminosity at a fixed target was generally larger but the gap is
closing fast. In electron collider experiments, a basic cross section is given by the
process $e^-e^+ \to \mu^+\mu^-$. Note, in the Standard Model, for colliding $e^-e^+$ all the
production cross sections by the electroweak interaction are of the same order and
the following number is a good number to keep in mind:

$$\sigma_{\mu\mu} \equiv \sigma(e^-e^+ \to \mu^+\mu^-) = \frac{4\pi\alpha^2}{3s} = \frac{87\ [\mathrm{nb}]}{s\,[\mathrm{GeV}^2]} \simeq \frac{10^{-31}}{s\,[\mathrm{GeV}^2]}\ [\mathrm{cm}^2] \tag{12.10}$$

where nb means nanobarn. At $\sqrt{s} = 10$ GeV, $L = 10^{34}$ cm$^{-2}$ s$^{-1}$, $N \sim 10$ s$^{-1}$.
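The estimates of Eqs. (12.6), (12.9) and (12.10) are easy to reproduce numerically. The
following Python sketch is a hedged illustration: the function names are hypothetical,
and the constants $\alpha = 1/137.036$ and $(\hbar c)^2 = 3.894 \times 10^5$ nb GeV$^2$ are standard
values that are not quoted in the text.

```python
import math

N_AVOGADRO = 6.022e23       # mol^-1, as quoted in the text
ALPHA = 1.0 / 137.036       # fine-structure constant (standard value)
HBARC2_NB_GEV2 = 3.894e5    # (hbar*c)^2 in nb*GeV^2, converts GeV^-2 to nb
NB_TO_CM2 = 1.0e-33         # 1 nanobarn in cm^2

def fixed_target_luminosity(n_in, density, length):
    """Eq. (12.9): L = N_in * rho * l * N_A in cm^-2 s^-1.

    n_in in particles/s, density in g/cm^3, length in cm.
    """
    return n_in * density * length * N_AVOGADRO

def sigma_mumu_nb(s_gev2):
    """Eq. (12.10): lowest-order e- e+ -> mu+ mu- cross section in nb."""
    return 4.0 * math.pi * ALPHA**2 / (3.0 * s_gev2) * HBARC2_NB_GEV2

def event_rate(luminosity, sigma_cm2):
    """Eq. (12.6): N = L * sigma, in events per second."""
    return luminosity * sigma_cm2

# Fixed-target example from the text: rho = 10 g/cm^3, l = 1 cm.
for n_in in (1e7, 1e12):
    lum = fixed_target_luminosity(n_in, 10.0, 1.0)
    print(f"N_in = {n_in:.0e} /s -> L = {lum:.1e} cm^-2 s^-1")

# Muon-pair cross section and rate at sqrt(s) = 10 GeV, L = 1e34 cm^-2 s^-1.
sigma_nb = sigma_mumu_nb(10.0**2)
rate = event_rate(1e34, sigma_nb * NB_TO_CM2)
print(f"sigma_mumu(1 GeV^2) = {sigma_mumu_nb(1.0):.1f} nb")
print(f"sigma_mumu(100 GeV^2) = {sigma_nb:.3f} nb -> rate = {rate:.1f} /s")
```

Only the lowest-order photon-exchange cross section is included here; resonances and
higher-order corrections are ignored in this sketch.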

Signal-to-Noise Ratio (S/N) As the proton is a quark composite, quarks that have
not participated in the reaction (called spectators) escape from the parent proton,
radiating soft gluons, which hadronize, producing jets and constituting the
so-called QCD backgrounds. For this reason, clean signals (with large S/N ratio) are
hard to obtain in hadron colliders. Suppose one wants to detect a Higgs boson. Its
production cross section is $\sim 10^{-35}$ cm$^2$. Furthermore, the decay branching ratio
to lepton pairs ($H^0 \to e^-e^+, \mu^-\mu^+$), which are suitable for identifying the Higgs,
is $\sim 10^{-4}$. In contrast, a typical hadronic cross section that appears as QCD
backgrounds in hadron colliders is $\sigma \sim 10^{-25}$ cm$^2$ and with $L = 10^{32}$ to $10^{34}$ cm$^{-2}$ s$^{-1}$
produces events at a rate of $N = 10^7$ to $10^9$ s$^{-1}$, a formidable number to cope
with. The S/N ratio at its most naive level is estimated to be $10^{-14}$, a challenge for
experimentalists.
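The naive S/N estimate above is a ratio of cross sections and can be checked in a few
lines. The Python sketch below uses only the order-of-magnitude numbers quoted in the
paragraph; the function names and the conversion to events per day are assumptions added
for illustration.

```python
SIGMA_HIGGS = 1e-35    # cm^2, production cross section quoted in the text
BR_LEPTONS = 1e-4      # branching ratio to lepton pairs quoted in the text
SIGMA_QCD = 1e-25      # cm^2, typical hadronic cross section quoted in the text
SECONDS_PER_DAY = 86_400

def event_rate(luminosity, sigma):
    """Eq. (12.6): events per second for luminosity in cm^-2 s^-1 and sigma in cm^2."""
    return luminosity * sigma

def naive_signal_to_noise(sigma_signal, branching_ratio, sigma_background):
    """Ratio of signal to background cross sections; the luminosity cancels."""
    return sigma_signal * branching_ratio / sigma_background

s_over_n = naive_signal_to_noise(SIGMA_HIGGS, BR_LEPTONS, SIGMA_QCD)
print(f"naive S/N = {s_over_n:.1e}")

for lum in (1e32, 1e34):
    background = event_rate(lum, SIGMA_QCD)
    signal_per_day = event_rate(lum, SIGMA_HIGGS * BR_LEPTONS) * SECONDS_PER_DAY
    print(f"L = {lum:.0e}: background = {background:.1e} /s, "
          f"signal = {signal_per_day:.2e} /day")
```

Because the luminosity cancels in the ratio, raising it does not improve the naive S/N;
it only increases the absolute number of signal events that survive the filters.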

At electron colliders, production cross sections of interest relative to that of
$e^-e^+ \to \mu^+\mu^-$ are usually of the order of $10^{-1}$ and that of hadronic production is $\sim$ 3-4,
therefore S/N ratios are much better. Generally speaking, electron colliders are
advantageous in that, because the incoming particles are elementary, theoretical
interpretations are straightforward. Furthermore, the whole energy of the colliding
electrons can be used to produce the wanted particles and the backgrounds are
few. Therefore, signals are cleaner and easier to interpret theoretically. However,
the electron, being much lighter than the proton ($m_p/m_e \simeq 2000$), emits photons
if bent by magnetic fields (synchrotron radiation). Synchrotron light source
facilities take advantage of this and are extensively used in the fields of materials science
as well as atomic, molecular and biophysics. From the high energy physics point
of view, it is a pure loss. Since the loss increases as $\sim E^4$, it is increasingly difficult
to construct a high energy, high luminosity circular accelerator for electrons. It is
believed that the LEP 5) at CERN with its energy $\sqrt{s} = 200$ GeV is about the limit
of a circular machine. If higher energy is desired, one has to rely on linear
colliders. So far only one linear collider has been operational, the SLC at SLAC with
$\sqrt{s} \sim 90$ GeV, and its technology is not as mature as that of circular accelerators.

In summary, with hadron colliders, innovative approaches to detector designs 
are required to cope with copious backgrounds, whereas with electron colliders 
developing advanced accelerator technology is the critical factor. With present tech- 
nology, when physics of the TeV region is to be investigated, hadron machines are 
easier to construct, but linear colliders of the TeV range are a must for future ex- 
periments and extensive R&D is being carried out. 
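The $E^4$ growth of the synchrotron radiation loss mentioned above can be illustrated
numerically. The following Python sketch is not taken from the text: it assumes the
standard expression for the energy loss per turn of a highly relativistic particle,
$U_0 = C_\gamma E^4/\rho$ with $C_\gamma = 8.846 \times 10^{-5}$ m/GeV$^3$ for electrons, scaled by $(m_e/m)^4$
for other particles. The bending radius of 3000 m is a hypothetical value chosen only for
illustration, and the function name is likewise an assumption.

```python
C_GAMMA_ELECTRON = 8.846e-5   # m / GeV^3, standard radiation constant for electrons
M_ELECTRON = 0.000511         # GeV
M_PROTON = 0.938              # GeV

def energy_loss_per_turn(energy_gev, bending_radius_m, mass_gev=M_ELECTRON):
    """Synchrotron radiation loss per turn in GeV (highly relativistic particle).

    U0 = C_gamma * E^4 / rho, with C_gamma scaled by (m_e / m)^4 for other masses.
    """
    c_gamma = C_GAMMA_ELECTRON * (M_ELECTRON / mass_gev) ** 4
    return c_gamma * energy_gev**4 / bending_radius_m

RADIUS = 3000.0   # m, hypothetical bending radius for illustration only

for energy in (50.0, 100.0, 200.0):   # beam energies in GeV
    loss_e = energy_loss_per_turn(energy, RADIUS)
    loss_p = energy_loss_per_turn(energy, RADIUS, mass_gev=M_PROTON)
    print(f"E = {energy:5.0f} GeV: electron loss = {loss_e:9.3f} GeV/turn, "
          f"proton loss = {loss_p:.2e} GeV/turn")
```

Doubling the beam energy at fixed radius multiplies the loss by 16, while for protons of
the same energy the loss is suppressed by roughly thirteen orders of magnitude.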



12.3 

Various Types of Accelerators 

12.3.1 

Low-Energy Accelerators 

One accelerator on its own, whatever its mechanism, cannot accelerate particles
above an energy of about 1 GeV, and a variety of accelerators are utilized in cascade
to go beyond this. The final stage of a high energy accelerator complex, so far, with
the exception of the SLAC linear accelerator (SLC), has always been a synchrotron,
a kind of circular accelerator. It confines particles inside a circular pipe
with fixed radius, places an accelerator tube at one point or two and accelerates
the particles every time they pass the tube. As a single acceleration tube can be
used many times, it is very effective and has a high performance-to-cost ratio.
Electrons or protons emanating from a source are first accelerated by a static electric
field accelerator (usually a Cockcroft-Walton accelerator) to a few hundred thousand
volts, then further accelerated by a linac to hundreds of millions of electron volts, and
finally injected into a synchrotron. As the particles can be accelerated more
effectively if the ratio of the final energy to the injected energy is small, two or more
synchrotrons are used in cascade. The first of them is often called a booster or an
accumulator and that in the final stage the main ring.

5) An electron collider that operated from 1989 to 2000 with $\sqrt{s} = 2m_Z$ (180 GeV) $\sim$ 200 GeV which
provided precision data to confirm the validity of the Standard Model.

Cockcroft-Walton This type of accelerator uses a static electric field to accelerate
particles and has the structure shown in Fig. 12.1. A combination condenser-diode
circuit is arranged to work as a parallel connection in charging the capacitors and a
series connection in discharging them. Figure 12.1 shows an eightfold multiplication
of the applied voltage. The Cockcroft-Walton accelerator used as the pre-accelerator
of the 12 GeV proton synchrotron (hereafter referred to as PS) at KEK 6) produced
750 kV.

Figure 12.1 Cockcroft-Walton accelerator: The circuit is connected
in such a way that applied voltages are arranged in parallel
when charging and in series when discharging, giving the
n-fold (n = 8 in the figure) voltage.



Linac Protons accelerated by the Cockcroft-Walton accelerator are injected into a
linear accelerator (linac). It is composed of a series of electrodes in a cavity (a hollow
tube made of copper), to which an alternating electric voltage is applied by a radio
frequency (RF) supplier (see Fig. 12.2).

The polarity of the applied voltage is arranged to decelerate the particles when
they pass the cavities and to accelerate them when they cross the gaps between
them. They feel no voltage inside the cavity because of its electric shielding. The
flow of injected protons is continuous, but only those synchronized with the
applied voltage are accelerated and consequently they emerge bunched as a series of
discontinuous bundles. The length of the cavity needs to be longer as the
particles proceed because they fly faster each time they are accelerated. The accelerating
voltage is usually limited to a few MeV (million electron volts) per meter, and an
energy of 100-200 MeV is reached at the final stage. Electrons acquire near-light
velocity at a few MeV, so the length becomes constant after a meter or two. The
longest electron linac is SLC at SLAC (Stanford Linear Accelerator Center), which
has $\sqrt{s} = 90$ (45 + 45) GeV, and a length of 3 km. It uses 230 klystrons (large
vacuum-tube RF amplifiers) and makes two beams of electrons and positrons collide
60 times per second. The positrons are made by bombarding matter with high-energy
electrons (34 GeV at SLC), returning them to the starting point and re-accelerating
them using the same accelerator tube a half-period of the RF later.

Figure 12.2 Proton Linear Accelerator: The accelerating RF voltage is applied to particles when
they cross gaps and a decelerating voltage is applied when they are in the tube. As the particles
gain energy, their velocity increases and the corresponding tube is longer.

6) High Energy A

Cyclotron A charged particle with its velocity perpendicular to a uniform magnetic
field makes a circular motion. When the particle with mass $m$, charge $q$ and
momentum $p$ moves on a circular orbit of radius $R$ in a magnetic field of strength $B$,
the equation of motion with centripetal force supplied by the magnetic field is given by

$$\frac{mv^2}{R} = qvB \quad\Rightarrow\quad p = mv = qBR, \qquad p\,[\mathrm{GeV}/c] = 0.3\,B\,R\ [\mathrm{T\,m}] \tag{12.11}$$

where T stands for tesla. The equation is valid at relativistic energy if the relativistic
mass ($m \to m\gamma = m/\sqrt{1-(v/c)^2}$) is used. The radius of curvature is proportional
to the momentum and the inverse of the field strength. The angular frequency is
given by

$$\omega = 2\pi\,\frac{v}{2\pi R} = \frac{qB}{m} \tag{12.12}$$

and is referred to as the cyclotron frequency. When the particle motion is
nonrelativistic ($v \ll c$), $\omega$ is a constant if the magnetic field strength is constant. In the
cyclotron a pair of half-cut hollow discs called D-electrodes or "dees" are placed in
a uniform magnetic field (Fig. 12.3a) and an RF voltage is applied across the dees.
The particles, injected near the center of the magnetic field, are accelerated only
when they pass through the gap between the electrodes. The combined effect of
the magnetic field and the acceleration forces the particles to travel in a spiral path
and eventually leave the dees (Fig. 12.3b).

Figure 12.3 Layout of a cyclotron: (a) The cross section. Two electrodes, the dees, are in
the magnetic field. (b) RF voltage is applied between the two dees to accelerate particles
when they pass the gap. The orbital radius grows as the particles are accelerated, but the
revolution frequency stays the same.
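The relations between momentum, field and orbit radius, and between field and revolution
frequency, can be tried out numerically. The Python sketch below is an illustration
under stated assumptions: the function names are hypothetical, a unit-charge particle is
assumed, the 1 T field is an arbitrary example, and the physical constants are standard
values not quoted in the text.

```python
import math

E_CHARGE = 1.602176634e-19     # C, elementary charge
M_PROTON_KG = 1.67262192e-27   # kg
M_PROTON_GEV = 0.938           # GeV/c^2, value quoted in the text

def bending_radius_m(p_gev, b_tesla):
    """Orbit radius for a unit-charge particle: R = p / (0.3 B), p in GeV/c, B in T."""
    return p_gev / (0.299792458 * b_tesla)

def gamma_from_kinetic(t_gev, mass_gev=M_PROTON_GEV):
    """Lorentz factor for kinetic energy t_gev: gamma = 1 + T / (m c^2)."""
    return 1.0 + t_gev / mass_gev

def revolution_frequency_hz(b_tesla, gamma=1.0, mass_kg=M_PROTON_KG):
    """Revolution frequency f = omega / (2 pi) = q B / (2 pi m gamma)."""
    return E_CHARGE * b_tesla / (2.0 * math.pi * mass_kg * gamma)

B_FIELD = 1.0   # T, arbitrary example field

print(f"R for p = 1 GeV/c in {B_FIELD} T: {bending_radius_m(1.0, B_FIELD):.3f} m")
for t_mev in (0.0, 15.0, 590.0):   # proton kinetic energies in MeV
    gamma = gamma_from_kinetic(t_mev / 1000.0)
    f_mhz = revolution_frequency_hz(B_FIELD, gamma) / 1e6
    print(f"T = {t_mev:5.0f} MeV: gamma = {gamma:.3f}, f = {f_mhz:.2f} MHz")
```

The frequency stays nearly constant at low kinetic energy and falls as $\gamma$ grows, which
is why a fixed-frequency RF eventually goes out of step with the circulating particles.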

Synchrocyclotron As particles are accelerated, their mass increases, the cyclotron 
frequency is no longer constant and their circular motion is not synchronous with 
the accelerating field. The cyclotron's acceleration is effectively limited to v/c < 0.1 
or in the case of protons about 15 MeV. To overcome this limit, either $\omega$ or $B$ has to
be changed. The synchrocyclotron increases the maximum energy by decreasing 
the frequency as the mass increases. The disadvantage of this method is that only 
clusters of particles can be accelerated. Once the RF is synchronized to an injected 
cluster, the next cluster has to wait until the preceding cluster has reached its min- 
imum frequency and is extracted. The intensity is reduced to a fraction (~ 1/100) 
of that of the cyclotron. 

Isochronous Cyclotron In the isochronous cyclotron, in contrast to the synchrocy- 
clotron, the RF is kept constant and the magnetic field is synchronized with the 
mass increase. This is achieved by dividing the poles of the magnet into concentric 
sectors, and this type of cyclotron is referred to as a sector or ring cyclotron. The 
particles fly freely between the sectors and their trajectories are bent only when 
they pass through the sector magnets. The shape of the sectors is designed to keep 
the combined total flight time in free space and in the sectors constant as the par- 
ticle becomes heavier. Thus the sector cyclotron can achieve high energy (several 
hundred MeV for protons) without loss of intensity. Figure 12.4 shows an example, 
the 590 MeV Ring Cyclotron of PSI (Paul Scherrer Institute in Switzerland), from
which a 2 mA DC beam can be extracted [328]. The highest energy is limited by the
cost of making giant magnets.

Figure 12.4 (a) PSI 590 MeV Ring Cyclotron. The beam is accelerated first to 870 kV by a
Cockcroft-Walton accelerator, then pre-accelerated to 72 MeV by a small ring cyclotron
and injected into the 590 MeV Ring Cyclotron. (b) Floor layout of the Ring cyclotron. Four
accelerating tubes (KAV1-4) and eight sector magnets (SM1-8) are shown. The sector
magnets are of C-type with the open edge directed toward the center. The beam comes in
from the left, is injected into the inner edge of SM3 at the upper right and extracted from
AHB at the upper right. Photo courtesy of Paul Scherrer Institute [328].



12.3.2 
Synchrotron 



As is shown by Eq. (12.11), if one wants to increase the energy, either B or R or
both have to be increased. The synchrotron keeps the radius of the circle constant
by increasing the strength of the magnetic field. If one wants to go higher than, say,
1 GeV energy, it is the most practical method. It is accompanied by one or more
pre-accelerators because a certain amount of energy is necessary to keep particles on
a circular orbit of a definite radius. Pre-accelerators usually consist of a
Cockcroft-Walton accelerator followed by a linear accelerator. Sometimes two
synchrotrons are used in cascade to bring the energy even higher. This is because the
acceptance of the injected beam can be made higher if the ratio of injected to maximum
energy is lower, and there is a net gain in intensity. For this reason the first
accelerator is called a booster and the last the main ring. Figure 12.5 shows CERN's 7)
accelerator complex [93]. Protons are first accelerated to 50 MeV by a linac, then to
1.4 GeV by the booster, to 28 GeV by the PS (Proton Synchrotron), to 450 GeV by
the SPS (Super PS) and finally to 7 TeV by the LHC main ring. PS was the main
ring between 1960 and 1980 and is still used for a variety of experiments. SPS,
originally built as a fixed-target proton synchrotron, was converted into an
antiproton-proton collider and achieved the historic discovery of the W and Z^0
bosons. It also served as an intermediate accelerator for electrons and positrons to
be injected into the Large Electron-Positron Collider (LEP). LEP used to occupy the
tunnel that now houses the LHC main ring; it operated from 1989 to 2000 at √s = m_Z
and later at √s = 200 GeV. It was the largest electron collider and instrumental in
establishing the Standard Model of particle physics.

Figure 12.5 CERN accelerator complex: Protons are accelerated first to 50 MeV by
a linac, then to 1.4 GeV by the booster, to 25 GeV by the PS (Proton Synchrotron), to
450 GeV by the SPS (Super PS) [133] and finally to 7 TeV by the LHC main ring. ISOLDE
is used to study unstable nuclei. Part of the beam is used for fixed-target
experiments. For instance, CNGS (CERN Neutrinos to Gran Sasso) is a neutrino beam for
neutrino oscillation experiments carried out at LNGS (the Gran Sasso National
Laboratory) of INFN (Istituto Nazionale di Fisica Nucleare) in Italy 730 km away.
Taken from [93].



Elements of the Synchrotron The magnets that are used to keep the particles on 
their circular trajectories are of dipole type, like the one shown in Fig. 12.6a, or 
C-type, obtained by cutting the magnet at the center. 

The accelerating tube is an RF cavity and is placed at a point on the circle. The
frequency is synchronized so that it accelerates the particles every time they pass the
RF cavity. With present technologies, the maximum field strength of the magnet
is limited to ~ 1.4 T (normal conducting) or 3 T (Tevatron) to 10 T (LHC)
(superconducting). 8) Let us consider the cases of the Tevatron (980 + 980 GeV) and LHC
(7 + 7 TeV). Inserting B = 3 T (8.3 T), R = 1 km (2.8 km), Eq. (12.11) tells us that
E ~ pc ~ 900 GeV (7 TeV). Namely, the size of the accelerator becomes larger
in proportion to the maximum energy. The revolution frequency at the Tevatron
(LHC) is ~ 10^5 (10^4) turns per second and the energy gain per turn is ~ 1 MeV; it
takes ~ 10 s (~ 700 s) to reach the maximum energy. When a continuous beam is injected
into the linac, it becomes bunched in the synchrotron with a cluster length of ~ 1 mm.

Figure 12.6 Cross sections transverse to the particle path of a typical dipole (a) and
a quadrupole magnet (b). Thin lines with arrows denote magnetic field directions and
thick arrows denote magnetic forces. In the dipole, the force is one directional, but
in the quadrupole, it is inward (converging) horizontally but outward (diverging)
vertically. (c) A pair of quadrupoles can make the particle rays convergent in both
the x and y directions, acting like a combination of convex and concave lenses.

7) European Organization for Nuclear Research. The acronym CERN derives from its
French name, Conseil Européen pour la Recherche Nucléaire.
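
The numbers quoted above can be checked with a few lines of Python. This is only a sketch: it assumes that Eq. (12.11) is the usual relation p [GeV/c] = 0.3 B [T] R [m] for a singly charged particle, and the function names are illustrative, not taken from the text.

```python
def momentum_gev(b_tesla, radius_m):
    """Momentum in GeV/c of a unit-charge particle on a circle of radius R
    in a field B, assuming p = 0.3 * B * R (taken here to be Eq. 12.11)."""
    return 0.3 * b_tesla * radius_m


def ramp_time_s(e_max_gev, gain_per_turn_mev, rev_freq_hz):
    """Time needed to reach e_max when each turn adds gain_per_turn."""
    turns = e_max_gev * 1.0e3 / gain_per_turn_mev
    return turns / rev_freq_hz


# Tevatron-like and LHC-like parameters as quoted in the text
p_tevatron = momentum_gev(3.0, 1000.0)
p_lhc = momentum_gev(8.3, 2800.0)
t_tevatron = ramp_time_s(900.0, 1.0, 1.0e5)
t_lhc = ramp_time_s(7000.0, 1.0, 1.0e4)

print(f"Tevatron: p ~ {p_tevatron:.0f} GeV/c, ramp ~ {t_tevatron:.0f} s")
print(f"LHC:      p ~ {p_lhc:.0f} GeV/c, ramp ~ {t_lhc:.0f} s")
```

With these inputs the momenta come out near 900 GeV/c and 7 TeV/c, and the ramp times near 10 s and 700 s, in line with the estimates in the text.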



Beam Stability Usually particles in the beam have dispersion (spread) in their ve- 
locity as well as their position. The accelerator will work only when the particles 
that are not exactly synchronous with those in the central orbit tend to converge 
or oscillate around the center. If their motion diverges, they will not be accelerated.

i) Betatron Oscillation Stability Transverse oscillation relative to the orbit is called
betatron oscillation. To control the betatron oscillation and keep the particles on
the orbit, quadrupole-type magnets are used. They have the structure shown in
Fig. 12.6b. With the configuration depicted in the figure, the magnetic field exerts
a converging force (focus, F) on the particles in the x direction and a diverging force
(defocus, D) in the y direction. The quadrupole magnet thus acts as a convex lens in
one plane and as a concave lens in the other. Just as a combination of concave and
convex lenses placed at a distance d has a combined focal length F, which is given by

$$ \frac{1}{F} = \frac{1}{f_1} + \frac{1}{f_2} - \frac{d}{f_1 f_2} \tag{12.13} $$

a pair of quadrupole magnets acts as a focusing device. If we take, for example,
f_1 = -f_2 = f, then F = f^2/d > 0 and the pair can exert a converging force in both
the x and y directions (Fig. 12.6c). In the synchrotron dipole magnets are placed at
equal distances to keep the particles moving on the circular orbit and quadrupoles
are distributed between the dipoles in a configuration (FDFDFD...). The use of
many FD combinations is called strong focusing.

8) For a short time, magnetic field strengths of several hundred tesla have been
achieved. Here, we are talking of industrial-level magnets that are of large size and
capable of stable operation.

Figure 12.7 Synchrotron oscillation and phase stability. Point O denotes where the
revolution frequency and the acceleration frequency synchronize. A particle which
arrives early (E) (late, L) receives larger (smaller) acceleration and its orbital
radius becomes larger (smaller). Therefore, it takes more (less) time to go around and
arrives later (E') (earlier, L') at the next period.
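
The doublet argument can be verified with thin-lens transfer matrices. The sketch below assumes ideal thin lenses and a field-free drift between them; the helper names (`thin_lens`, `drift`, `doublet_focal_length`) and the numerical values of f and d are illustrative only.

```python
def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return (
        (a[0][0] * b[0][0] + a[0][1] * b[1][0], a[0][0] * b[0][1] + a[0][1] * b[1][1]),
        (a[1][0] * b[0][0] + a[1][1] * b[1][0], a[1][0] * b[0][1] + a[1][1] * b[1][1]),
    )


def thin_lens(f):
    """Transfer matrix of a thin lens with focal length f (f < 0: defocusing)."""
    return ((1.0, 0.0), (-1.0 / f, 1.0))


def drift(d):
    """Transfer matrix of a field-free drift of length d."""
    return ((1.0, d), (0.0, 1.0))


def doublet_focal_length(f1, f2, d):
    """Combined focal length F of lens f1, drift d, lens f2 (in beam order).
    The lower-left matrix element equals -1/F."""
    m = matmul(thin_lens(f2), matmul(drift(d), thin_lens(f1)))
    return -1.0 / m[1][0]


f, d = 10.0, 2.0
F_x = doublet_focal_length(+f, -f, d)  # x plane sees F then D
F_y = doublet_focal_length(-f, +f, d)  # y plane sees D then F
print(F_x, F_y)  # both equal f**2 / d
```

Both planes give the same positive focal length f^2/d, which is the net focusing used in the FD lattice.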

ii) Phase Stability Longitudinal oscillation in the beam direction around the central
point synchronized with the applied RF voltage is called synchrotron oscillation.
Suppose the circular motion of particles at point O (see Fig. 12.7) is synchronous
with the RF frequency and they come exactly back to point O after a period. The phase
of point O is called the synchronous phase. Particles at E, which arrive earlier,
experience higher acceleration, which causes their orbit to expand and consequently
they take more time to circulate, arriving later at E' in the next period.
On the other hand, particles at L, arriving later, receive less acceleration and arrive
earlier, at L', next time. Therefore the particles' phase tends to converge to the
synchronous phase and thus the beam can be accelerated stably. The convergence of
the longitudinal motion in the particles' orbit is referred to as phase stability.

Synchrotron radiation, which is described in the next paragraph, attenuates both
the betatron and synchrotron oscillation and thus has the effect of stabilizing the
beam.


Synchrotron Radiation Synchrotron radiation is the phenomenon that a particle
emits light as its path is bent by the magnetic field. Because of this, the energy
cannot be made arbitrarily large in proportion to the radius. At some point the
energy loss due to synchrotron radiation becomes so large that the compensation of
the energy becomes technically formidable. The energy loss ΔE per turn due to
synchrotron radiation is given by

$$ \Delta E\,[\mathrm{MeV}] \simeq 0.0885\,\frac{(E\,[\mathrm{GeV}])^4}{R\,[\mathrm{m}]} \quad \text{for the electron} $$

At LEP, R = 2804 m and E = 100 GeV, which gives ΔE = 3.2 GeV/turn; the radiated power
per ring is 16 MW. At LHC, E = 7 TeV, which gives ΔE = 6.7 keV/turn.

Synchrotron radiation is intense and well collimated, which makes it a high-quality
light source, but it is a loss for an accelerator. The proton being 2000
times heavier than the electron, the synchrotron radiation loss in the proton
machine is smaller by a factor (2000)^4 ~ 10^13 and hence has never been a problem.
LHC is the first machine to consider the loss seriously, but at E = 7 TeV,
R = 2804 m the energy loss is a mere 6.7 keV/turn. Synchrotron radiation is the main
reason why the technologically available maximum energy of the electron accelerator
is roughly a tenth that of the proton machine.
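
As a quick numerical check of the two examples, the electron formula can be scaled to other particles by the fourth power of the mass ratio. This is a sketch only: the function name and the mass values (0.511 MeV and 938.272 MeV) are assumptions introduced here, not quantities defined in the text.

```python
M_ELECTRON_MEV = 0.510999
M_PROTON_MEV = 938.272


def loss_per_turn_mev(e_gev, radius_m, mass_mev=M_ELECTRON_MEV):
    """Synchrotron radiation loss per turn in MeV.
    Uses 0.0885 * E^4 / R for electrons and rescales by (m_e / m)^4."""
    electron_loss = 0.0885 * e_gev**4 / radius_m
    return electron_loss * (M_ELECTRON_MEV / mass_mev) ** 4


lep = loss_per_turn_mev(100.0, 2804.0)                 # electrons at LEP
lhc = loss_per_turn_mev(7000.0, 2804.0, M_PROTON_MEV)  # protons at LHC

print(f"LEP: {lep / 1e3:.2f} GeV per turn")
print(f"LHC: {lhc * 1e3:.2f} keV per turn")
```

The results are close to the 3.2 GeV and 6.7 keV per turn quoted above.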



12.3.3 

Linear Collider 

As the energy loss due to synchrotron radiation increases rapidly, like ~ E^4, and
because of the already formidable energy loss, a circular electron machine beyond the
size of the LEP is considered impractical. This is why a linear collider is being
considered as the future machine for electron-positron collisions. The first working
linear collider was SLC (Stanford Linear Collider), which operated at √s = 90 GeV.
The technology of the linear collider is not mature and SLC could barely compete
with LEP in producing physics results. However, for precision physics beyond the
TeV region, it is considered a must and a team of collaborators from all over the
world is working to realize the ILC (International Linear Collider). A starting
energy of √s = 500 GeV is envisaged, later to be increased to 1-2 TeV. The main
difficulties of the linear collider are:

1. Unlike circular accelerators, an accelerating tube can accelerate particles only
once on a trajectory. Therefore, an increase of the acceleration voltage per unit
length is necessary from the present level of ~ 1 MeV/m to ~ 100 MeV/m. If
this is achieved, the total length of the accelerator becomes
1 TeV / (100 MeV/m) ~ 10 km.

2. As two beams can collide only once, the collision probability has to be
increased, too. This requires the colliding beam size to be of the order of ~ 0.1 μm
or less compared to ~ 10 μm at present.

3. As a corollary to the second item, technologies have to be developed to maintain
and control the accuracy of 0.1 μm over a distance of 10 km.

12.4
Particle Interactions with Matter 9)



12.4.1 

Some Basic Concepts 



A particle detector is a device that determines the energy, momentum, spatial posi- 
tion or time response of an incoming particle by inducing interactions with specific 
materials suitable for the purpose. It is useful to understand basic features of inter- 
actions that are used for measurement before discussing individual instruments. 
Particles are classified roughly into two classes depending on their interactions 
with matter: hadrons, which interact strongly, and electrons, photons and muons, 
which interact only electromagnetically. From the detector point of view, weak in- 
teractions (and gravity, too) can be neglected. Neutrinos are usually recognized as 
"missing energy" except for specific devices to detect neutrino interactions directly. 
Suppose a particle with definite energy and direction passes through a material.
It interacts either electromagnetically (Fig. 12.8a, b) or strongly (c, d). A
strongly interacting particle is usually elastically scattered if the energy is low
(< a few hundred MeV), but at high energy (E ≫ m_p c^2 ~ 1 GeV) its reaction is
inelastic, producing one or more particles. From the incident particle point of view,
the energy is regarded as lost or absorbed by the material. In the case of
electromagnetic interactions, the incident particle loses its energy (E → E - ΔE) by
scattering electrons (namely ionizing atoms). It also undergoes multiple scattering, a
series of small-angle scatterings by the atomic Coulomb potential, and changes its angle
θ → θ + Δθ. If a particle is energetic enough, secondary particles produced by
strong or electromagnetic interactions induce a series of inelastic reactions,
making a shower of particles. Among the secondary particles, neutral pions decay
immediately to two energetic gamma rays, which by a combination of pair production
and bremsstrahlung produce an electromagnetic shower. Therefore the shower
phenomenon induced by hadrons is a combination of strong and electromagnetic
interactions.

9) [138, 261, 311]



Collision Length and Absorption Coefficient We define a few quantities to give a
measure of reaction probability.

Cross Section The cross section σ is defined as the probability of reaction when
on average one particle per unit area comes in and hits a target whose density is
one particle per unit volume. Then when N_in particles uniformly distributed over
an area A hit a target consisting of N_tgt particles uniformly distributed in a volume
V = AL, the number of reactions N is given by

$$ N = \sigma\,\frac{N_{\rm in}}{A}\,N_{\rm tgt} \tag{12.15} $$

The cross section has the dimension of an area and may be considered as the
target's effective area size. The standard unit is

1 barn = 1 b = 10^-24 cm^2

Also

1 mb (millibarn) = 10^-3 b
1 μb (microbarn) = 10^-6 b
1 nb (nanobarn) = 10^-9 b
1 pb (picobarn) = 10^-12 b
1 fb (femtobarn) = 10^-15 b
1 ab (attobarn) = 10^-18 b

are used.

A process where particles A and B collide and become particles C (= A) and D
(= B) is called elastic scattering. Sometimes when the particle species change
(C ≠ A, D ≠ B, for example π^- p → K^0 Λ), it is called quasielastic. If the final
state consists of three or more particles C, D, E ..., it is called inelastic
scattering. Let the fraction of (C + D) production of all the reactions be BR(CD)
(branching ratio) and the total cross section σ_TOT, then the cross section of the
reaction (A + B → C + D) is given by

$$ \sigma(A + B \to C + D) = \sigma_{\rm TOT}\,{\rm BR}(CD) \tag{12.18} $$

The angular distribution of secondary particles contains important information.
The fraction of the cross section in which the secondary particle is scattered into
solid angle dΩ at angle (θ, φ) (Fig. 12.9) is called the differential cross section
dσ/dΩ and satisfies

$$ \sigma = \int \frac{d\sigma}{d\Omega}\,d\Omega = \int \frac{d\sigma}{d\Omega}\,\sin\theta\,d\theta\,d\phi \tag{12.19} $$

Figure 12.9 When a flux of particles whose intensity is one particle per unit area per
unit time is incident on a target consisting of a single particle and dN particles per
unit time are scattered into the solid angle dΩ at (θ, φ), the differential cross
section is defined by dσ/dΩ = dN/dΩ.

When a particle moves a distance d in matter, the number of collisions it
experiences is given by Eq. (12.15) by putting N_in = 1, A = unit area, V = d:

$$ N_{\rm coll} = \sigma n d \tag{12.20} $$

where n is the number density of target particles in the matter. Therefore the
average length before a particle experiences a collision is given by

$$ \lambda = \frac{1}{\sigma n} = \frac{1}{\sigma\rho N_A} \tag{12.21} $$

where ρ is the matter density [g/cm^3] and N_A = 6.022 × 10^23 g^-1 is the Avogadro
number, so that ρN_A is the number of nucleons per cm^3. λ is referred to as the
collision length, or the absorption length when only inelastic collisions are counted.

The number of particles passing through the matter decreases as they interact.
If the length is sufficiently short, the loss can be considered to be proportional to
the length. As the probability that a particle is lost while moving a distance dx is
given by dx/λ, the number of particles lost dN out of N particles is expressed as

$$ \frac{dN}{N} = -\frac{dx}{\lambda} = -k\,dx \tag{12.22} $$

k is called the absorption coefficient. Solving this equation gives

$$ N = N_0\exp(-kx) = N_0\exp(-x/\lambda) \tag{12.23} $$

where N_0 is the initial number of particles. It is often more convenient to use not x,
the length, but the weight per unit area (ξ = ρx). Denoting the flux (the number
of particles passing unit area per unit time) as I,

$$ I = I_0\exp(-\mu\xi), \qquad \mu = \frac{1}{\lambda\rho} \tag{12.24} $$

μ is referred to as the mass absorption coefficient.

When the energy of strongly interacting particles, such as protons and pions,
exceeds ~ 10 GeV, the total cross section is ~ 25 mb and the collision length in
matter with density ~ 1 g/cm^3 is ~ 67 cm. The cross section of scattering
by electromagnetic interaction (not ionization), for instance, Thomson
scattering (elastic scattering) by protons is σ ~ (8π/3) α^2 (ħ/m_p c)^2 ~ 0.2 μb,
which gives the mean collision length ~ 100 km. The cross section of weak interactions
is, taking as an example neutrinos from the sun (E ~ 1 MeV), a ~ 10 — ^ cm 2 and 
the collision length is ~ 1 light year in matter with p ~ 1 g/cm 3 . Thus the collision 
length gives a measure to distinguish one type of interaction from another.
Examples that are often mentioned in the instrumentation, for instance in the design
of a calorimeter, are the collision and absorption lengths of iron (Fe): λ ~ 10.5 cm
and 16.8 cm, respectively.
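
The collision lengths quoted for strong and electromagnetic scattering follow directly from Eq. (12.21). The sketch below assumes, as in the text, that N_A counts nucleons per gram; the function and variable names are illustrative.

```python
import math

N_A = 6.022e23  # nucleons per gram, as used in Eq. (12.21)
MILLIBARN = 1.0e-27  # cm^2
MICROBARN = 1.0e-30  # cm^2


def collision_length_cm(sigma_cm2, rho_g_cm3):
    """Mean free path lambda = 1 / (sigma * rho * N_A), in cm."""
    return 1.0 / (sigma_cm2 * rho_g_cm3 * N_A)


def surviving_fraction(x_cm, lam_cm):
    """Fraction N/N0 = exp(-x / lambda) left after a thickness x."""
    return math.exp(-x_cm / lam_cm)


hadron = collision_length_cm(25.0 * MILLIBARN, 1.0)   # strong interaction
thomson = collision_length_cm(0.2 * MICROBARN, 1.0)   # Thomson scattering on protons

print(f"hadron:  {hadron:.1f} cm")
print(f"Thomson: {thomson / 1.0e5:.0f} km")
print(f"survival after one collision length: {surviving_fraction(hadron, hadron):.3f}")
```

The hadronic value is about 66 cm and the Thomson value about 80 km, consistent with the rounded figures of ~ 67 cm and ~ 100 km in the text.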

A good guess at the strong interaction rate is given as follows. Hadrons such as
nucleons constantly emit and reabsorb pions and are considered as an extended
object whose size is of the order of the pion Compton wavelength
(ƛ_π = ħ/m_π c ≈ 1.4 × 10^-13 cm). As a nucleus with atomic mass A can be considered
as densely packed nucleons, it is a good approximation to treat the nucleus as a
sphere with its radius approximated by r_A = A^{1/3} ƛ_π. If a hadron hits a nucleus,
it reacts immediately at the surface and the nucleus looks like a totally absorbing
black disk. The area of the disk is given by 10)

$$ \sigma_A = \pi r_A^2 = \pi\left(\frac{\hbar}{m_\pi c}\right)^2 A^{2/3} \simeq 6.2\times10^{-26}\,A^{2/3}\ \mathrm{cm^2} \tag{12.25} $$

This is called the geometrical cross section. In practice, adopting the measured
cross section of πp scattering, the empirical formula

$$ \sigma_A = 2.5\times10^{-26}\,A^{2/3}\ \mathrm{cm^2} \tag{12.26} $$

may be used as a measure.

12.4.2 
Ionization Loss 

Ionization Loss per Unit Weight (dE/dξ) The process of losing energy by ionizing
atoms while passing through matter is the most fundamental phenomenon
applied for measurement of charged particles [15]. It is important to understand
the phenomenon in detail. Electrons are bound in atoms and although they are not
at rest their motion is sufficiently slow that they can be approximated as a static
target. When a particle with charge ze passes through matter with velocity v, and if
its mass M is large compared with that (m) of electrons (M ≫ m), the recoil of the
particle by scattering electrons can be ignored and the particle may be considered
to follow a straight line track. In the following, we try to calculate the energy loss
using the quantum field theories we learned in Chap. 7 as much as possible. During
the calculation we use natural units but recover ħ and c when expressing the
final results. Let j_e^μ be the current the electrons make and j_m^ν that of the
incoming charged particle. Then the scattering amplitude is given by [see Eq. (7.3)]

$$ iT_{fi} = \int d^4x\,d^4y\,\langle f|T[\{-ie\,j_e^\mu(x)A_\mu(x)\}\{-ize\,j_m^\nu(y)A_\nu(y)\}]|i\rangle $$
$$ \phantom{iT_{fi}} = -ze^2\int d^4x\,d^4y\,\langle f|j_e^\mu(x)|i\rangle\,\langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle\,\langle f|j_m^\nu(y)|i\rangle \tag{12.27} $$

Putting the energy-momentum of the initial and final state p_i = (m, 0),
p_f = (E_f, p_f) = (E, p), we find the matrix element of the electron current becomes

$$ \langle f|j_e^\mu(x)|i\rangle = \bar u(p_f)\gamma^\mu u(p_i)\,e^{-i(p_i-p_f)\cdot x} \tag{12.28} $$

⟨0|T[A_μ(x)A_ν(y)]|0⟩ is the Feynman propagator given by Eq. (6.115):

$$ \langle 0|T[A_\mu(x)A_\nu(y)]|0\rangle = -i g_{\mu\nu}\int\frac{d^4q}{(2\pi)^4}\,\frac{e^{-iq\cdot(x-y)}}{q^2+i\varepsilon} \tag{12.29} $$

If we neglect the recoil and energy loss of the charged particle, it can be treated
classically. Assuming the particle follows a straight track along the z-axis, we have

$$ j_m^\nu(y) = \int_{-\infty}^{\infty} d\tau\,u^\nu\,\delta^4(y^\mu - u^\mu\tau) = v^\nu\,\delta(y^1)\,\delta(y^2)\,\delta(y^3 - vt), \qquad v^\nu = (1, 0, 0, v) \tag{12.30} $$

where u^μ = (γ, γv) is the particle's four-velocity and τ (= t/γ) is the proper time.
Integration of x gives (2π)^4 δ^4(q + p_i - p_f), and further integration of q makes it
(2π)^4. Integration of y gives the Fourier transform of j_m^ν:

10) Quantum mechanically, the total cross section of a black disk is 2πR^2, taking
into account shadow scattering.



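$$ \int d^4y\,j_m^\nu(y)\,e^{iq\cdot y} = \frac{v^\nu}{v}\int_{-L/2}^{L/2} dz\,e^{i(\omega/v - p_z)z} = \frac{v^\nu}{v}\,2\pi\delta\!\left(p_z - \frac{T}{v}\right) $$
$$ q^\mu = (\omega, \boldsymbol q) = p_f - p_i = (E - m, \boldsymbol p) = (T, \boldsymbol p) \tag{12.31} $$

where we have explicitly shown the integration limits of z to remind us that the
length the particle travels is infinite microscopically but finite macroscopically.
Inserting these equations in Eq. (12.27), we obtain

$$ T_{fi} = \frac{ze^2}{v\,q^2}\,2\pi\delta\!\left(p_z - \frac{T}{v}\right) v_\mu\,\bar u(p_f)\gamma^\mu u(p_i) \tag{12.32} $$

The scattering probability is given by squaring T_fi, multiplying by the final state
density, dividing by the initial state density 2E_i and taking the average and sum of
the electron spin state:

$$ dw_{fi} = \frac{1}{2E_i}\left[\frac12\sum_{\rm spin}|T_{fi}|^2\right]\frac{d^3p_f}{(2\pi)^3 2E_f} = \frac{1}{2m}\,\frac{z^2e^4}{v^2q^4}\,v_\mu v_\nu L^{\mu\nu}\,2\pi L\,\delta\!\left(p_z - \frac{T}{v}\right)\frac{dp_z\,d^2p_T}{(2\pi)^3 2E_f} \tag{12.33} $$

Here we have used |2πδ(p_z - T/v)|^2 = 2πδ(p_z - T/v) ∫dz = 2πLδ(p_z - T/v).
L^{μν} is given by Eq. (6.143). Using

$$ L^{\mu\nu} = \frac12\sum_{r,s}\left[\bar u_s(p_f)\gamma^\mu u_r(p_i)\right]\left[\bar u_s(p_f)\gamma^\nu u_r(p_i)\right]^*, \qquad v_\mu p_f^\mu = E_f - v\,p_{fz} \tag{12.34} $$

v_μ v_ν L^{μν} is expressed as

$$ v_\mu v_\nu L^{\mu\nu} = 2\left[2m^2 + \frac{q^2}{2\gamma^2}\right] = 4m^2\left(1 + \frac{q^2}{4m^2\gamma^2}\right) \tag{12.35} $$

Inserting Eq. (12.35) in Eq. (12.33), the ionization probability is expressed as

$$ dw_{fi} = \frac{4\pi z^2\alpha^2 L}{v^2}\,\frac{m}{E_f}\left[\frac{1}{q^4} + \frac{1}{4m^2\gamma^2}\,\frac{1}{q^2}\right] dp_T^2 \tag{12.36} $$

The energy loss per unit length is obtained by multiplying Eq. (12.36) by the energy
T given to the electron and its number density n_e, dividing by L and integrating:

$$ \frac{dE}{dx} = \frac{4\pi z^2\alpha^2 n_e}{v^2}\int \frac{m}{E_f}\,T\left[\frac{1}{q^4} + \frac{1}{4m^2\gamma^2}\,\frac{1}{q^2}\right] dp_T^2 \tag{12.37} $$

The momentum transfer can be expressed in terms of p_T and T using p_z = T/v:

$$ -q^2 = |\boldsymbol q|^2 - \omega^2 = p_T^2 + \left(\frac{T}{v}\right)^2 - T^2 = p_T^2 + \left(\frac{T}{\beta\gamma}\right)^2 \tag{12.38a} $$

On the other hand

$$ -q^2 = -(E_f - E_i)^2 + (\boldsymbol p_f - \boldsymbol p_i)^2 = -(E_f - m)^2 + \boldsymbol p_f^2 = 2m(E_f - m) = 2mT \tag{12.38b} $$

which is an exact relation with the mass of the charged particle taken into account.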




As the energy loss T given to the electron is mostly in the nonrelativistic region, we
may approximate

$$ E_f = m + T \simeq m \tag{12.39a} $$
$$ Q^2 \equiv -q^2 \simeq p_T^2 \tag{12.39b} $$

Then Eq. (12.37) can be approximated by

$$ \frac{dE}{dx} \simeq \frac{2\pi z^2\alpha^2 n_e}{m v^2}\int_{Q^2_{\min}}^{Q^2_{\max}}\left[\frac{1}{Q^2} - \frac{1}{4m^2\gamma^2}\right] dQ^2 \tag{12.40} $$

The maximum value Q^2_max = 2mT_max occurs when the charged particle hits
head on and scatters the electron in the forward direction. The value of T is given
by

$$ T = \frac{2m\beta^2\gamma^2\cos^2\theta}{(m/M + \gamma)^2 - \beta^2\gamma^2\cos^2\theta} \tag{12.41} $$

where θ is the electron scattering angle and M the mass of the charged particle.
From this T_max ≈ 2mβ^2γ^2. Q^2_min is (T_min/βγ)^2 from Eq. (12.38a). T must be at
least larger than the ionization energy I of the atom to be ionized, which means
Q^2_min = (I/βγ)^2. Thus the expression for dE/dx is

which is referred to as the Bethe-Bloch formula [137].

For practical applications, it is more convenient to give the ionization loss per
unit weight (ξ = ρx). Inserting ħ and c to recover the right dimensions and taking
into account the density effect (δ/2, which will be described soon), the ionization
loss per g/cm^2 according to the Particle Data Group [311] is given by

$$ \frac{dE}{d\xi} = K z^2\,\frac{Z}{A}\,\frac{1}{\beta^2}\left[\frac12\ln\frac{2m_ec^2\beta^2\gamma^2 T_{\max}}{I^2} - \beta^2 - \frac{\delta}{2}\right] \tag{12.43} $$

dξ = ρ dx, ρ: density of matter
K = 4π N_A r_e^2 m_e c^2 = 0.3071 MeV cm^2/g
N_A = 6.022 × 10^23 g^-1 = Avogadro number
r_e = e^2/(4π ε_0 m_e c^2) ≈ 2.82 × 10^-13 cm = classical electron radius
Z, A = atomic number and atomic mass of matter
I ≈ 16 Z^0.9 eV = average ionization potential of matter
δ = density effect

From the formula, some characteristics of the ionization loss can be read off. The
ionization loss per unit weight dE/dξ is

1. proportional to Z/A and otherwise nearly independent of the species of matter
(weak dependence on ln I),

2. a function of the velocity of the incoming particle.

3. At low βγ < 1, the curve has 1/β^2 dependence, while at large βγ ≫ 1 it has a
logarithmic rise (~ ln γ^2).
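
The three characteristics can be seen numerically. The following sketch evaluates the Particle Data Group form of the Bethe-Bloch formula for muons in copper with the density effect switched off (δ = 0); this simplification, the constants typed in below and the function names are assumptions made for illustration, so the values are slightly above tabulated ones at high βγ.

```python
import math

K = 0.3071        # MeV cm^2 / g
M_E = 0.511       # electron mass in MeV
M_MU = 105.658    # muon mass in MeV


def t_max(beta_gamma, mass_mev):
    """Maximum energy transfer to a free electron (theta = 0 in Eq. 12.41)."""
    gamma = math.sqrt(1.0 + beta_gamma**2)
    r = M_E / mass_mev
    return 2.0 * M_E * beta_gamma**2 / (1.0 + 2.0 * gamma * r + r * r)


def bethe_bloch(beta_gamma, z, Z, A, mass_mev):
    """dE/dxi in MeV cm^2/g, without the density-effect term."""
    beta2 = beta_gamma**2 / (1.0 + beta_gamma**2)
    ion_pot = 16.0 * Z**0.9 * 1.0e-6  # I ~ 16 Z^0.9 eV, converted to MeV
    arg = 2.0 * M_E * beta_gamma**2 * t_max(beta_gamma, mass_mev) / ion_pot**2
    return K * z * z * (Z / A) / beta2 * (0.5 * math.log(arg) - beta2)


for bg in (0.3, 1.0, 3.5, 10.0, 100.0, 1000.0):
    loss = bethe_bloch(bg, 1, 29, 63.546, M_MU)
    print(f"beta*gamma = {bg:7.1f}  dE/dxi = {loss:6.2f} MeV cm^2/g")
```

The output falls steeply as 1/β^2 at low βγ, passes through a broad minimum near βγ ≈ 3 to 4, and then rises only logarithmically.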

Figure 12.10 shows the energy loss of the positively charged muon in copper (Cu).
The Bethe-Bloch formula is relevant in the region βγ = 0.1 to 1000. At higher
energies, radiative losses due to bremsstrahlung and pair creation become important.
For hadrons (π, p, etc.), strong interactions become dominant at high energies
before the radiative processes come in. As can be seen in Fig. 12.10, the ionization
rises slowly (~ ln γ) and for a wide range of βγ the ionization loss is more or
less constant. This is referred to in the literature as minimum ionization loss. It is
useful to remember the fact that



Minimum Ionization Loss: The energy loss per gram (more precisely, per g/cm^2) of
relativistic particles is about 1.8 MeV and barely depends on the energy.



The ionization loss is a statistical phenomenon. The measured data fluctuate
around the value given by Eq. (12.43). The distribution is Gaussian, but with thin
materials the statistical occurrence of ionization just once or twice with a big kick
to the electron is sizeable, producing an asymmetric distribution with a long tail
on the high-energy side, which is called the Landau distribution (Fig. 12.11a).




Figure 12.10 dE/dx for positive muons in copper as a function of βγ = p/Mc. Figure
taken from [311].

Figure 12.11 (a) The Landau distribution, which is a typical energy loss in a thin
absorber. Note that it is asymmetric with a long high-energy tail. Figure from [311].
(b) dE/dξ curves as a function of momentum p = Mβγ. As dE/dξ is a function of βγ
only, the curves separate depending on the particle mass.

In practical applications, information on momentum (p = Mβγ) rather than
velocity is easily obtained. Then the ionization curve is different for different
masses (Fig. 12.11b). Thus combined knowledge of p and dE/dξ can be used to identify
particle species.



Density Effect The logarithmic rise ln γ^2 in the ionization loss is due to the
relativistic effect of contraction of the distance from the passing particle to the
atoms being ionized (or transverse electric field E_⊥ ∝ γ). However, if many atoms
exist in this region, the screening effect due to dielectric polarization attenuates
the effect, which is referred to as the density effect.

Let us consider how the ionization formula is affected. In matter, the photon
satisfies the dispersion relation

$$ \omega^2 = \frac{c^2|\boldsymbol q|^2}{\varepsilon} = \frac{c^2|\boldsymbol q|^2}{n^2} \tag{12.44} $$

where ω is the wave frequency, q the wave number, ε (normalized as ε_0 = 1) the
dielectric constant and n the refractive index. Denoting the phase velocity of light
in matter by c_m (= c/√ε), the transverse wave number by q_T and using q_z = ω/v,

$$ q_T^2 = |\boldsymbol q|^2 - q_z^2 = \frac{\omega^2}{c_m^2} - \frac{\omega^2}{v^2} \tag{12.45} $$

For ε > 1, a real photon (Cherenkov light) is emitted if v > c_m (described later in
Sect. 12.4.4).

Figure 12.12 Dielectric constant (ε = ε_1 + iε_2) of argon gas at 1 atmosphere. (a)
ε_2 is displayed as photon absorption length in meters (∝ 1/ε_2). (b) ε_1 - 1.
Abscissa is photon energy in eV. In the optical region, matter is transparent with
ε_2 = 0, ε_1 > 1. This is the region in which Cherenkov light can be emitted. The
absorption region has an effect on the ionization loss. The transition radiation
occurs in the X-ray region. Figure from [14].

Figure 12.12 shows argon's dielectric constant as a function of photon energy.
The region of interest is the absorption region (10 eV < ħω < 10 keV), where the
dielectric constant is expressed as

$$ \varepsilon = 1 - \frac{\omega_p^2}{\omega^2}, \qquad \hbar\omega_p = \hbar\sqrt{4\pi n_e r_e c^2} \simeq 28.8\,\sqrt{\rho\,[\mathrm{g/cm^3}]\,Z/A}\ \ \mathrm{eV} \tag{12.46} $$

where n_e is the electron number density and ω_p is the plasma frequency. In the
absorption region, the light is partially absorbed and the dielectric constant is
complex. In this region, q_T is imaginary and the transverse electric field decreases
exponentially. Writing q_T = iκ,

$$ \kappa^2 = \frac{\omega^2}{v^2} - \frac{\omega^2}{c_m^2} = \left(\frac{\omega}{c\beta\gamma}\right)^2 + \left(\frac{\omega_p}{c}\right)^2 \tag{12.47} $$

where c_m is the velocity of light in matter. The above formula tells us that when
saturation due to the density is effective, Q_min is not given by I/(cβγ) but by
ħω_p/c. Formally, we only need to replace e by e/n and c by c_m = c/n in the formula
to give the ionization loss in matter. As a result of this the second term in the last
equality of Eq. (12.38a) is replaced by κ^2. In the language of quantized field theory,
the photon has acquired mass due to the polarization effect of matter. From the
relation (ω/c)^2 = q^2/n^2 ≈ q^2(1 - δn^2) = q^2 + (ω_p/c)^2, the acquired mass is
given by ħω_p/c^2.

Let us estimate at what value of βγ the density effect becomes sizeable. The
maximum transverse distance y_max at which the charged particle can have influence
without considering the density effect is given roughly by

$$ y_{\max} \simeq \frac{\hbar}{Q_{\min}} = \frac{\hbar c\,\beta\gamma}{I} \simeq \left(\frac{20\ \mathrm{eV}}{I}\right)\beta\gamma\times10^{-6}\ \mathrm{cm} \tag{12.48} $$

The size 10^-6 cm is of the order of the average intermolecular distance of gas at
one atmosphere. On the other hand, the maximum distance of the density effect at
maximum saturation is given by

$$ y_{m,\max} \simeq \frac{c}{\omega_p} = \left(\frac{20\ \mathrm{eV}}{\hbar\omega_p}\right)\times10^{-6}\ \mathrm{cm} \tag{12.49} $$

Comparing the two, the value of βγ to reach saturation is given by βγ ~ 1/√ρ,
which is ~ 100 for gas. But for a liquid or solid saturation is reached at small values
of βγ. When the density effect is sizeable, the ionization loss is smaller by δ/2:

$$ \frac{\delta}{2} \to \ln\frac{\hbar\omega_p}{I} + \ln\beta\gamma - \frac12 \tag{12.50} $$

Comparing this equation with the original formula, dE/dξ increases only as
~ ln βγ in contrast to the original ~ ln(βγ)^2 (see the curve in Fig. 12.10 with and
without the density effect).

Range When a charged particle enters a thick layer of matter, it loses all its energy
by ionization loss and stops in the matter. The distance it travels is called the flight
distance or range. Let E be the original energy of the particle and R the range, then

$$ R = \int_0^R dx = \int_{Mc^2}^{E}\frac{dE}{(dE/dx)} \tag{12.51} $$

As E/Mc^2 = γ = 1/√(1 - β^2) and dE/dx are both functions of β only, R/M is
also a function of β only. Referring to Eq. (12.43), the expression for R can be given
as

$$ R = \frac{M}{z^2}\,G(\beta) \tag{12.52} $$

Namely, the range is proportional to the mass of the incoming particle, and
inversely proportional to the charge squared. The concept of the range is useful only
when energy loss other than ionization loss can be neglected. This is the case for
muons with energy less than a few hundred GeV and low-energy hadrons (R < λ, where
λ is the nuclear interaction length, or roughly those with E - M < a few hundred
MeV). Energetic hadrons (E - M > 1 GeV) usually interact strongly with matter
before losing all their energy and induce shower phenomena by cascade interactions.
The calorimeter utilizes the shower phenomenon to measure the hadronic energy.



12.4.3 

Multiple Scattering 

In the consideration of the ionization loss, recoils the traversing particle receives
from electrons were totally neglected. However, scattering of a particle by nuclei is
nonnegligible because they are heavy. Because the probability of Rutherford
scattering is inversely proportional to sin^4(θ/2) (see Sect. 6.3.1), most of the
scattering occurs at small angles. Therefore, when a particle traverses a thick
material, the effect of Rutherford scattering is a superposition of statistically
independent small-angle scatterings. Namely

$$ \langle\Theta^2\rangle = \sum_i \theta_i^2 \tag{12.53} $$

which shows that the average scattering angle increases as the particle proceeds.
Since it is a statistical phenomenon, the distribution is Gaussian. Namely, the an- 
gular distribution is given by 



Let us calculate the average scattering angle using the Mott formula given in
Sect. 6.7. Since the angle is small, the spin effect can be neglected. Using $|q|^2 =
2p^2(1-\cos\theta) \simeq p^2\theta^2$,

$$d\sigma = \frac{4Z^2\alpha^2}{v^2q^4}\,d^2q \qquad (12.55)$$

$\langle\theta^2\rangle$ can be found from

$$\int\theta^2\,d\sigma = \frac{4Z^2\alpha^2}{v^2p^2}\int\frac{d^2q}{q^2} = \frac{8\pi Z^2\alpha^2}{v^2p^2}\,\ln\frac{r_{\max}}{r_{\min}} \qquad (12.56)$$

divided by $\int d\sigma$. Here, $r_{\max} \simeq 1/q_{\min}$ is the distance at which the Coulomb field
of the nuclei is screened by electron clouds. We take $r_{\max}$ as the Thomas-Fermi
atomic radius ($r_{\rm atm} = 1.4\,a_BZ^{-1/3} = 7.40\times10^{-9}\,Z^{-1/3}$ cm), where $a_B = \hbar/(\alpha m_ec)$ is
the Bohr radius. For $r_{\min}$, we adopt the nuclear radius $r_A = A^{1/3}\hbar/(m_\pi c)$. When a
particle traverses a distance $X$ cm, it is scattered $N$ ($N = X\int d\sigma\,\rho N_A/A$) times on
average. Then recovering $\hbar$ and $c$, the average scattering angle is given by

$$\langle\Theta^2\rangle = N\langle\theta^2\rangle = X\,\frac{16\pi Z^2(\alpha\hbar c)^2\,\rho N_A/A}{v^2p^2}\,\ln\left(\frac{r_{\max}}{r_{\min}}\right)^{1/2} \equiv \frac{E_s^2}{v^2p^2}\,\frac{X}{X_0} \qquad (12.57a)$$

$$E_s = m_ec^2\sqrt{\frac{4\pi}{\alpha}} = 21.2\ \mathrm{MeV} \qquad (12.57b)$$

$$X_0^{-1} = 4\alpha r_e^2\,\frac{\rho N_A}{A}\,Z^2\ln\frac{183}{Z^{1/3}} \qquad (12.57c)$$

$E_s$ is referred to as the scale energy and $X_0$ as the radiation length. It is customary
to express the radiation length in g/cm$^2$, in which case $X_0 \to \rho X_0$ is to be
understood, and we follow the custom in the following. According to more detailed
calculation or fits to the data [138, 311], the average scattering angle is given by

$$\sqrt{\langle\Theta^2\rangle} = \frac{13.6\ \mathrm{MeV}}{\beta cp}\,z\,\sqrt{\frac{X}{X_0}}\left[1 + 0.038\ln\frac{X}{X_0}\right] \qquad (12.58a)$$

$$\frac{1}{X_0} = 4\alpha r_e^2\,\frac{N_A}{A}\,Z(Z+1)\ln\frac{287}{\sqrt{Z}}, \qquad X_0 = \frac{716.4\,A}{Z(Z+1)\ln(287/\sqrt{Z})}\ \mathrm{g/cm^2} \qquad (12.58b)$$

The accuracy of measuring the spatial position of a particle track is considerably
affected by multiple scattering, especially at low energies. In fact, in determining the
particle momentum from the curvature of a charged track in a magnet, the limit
of accuracy very often comes from the uncertainty due to multiple scattering.
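
As a rough numerical illustration, the sketch below evaluates the fitted radiation length $716.4\,A/[Z(Z+1)\ln(287/\sqrt{Z})]$ g/cm$^2$ and an rms multiple-scattering angle. The angle uses the widely quoted parametrization with a 13.6 MeV prefactor and a $0.038\ln(X/X_0)$ correction; that specific form is an assumption here, as are the function names.

```python
import math

def radiation_length(Z, A):
    """Radiation length X0 in g/cm^2 from the fitted form
    716.4 * A / [Z (Z + 1) ln(287 / sqrt(Z))]."""
    return 716.4 * A / (Z * (Z + 1) * math.log(287.0 / math.sqrt(Z)))

def theta0(p_mev, beta, x_over_X0, z=1):
    """RMS multiple-scattering angle in radians.
    Assumed form: 13.6 MeV / (beta c p) * z * sqrt(x/X0) * [1 + 0.038 ln(x/X0)],
    with p in MeV/c and the thickness given in units of X0."""
    return (13.6 / (beta * p_mev)) * z * math.sqrt(x_over_X0) * (
        1.0 + 0.038 * math.log(x_over_X0)
    )

if __name__ == "__main__":
    print("X0(Fe) [g/cm^2]:", radiation_length(26, 55.845))
    print("X0(Pb) [g/cm^2]:", radiation_length(82, 207.2))
    # 1 GeV/c singly charged particle crossing 10% of a radiation length
    print("theta0 [mrad]:", 1e3 * theta0(1000.0, 1.0, 0.1))
```

The iron and lead values come out within a few percent of the tabulated radiation lengths of 13.84 and 6.37 g/cm$^2$ quoted later in this chapter, and a 1 GeV/c particle is deflected by a few milliradians in a tenth of a radiation length.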



12.4.4 

Cherenkov and Transition Radiation 



A charged particle flying at constant speed does not radiate in vacuum, but can
emit photons in matter. Physically, this can be interpreted as dipole radiation of the
polarized matter that the moving charge has created. If the particle velocity is smaller
than the velocity of light in the matter, $v < c/n$, the polarization is symmetric and
no light is emitted because of destructive interference (Fig. 12.13a).
If $v > c/n$, the polarization is asymmetric and constructive interference results in
light emission (Cherenkov light). If the medium is not uniform, the polarization
distribution is asymmetric and light is emitted even if $v < c/n$.

Figure 12.13 ... the polarization is asymmetric and light is emitted (Cherenkov light).
If the medium is not uniform, the polarization becomes asymmetric and light is
emitted even if $v < c/n$ (transition radiation).

The transition amplitude for radiation by a particle is given by

$$iT_{fi} = \langle f|\,{-i}\int H_{\rm int}\,d^4x\,|i\rangle = -ie\int d^4x\,\langle f|A_\mu(x)\,j^\mu(x)|i\rangle \qquad (12.59)$$

$j^\mu(x)$ is a classical current created by the incident charged particle and is given by
Eq. (12.30). Treating photons as a quantized field

$$\langle f|A_\mu(x)|i\rangle = \epsilon^*_\mu(\lambda)\,e^{ik\cdot x} \qquad (12.60a)$$

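Integration over $x$ gives

$$\int d^4x\,j^\mu(x)\,e^{ik\cdot x} = v^\mu U \qquad (12.60b)$$

$$U = \int\delta(x)\,\delta(y)\,\delta(z - vt)\,e^{ik\cdot x}\,d^4x = \frac{1}{v}\int_{-L/2}^{L/2}dz\,e^{i(\omega/v - k\cos\theta)z}$$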

where $L$ is the length of material that the particle goes through. The transition
probability becomes

$$dw_{fi} = \sum_\lambda|T_{fi}|^2\,\frac{d^3k}{(2\pi)^32\omega} = e^2\sum_\lambda\left|\epsilon^*_\mu(\lambda)\,v^\mu\right|^2|U|^2\,\frac{d^3k}{(2\pi)^32\omega} \qquad (12.61)$$

The polarization sum is given by 

The above equations apply to vacuum. The electromagnetic field (photon) in matter 
can be obtained by replacing 

dV~e), e = n 2 (12.63) 

Inserting Eqs. (12.60), (12.62) and (12.63) in Eq. (12.61), the number of emitted
photons per unit frequency and solid angle $d\Omega$ is expressed as follows (recovering
physical dimensions with $\hbar$ and $c$):

$$\frac{d^2N}{d\omega\,d\Omega} = \frac{\alpha\omega n}{4\pi^2c^2}\left|\int_{-L/2}^{L/2}dz\,(\hat{k}\times\hat{v})\,e^{iz/Z_F}\right|^2 \qquad (12.64a)$$

$$Z_F = \left[\frac{\omega}{v} - nk\cos\theta\right]^{-1} \qquad (12.64b)$$

$Z_F$ is referred to as the formation zone.

Cherenkov Radiation If the medium is uniform, the integration gives a $\delta$-function
and a nonzero value only when the phase vanishes, namely

$$\cos\theta_c = \frac{1}{n\beta}, \qquad \beta = \frac{v}{c} \qquad (12.65)$$



Therefore, only when the particle velocity exceeds that of light in the matter does
the traveling particle emit photons, which are called Cherenkov light. Carrying out
the integration, using $|2\pi\delta(\cdots)|^2 = 2\pi\delta(\cdots)\int dz = 2\pi\delta(\cdots)L$ and dividing by $L$,
we find the number of emitted photons per unit length and per unit energy interval
($dE = d\hbar\omega$) is

$$\frac{d^2N}{dE\,dx} = \frac{\alpha}{\hbar c}\left(1 - \frac{1}{n^2\beta^2}\right) = \frac{\alpha}{\hbar c}\sin^2\theta_c \simeq 370\,\sin^2\theta_c\ \mathrm{eV^{-1}\,cm^{-1}} \qquad (12.66)$$

Note that the index of refraction is a function of the photon energy ($E = \hbar\omega$).
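
The threshold condition and the yield of Eq. (12.66) can be tried out numerically. The following is a minimal sketch that treats the index of refraction as constant over the photon energy window, which is an assumption made only to keep the example short; the function names are illustrative.

```python
import math

ALPHA_OVER_HBARC = 370.0  # photons per eV per cm, the prefactor of Eq. (12.66)

def cherenkov_angle(beta, n):
    """Cherenkov angle in radians from cos(theta_c) = 1 / (n * beta).
    Returns None at or below threshold (beta <= 1/n)."""
    if beta * n <= 1.0:
        return None
    return math.acos(1.0 / (beta * n))

def photons_per_cm(beta, n, e_min_ev, e_max_ev):
    """Number of Cherenkov photons per cm emitted with energies between
    e_min_ev and e_max_ev, assuming n is constant over that window."""
    theta_c = cherenkov_angle(beta, n)
    if theta_c is None:
        return 0.0
    return ALPHA_OVER_HBARC * math.sin(theta_c) ** 2 * (e_max_ev - e_min_ev)

if __name__ == "__main__":
    # beta = 1 particle in water (n = 1.33), photons between 2 and 3 eV
    print(math.degrees(cherenkov_angle(1.0, 1.33)), "degrees")
    print(photons_per_cm(1.0, 1.33, 2.0, 3.0), "photons/cm")
```

For a fully relativistic particle in water this gives an angle of about 41 degrees and roughly 160 photons per cm in a 1 eV window, while a particle with $\beta = 0.7$ is below threshold and emits nothing.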



Transition Radiation If the velocity of the charged particle does not exceed the velocity
of light in the material, the phase of Eq. (12.64) does not vanish and no radiation
is emitted in a uniform medium. However, if there exists nonuniformity in
the medium, for instance, if there is a thin dielectric foil in vacuum with thickness
$L$ at position $-L/2 \le z \le L/2$ (see Fig. 12.14), it gives a finite contribution to the
integral and the photon is emitted. This is called transition radiation. As is clear
from the above arguments, transition radiation and Cherenkov radiation have a
common origin.

For this configuration, the number of photons per unit energy interval and per 
unit solid angle is obtained from Eq. (12.64) by integration. The integral is of the 
form 

$$\int_{-L/2}^{L/2}f(n_1) + \left(\int_{-\infty}^{-L/2} + \int_{L/2}^{\infty}\right)f(n_2) = \int_{-L/2}^{L/2}\{f(n_1) - f(n_2)\} + \int_{-\infty}^{\infty}f(n_2) \qquad (12.67)$$




Figure 12.14 Emis...

where $n_2 = 1$ is the index of refraction in vacuum. The second term gives a $\delta$-function
and vanishes in vacuum. Therefore,

$$\frac{d^2N}{dE\,d\Omega} = \frac{\alpha\omega}{4\pi^2c^2}\,n\sin^2\theta\left|\frac{2\sin\{(L/2)(\omega/v - n_1k\cos\theta)\}}{\omega/v - n_1k\cos\theta} - (n_1 \to n_2)\right|^2$$

$$\phantom{\frac{d^2N}{dE\,d\Omega}} = \frac{\alpha}{\pi^2\omega}\,n\sin^2\theta\left|\frac{\sin[(\omega L/2c)(1/\beta - n_1\cos\theta)]}{1/\beta - n_1\cos\theta} - (n_1 \to n_2)\right|^2 \qquad (12.68)$$

In the region $|z| \gg Z_F$, the exponential function oscillates violently and does not
contribute, but the integral over the formation zone ($|z| < Z_F$) can give a finite
contribution. This means the foil thickness $L$ must be larger than the size of the
formation zone, which in practical situations described later is tens of micrometers.
There are no constraints on the index of refraction $n$ and transition radiation
occurs for a wide range of $\omega$. For $\omega$ in the optical region (see Fig. 12.12), it gives
rise to optical transition radiation at wide angles at modest velocities, that is for $\beta$
below the Cherenkov threshold. However, we are interested in applications for very
relativistic particles with, say, $\gamma > 1000$. In the X-ray region, the index of refraction
is smaller than and very close to 1. It can be written as $n = \sqrt{\varepsilon} \simeq 1 - \omega_p^2/2\omega^2$ [see
Eq. (12.46)]. The intensity of emitted photons is high only where the denominator
in Eq. (12.68) nearly vanishes, which occurs at very small angles of $\theta \sim 1/\gamma$. Then
using $\beta \simeq 1 - 1/2\gamma^2$, the denominator can be approximated by



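$$\frac{1}{\beta} - n_1\cos\theta \simeq \frac{1}{2}\left(\frac{1}{\gamma^2} + \frac{\omega_p^2}{\omega^2} + \theta^2\right) \qquad (12.69)$$

where we have put $n_1 = 1 - \omega_p^2/2\omega^2$ and $n_2 = 1$. The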



The two numerators taken approximately equal were factored out and we have used

$$n\sin^2\theta \simeq (n\sin\theta)^2 = \sin^2\phi \simeq \phi^2 \qquad (12.71)$$

where $\phi$ is the emission angle in vacuum (see Fig. 12.14). Equation (12.70) represents
the interference between the two equal and opposite amplitudes with relative
phase given by the argument of the $\sin^2$ factor. The interference can be important,
but if there is no interference, the flux is

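$$\frac{d^2N}{dE\,d\Omega} \simeq \frac{\alpha}{\pi^2\omega}\,\phi^2\left[\frac{1}{1/\gamma^2 + \omega_p^2/\omega^2 + \phi^2} - \frac{1}{1/\gamma^2 + \phi^2}\right]^2 \qquad (12.72)$$

Integration over solid angle $d\Omega = \pi\,d\phi^2$ gives

$$\frac{dN}{dE} = \frac{\alpha}{\pi\omega}\left[\left\{1 + 2\left(\frac{\omega}{\gamma\omega_p}\right)^2\right\}\ln\left\{1 + \left(\frac{\gamma\omega_p}{\omega}\right)^2\right\} - 2\right] \simeq \frac{2\alpha}{\pi\omega}\ln\frac{\gamma\omega_p}{\omega}, \qquad \omega/\gamma\omega_p \ll 1 \qquad (12.73)$$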

The spectrum is logarithmically divergent at low energies and decreases rapidly
for $\omega > \gamma\omega_p$. Integration of Eq. (12.73), however, is finite and gives a total energy
flux $\alpha\gamma\hbar\omega_p/3$ and a typical photon energy of order $\gamma\hbar\omega_p/4$ [216, 311], where
$\hbar\omega_p \approx 20$ eV. Therefore the radiated photons are in the soft X-ray range 2-20 keV.
Since the emitted energy is proportional to $\gamma$, the transition radiation is inherently
suitable for measurements of very energetic particles. However, only $\sim\alpha$ photons
are emitted with one thin foil. The yield can be increased by using a stack of foils
with gaps between them.
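
To get a feeling for the numbers, the two estimates quoted above (total radiated energy $\alpha\gamma\hbar\omega_p/3$ and typical photon energy $\gamma\hbar\omega_p/4$) can be evaluated directly. This is only a sketch: the function names are illustrative, and the default $\hbar\omega_p = 20$ eV is the representative value given in the text.

```python
ALPHA = 1.0 / 137.036  # fine-structure constant

def tr_typical_photon_energy_ev(gamma, hbar_omega_p_ev=20.0):
    """Typical transition-radiation photon energy, gamma * hbar*omega_p / 4, in eV."""
    return gamma * hbar_omega_p_ev / 4.0

def tr_radiated_energy_ev(gamma, hbar_omega_p_ev=20.0):
    """Total radiated energy, alpha * gamma * hbar*omega_p / 3, in eV."""
    return ALPHA * gamma * hbar_omega_p_ev / 3.0

if __name__ == "__main__":
    for gamma in (1000.0, 2000.0, 4000.0):
        e_typ = tr_typical_photon_energy_ev(gamma)
        e_tot = tr_radiated_energy_ev(gamma)
        # Ratio of the two estimates: a rough photon count, independent of gamma
        print(gamma, e_typ / 1e3, "keV typical,", e_tot, "eV total,",
              e_tot / e_typ, "photons")
```

For $\gamma = 2000$ the typical photon energy is 10 keV and the radiated energy is about 100 eV, so the mean number of photons is of order $\alpha$, which is why practical radiators stack many foils.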

12.4.5 

Interactions of Electrons and Photons with Matter 

Bremsstrahlung The interactions of electrons and photons differ considerably
from those of other particles and need special treatment. The reason is as follows.
An energetic charged particle loses its energy by bremsstrahlung under the
influence of the strong electric field made by the nuclei. According to classical
electrodynamics, the loss is proportional to $(2/3)\alpha|\ddot{r}|^2$, where $\ddot{r}$ is the acceleration
the particle receives. Since $|\ddot{r}| \propto Z/m$, the loss rate is proportional to $(Z/m)^2$,
which is large for light particles, especially electrons.

The bremsstrahlung cross section is given by the Bethe-Heitler formula 
Eq. (7.71). The cross section for emitting a photon per photon energy interval 
is obtained by integrating over the electron final state and photon solid angle. 
The calculation is tedious. Besides, target nuclei are not isolated but surround- 
ed by electrons, which effectively screen the nuclear electric field. Therefore, we 
only quote a result in the relativistic limit with the screening effect taken into 
account [61, 203]: 

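$$d\sigma = 4Z^2\alpha r_e^2\,\frac{E_f}{E_i}\,\frac{d\omega}{\omega}\left[\left(\frac{E_i^2 + E_f^2}{E_iE_f} - \frac{2}{3}\right)\ln\frac{183}{Z^{1/3}} + \frac{1}{9}\right] \qquad (12.74)$$

where $E_i$, $E_f = E_i - \omega$, $\omega$ are the energies of the initial and final electron and photon,
respectively, $r_e = \alpha\hbar/(m_ec)$ being the classical electron radius. If we neglect
the second term in the square brackets of Eq. (12.74), which is on the order of a
few percent, the equation can be rewritten as

$$\frac{d\sigma}{d\omega} = \frac{A}{N_AX_0}\,\frac{1}{\omega}\left(\frac{4}{3} - \frac{4}{3}y + y^2\right) \qquad (12.75a)$$

$$y = \frac{\omega}{E_i} \qquad (12.75b)$$

where the radiation length $X_0$ is given by Eq. (12.57c).

Table 12.1 Radiation length and critical energy

H2 (liquid)   69.3   382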

A characteristic of bremsstrahlung is that the number of emitted photons is proportional
to $1/\omega$ and hence the energy spectrum is nearly flat. The energy loss is
obtained by

$$\frac{dE}{dx} = -n_A\int_0^{\omega_{\max}}\omega\,\frac{d\sigma}{d\omega}\,d\omega \qquad (12.76)$$

where $n_A$ is the number density of atoms. The result is an easy-to-remember formula

$$\frac{dE}{dx} = -\frac{E}{X_0} \quad\to\quad E = E_0\,e^{-x/X_0} \qquad (12.77)$$

In fact, $X_0$ is defined through Eq. (12.77) as the length over which the electron energy
falls to $1/e$ of its initial value by bremsstrahlung. Figure 12.15 shows Eq. (12.77) is a good
approximation for energies higher than a few hundred MeV. As $1/X_0 \propto Z^2$,
$(dE/dx)_{\rm brems} \propto Z^2E$ and $(dE/dx)_{\rm ionization} \propto Z$,

$$\frac{(dE/dx)_{\rm brems}}{(dE/dx)_{\rm ionization}} \simeq \frac{E\,[\mathrm{MeV}]}{800/(Z + 1.2)} \equiv \frac{E}{E_c} \qquad (12.78)$$

$E_c = 800\ [\mathrm{MeV}]/(Z + 1.2)$ is called the critical energy. 12) At $E > E_c$ the energy
loss due to bremsstrahlung is larger than that due to ionization and at $E < E_c$
the converse is true. The critical energy is an important parameter in designing a
calorimeter. Table 12.1 lists radiation lengths and critical energies of a few examples.
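
The exponential energy loss of Eq. (12.77) and the critical energy $E_c = 800\ \mathrm{MeV}/(Z + 1.2)$ are simple enough to evaluate in a few lines. The sketch below uses illustrative function names and measures depth in units of the radiation length.

```python
import math

def critical_energy_mev(Z):
    """Critical energy E_c = 800 MeV / (Z + 1.2)."""
    return 800.0 / (Z + 1.2)

def energy_after(e0_mev, depth_in_X0):
    """Mean electron energy after a depth x/X0, from E = E0 exp(-x/X0).
    Only bremsstrahlung loss is included."""
    return e0_mev * math.exp(-depth_in_X0)

def brems_to_ionization_ratio(e_mev, Z):
    """Approximate ratio of bremsstrahlung to ionization loss, E / E_c."""
    return e_mev / critical_energy_mev(Z)

if __name__ == "__main__":
    for name, Z in (("C", 6), ("Fe", 26), ("Pb", 82)):
        print(name, "E_c [MeV] =", critical_energy_mev(Z))
    print("1 GeV electron after 1 X0:", energy_after(1000.0, 1.0), "MeV")
    print("brems/ionization at 100 MeV in Pb:", brems_to_ionization_ratio(100.0, 82))
```

In lead the critical energy is just under 10 MeV, so a 100 MeV electron loses about ten times more energy by radiation than by ionization, whereas in carbon the two are comparable at about 110 MeV.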

12) There are other definitions of $E_c$ and refined approximations. See [311].

Figure 12.15 Radial ... of the electron. Figure taken from [311].



Photon Absorption Photons lose energy through three processes: the photoelectric
effect, Compton scattering and pair creation. For $E_\gamma \gg$ a few MeV, pair creation is
the dominant process.

Pair creation is closely related to bremsstrahlung. It can be considered as
bremsstrahlung by an electron with negative energy, in which the electron itself
is transferred to a positive energy state. The cross section is almost identical to
that of bremsstrahlung and hence shares many common features. As for electrons,
the photon intensity (energy $\times$ number per unit area per unit time) as it passes
through matter is approximated by



$$dI = -\mu I\,dx \quad\to\quad I = I_0\,e^{-\mu x} \qquad (12.79)$$

$\mu$ is called the mass absorption coefficient if $x$ is expressed in g/cm$^2$. The (mass)
attenuation length $\lambda = 1/\mu$ of the photon for various materials is given in Fig. 12.16.
At high energies ($E_\gamma > 100$ MeV), they become constant and Eq. (12.79) is a good
approximation in the high-energy region. The attenuation length is conceptually
analogous to the radiation length. For $E_\gamma > 1$ GeV

$$\lambda = \frac{9}{7}X_0 \qquad (12.80)$$

holds to within a few percent.



Cascade Shower 

Longitudinal Profile A very high energy electron loses its energy by bremsstrahlung
but the radiated photon produces a pair of electrons, which produce photons again.
Thus a great number of electrons and photons are created in a cascade by an energetic
electron or photon. The multiplication continues until the electron energy
reaches the critical energy, after which the ionization effect takes over. If the material
length is measured in units of radiation length and the ionization and Compton
effect are neglected (Rossi's approximation A [336]), the shower profile is independent
of material. If only the Compton effect is neglected (approximation B) and if
the energy is measured in units of critical energy, the result is also independent of
material.

Figure 12.16 Photon attenuation length. Figure taken from [311].

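A Simple Model Basic characteristics of the cascade shower can be understood by
considering a very simple model [203]. Let us assume that the electron as well as
the $\gamma$-ray lose half their energy as they pass a radiation length $X_0$. The electron
emits a $\gamma$-ray and the $\gamma$-ray creates a positron-electron pair. Then at the depth of
$2X_0$, the incident electron that had $E_0$ is converted to two electrons, one positron
and one photon, each having energy $E_0/4$. Proceeding this way, at the depth $t$ in
units of $X_0$, the numbers of electrons and photons are given by

$$N_e \simeq \frac{2}{3}\,2^t = \frac{2}{3}\exp(t\ln 2) \qquad (12.81a)$$

$$N_\gamma \simeq \frac{1}{3}\,2^t = \frac{1}{3}\exp(t\ln 2) \qquad (12.81b)$$

and the energy of each electron is given by $E = E_0/2^t$. Assuming the shower stops
at $E = E_c$, then $E_c = E_0/2^{t_{\max}}$. Therefore

$$t_{\max} = \frac{\ln(E_0/E_c)}{\ln 2} \qquad (12.82a)$$

$$N_{\max} = \exp(t_{\max}\ln 2) = \frac{E_0}{E_c} \qquad (12.82b)$$

$$N(>E) = \int_0^{t(E)}dt\,N = \frac{2}{3}\int_0^{t(E)}dt\,\exp(t\ln 2) \simeq \frac{2}{3}\,\frac{e^{t(E)\ln 2}}{\ln 2} = \frac{2}{3\ln 2}\,\frac{E_0}{E} \qquad (12.82c)$$

$$\therefore\quad \frac{dN}{dE} = \frac{2}{3\ln 2}\,\frac{E_0}{E^2} \qquad (12.82d)$$

As there are two electrons for each photon, the length per track is $(2/3)X_0$, which
gives the total length traveled by the charged particles

$$L = \frac{2}{3}X_0\,N(>E_c) = \left(\frac{2}{3}\right)^2\frac{X_0}{\ln 2}\,\frac{E_0}{E_c} = \frac{4}{9\ln 2}\,\frac{E_0}{E_c}\,X_0 \qquad (12.83)$$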

As the ionization loss is approximately constant regardless of the particle's energy,
the total energy of the incident particle transferred to the cascade shower is
given by $(dE/dx) \times L \propto E_0$. The depth at shower maximum is $t_{\max} \propto \ln(E_0/E_c)$.
This means the length of the shower increases logarithmically with the energy. The
shower stops abruptly at the shower maximum owing to the simplicity of the model,
whereas the real shower decreases exponentially. However, basic characteristics
of the model are confirmed qualitatively by both Monte Carlo calculations and data.
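
The simple doubling model lends itself to a short numerical sketch. The code below counts electrons (including positrons) and photons generation by generation under the rule that each electron radiates one photon and each photon converts into a pair per radiation length, and it evaluates the depth of the shower maximum. The function names and the example energies are illustrative assumptions.

```python
import math

def generation_counts(t):
    """Numbers (n_electrons, n_photons) after t radiation lengths, starting
    from one electron. Per step: e -> e + gamma, gamma -> e+ e-."""
    n_e, n_gamma = 1, 0
    for _ in range(t):
        n_e, n_gamma = n_e + 2 * n_gamma, n_e
    return n_e, n_gamma

def shower_max_depth(e0, e_c):
    """Depth of shower maximum in units of X0, t_max = ln(E0/Ec) / ln 2."""
    return math.log(e0 / e_c) / math.log(2.0)

def particles_at_max(e0, e_c):
    """Number of particles at shower maximum, 2**t_max = E0 / Ec."""
    return e0 / e_c

if __name__ == "__main__":
    for t in range(1, 9):
        n_e, n_g = generation_counts(t)
        print(t, n_e, n_g, n_e / n_g)   # the ratio tends to 2
    # 30 GeV electron, critical energy of about 10 MeV (roughly lead)
    print(shower_max_depth(30000.0, 10.0), particles_at_max(30000.0, 10.0))
```

The total number of particles is $2^t$ at every step, and the electron-to-photon ratio approaches 2 after a few generations, which is the origin of the factors 2/3 and 1/3 in the model. A thousandfold increase in energy adds only about ten radiation lengths to the depth of the maximum.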

Analytical formulae are useful for understanding the characteristics of the shower,
but with the advent of high speed computers Monte Carlo programs have come
in handy for simulating showers. Figure 12.17 shows a longitudinal profile of the
shower produced by a 30 GeV electron. The longitudinal shape is well reproduced
by a gamma function of the form

$$\frac{dE}{dt} = E_0\,b\,\frac{(bt)^{a-1}e^{-bt}}{\Gamma(a)} \qquad (12.84a)$$

$$t = x/X_0, \qquad t_{\max} = \frac{a - 1}{b} = \ln(E/E_c) + C_i \qquad (12.84b)$$

$$b \simeq 0.5\ ^{13)}, \qquad C_e = -0.5, \quad C_\gamma = 0.5 \qquad (12.84c)$$

where $t_{\max}$ is the thickness at shower maximum and $a$ is calculated from
Eq. (12.84b).

Figure 12.17 A profile of a 30 GeV electron shower generated by the EGS4 simulation
program. The histogram shows fractional energy deposition per radiation length and the
curve is a gamma-function fit to the distribution. Circles and squares indicate the number
of electrons and photons with total energy greater than 1.5 MeV. Figure taken from [311].

Transverse Profile So far we have ignored transverse expansion of the shower.
In actual situations, although both bremsstrahlung and pair creation are well characterized
by the radiation length, Coulomb scattering becomes significant below
the critical energy. A characteristic length representing the transverse expansion is
given by the Molière unit

$$R_M = \frac{E_s}{E_c}\,X_0 \simeq \frac{7A}{Z}\ [\mathrm{g/cm^2}] \qquad (12.85)$$

where $E_s$ is the scale energy defined in Eq. (12.57b). It is known that 90% of the
total energy is contained within a radius $r = R_M$ and 99% within $3.5\,R_M$. A detector
to measure the total energy of an incident particle by inducing showers is called a
calorimeter.
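
As a quick numerical check of the transverse scale, the sketch below evaluates the Molière unit in the form $R_M = X_0E_s/E_c$. Both this form and the value $E_s = 21.2$ MeV are stated here as assumptions (they are the standard definitions), and the critical energy is taken from the approximation $800\ \mathrm{MeV}/(Z + 1.2)$ used earlier in this section.

```python
E_S_MEV = 21.2  # scale energy, m_e c^2 * sqrt(4 pi / alpha) (assumed value)

def critical_energy_mev(Z):
    """Critical energy from the approximation E_c = 800 MeV / (Z + 1.2)."""
    return 800.0 / (Z + 1.2)

def moliere_radius(x0_g_cm2, Z):
    """Moliere unit R_M = X0 * E_s / E_c in g/cm^2 (assumed standard form)."""
    return x0_g_cm2 * E_S_MEV / critical_energy_mev(Z)

if __name__ == "__main__":
    # Radiation lengths in g/cm^2 as quoted in this chapter
    print("Fe:", moliere_radius(13.84, 26), "g/cm^2")
    print("Pb:", moliere_radius(6.37, 82), "g/cm^2")
```

With these inputs lead gives roughly 14 g/cm$^2$, that is a little over 1 cm of material. Tabulated values differ by some tens of percent because they use a more refined critical energy.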

LPM Effect In bremsstrahlung, both the scattering angle of electrons and the
emission angle of photons are small. This means the kick given to the electron
is almost transverse and the longitudinal momentum transfer is very small:

$$q_\parallel = p_i - p_f - \omega = \sqrt{E_i^2 - m_e^2} - \sqrt{E_f^2 - m_e^2} - \omega \simeq \frac{m_e^2\,\omega}{2E_iE_f} \simeq \frac{\omega}{2\gamma^2} \qquad (12.86)$$

Then, due to the Heisenberg uncertainty principle, the region of photon emission
is spatially extended $\sim 1/q_\parallel$. This is a reintroduction of the formation zone ($Z_F$) that
we described in the section on transition radiation. The length of the formation
zone is given by $\sim 2\gamma^2/\omega$, which is on the order of 10 $\mu$m for 100 MeV photons
emitted from 25 GeV electrons. If the condition of the electron changes before it
goes out of the zone, interference with the emitted photon occurs. For instance,
if the electron receives sizeable multiple scattering due to many scattering centers
inside the zone and if the net scattering angle becomes comparable to that of the
emitted photon, the emission is attenuated. This is referred to as the LPM
(Landau-Pomeranchuk-Migdal) effect [234, 250, 275]. When the emitted photon energy is
small (i.e., has long wavelength), the density effect (see the paragraph "Density
Effect" in Sect. 12.4.2) has to be taken into account, too. Here, due to the dielectric
effect of matter, the wave number becomes $k \to nk$, which induces a phase shift
$(n - 1)kZ_F$ and hence destructive interference.
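
The quoted size of the formation zone is easy to verify. The following sketch converts $2\gamma^2/\omega$ from natural units to metres using $\hbar c = 197.327$ MeV fm; the function name is an illustrative assumption.

```python
HBARC_MEV_FM = 197.327   # hbar * c in MeV fm
M_E_MEV = 0.511          # electron mass in MeV/c^2

def formation_length_m(e_electron_mev, photon_energy_mev):
    """Formation length 2 gamma^2 / omega, restored to SI units as
    2 gamma^2 * (hbar c) / (hbar omega), in metres."""
    gamma = e_electron_mev / M_E_MEV
    return 2.0 * gamma ** 2 * (HBARC_MEV_FM / photon_energy_mev) * 1e-15

if __name__ == "__main__":
    # 100 MeV photon emitted by a 25 GeV electron
    print(formation_length_m(25000.0, 100.0) * 1e6, "micrometres")
    # Softer photons have longer formation zones
    print(formation_length_m(25000.0, 10.0) * 1e6, "micrometres")
```

For a 100 MeV photon from a 25 GeV electron the result is about 9.4 $\mu$m, in agreement with the order of 10 $\mu$m stated above. Since the length grows as $1/\omega$, the softest photons are the first to be affected by multiple scattering inside the zone.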

13) Exact numbers depend on material. See for instance, Figure 27.19 of [311].

In order to quantify the above argument, we note that the amplitude of photon
emission is proportional to the formation zone $Z_F$. Using the approximations
$1 - \beta \simeq 1/2\gamma^2$, $1 - n \simeq \omega_p^2/2\omega^2$ and $1 - \cos\theta \simeq \theta^2/2$, Eq. (12.64b) can be written as

$$Z_F = \frac{2\gamma^2c}{\omega}\,\frac{1}{1 + \gamma^2\theta^2 + \gamma^2\omega_p^2/\omega^2} \qquad (12.87)$$

Equation (12.87) shows that the attenuation of the photon emission occurs when
the second and third terms in the denominator become nonnegligible compared to
the first term. The LPM attenuation results from the second term and the density
attenuation from the third term. Defining $\omega_{\rm LPM}$ by the condition that the multiple
scattering angle becomes equal to the emission angle

$$\gamma^2\langle\theta^2\rangle\Big|_{X = Z_F(\gamma\theta = 1)} = 1 \quad\to\quad \hbar\omega_{\rm LPM} = \frac{E_e^2}{E_{\rm LPM}} \qquad (12.88a)$$

$$E_{\rm LPM} = \frac{m_e^4c^7X_0}{\hbar E_s^2} = \frac{\alpha\,m_e^2c^3X_0}{4\pi\hbar} \qquad (12.88b)$$

From the above formula we see that the photon emission is suppressed when

$$\hbar\omega < \hbar\omega_{\rm LPM} \quad\text{or}\quad \frac{\hbar\omega}{E_e} < \frac{E_e}{E_{\rm LPM}} \qquad (12.89)$$

As $E_{\rm LPM}$ is given as 4.3 TeV in lead and 151 TeV in carbon [234], this is primarily an
extremely high energy phenomenon. The quantitative estimation of the suppression
is not easy, because the number of terms in the bracket of Eq. (12.68) increases.
Qualitatively speaking, the $\omega$ dependence of the number of emitted photons
shifts from $1/\omega$ to $1/\sqrt{\omega}$ at $\omega \ll E_e$. Figure 12.18a shows a comparison of the
energy loss per radiation length of uranium by a 25 GeV electron with the standard
Bethe-Heitler formula [26].

We may say that the ultrahigh energy electron behaves like a pion, because emis- 
sion of photons and electrons is suppressed and the effective radiation length be- 
comes larger. 

The density attenuation comes in when the phase shift of the emitted photon
becomes of the order of 1:

$$(1 - n)\,kZ_F = \frac{\gamma^2\omega_p^2}{\omega^2} = 1 \qquad (12.90)$$

i.e. $\hbar\omega \sim \gamma\hbar\omega_p \sim \gamma\,\{\rho\,[\mathrm{g/cm^3}]\}^{1/2} \times 21$ eV

The suppression effect is due to the third term in the denominator of Eq. (12.87).
For the case of electrons with 25 GeV energy ($\gamma = 5\times10^4$), the density effect comes
in at $\hbar\omega \sim 5$ MeV in lead and $\sim 1$ MeV in carbon, both of which lie below the photon
energies at which the LPM effect sets in. The LPM effect exists also in pair creation by photons.

Although the LPM effect was pointed out long ago theoretically, it has been considered
only in treating ultrahigh energy cosmic ray reactions [224, 278] and a quantitative
treatment was difficult. One experiment was carried out using an electron
beam at SLAC [26]. Figure 12.18 shows the result. Both the LPM and density effects
are observed. The rise of the spectrum at the low energy end is due to transition
radiation.

The LPM effect is sizeable only at extremely high energies. However, it may become
a problem when designing a calorimeter for the future LHC or ILC accelerators
in the tens of TeV region.

Figure 12.18 LPM and dielectric effect in bremsstrahlung [25, 26]. (a) Calculated energy
spectrum of photons emitted by 25 GeV electrons incident on ... The LPM
curve (dashed line) is compared with the Bethe-Heitler (BH) formula (solid line).
Experimental data are the emitted photon spectra from 25 GeV electrons incident on
5% $X_0$ uranium (b) and 6% $X_0$ carbon (c). The solid line is a Monte Carlo calculation with
LPM + DE (density effect) + TR (transition radiation). The dotted line (LPM + TR1) and
the dashed line (BH + TR2) use different TR formulae. The deviation of BH from a flat line ...

12.4.6 

Hadronic Shower 14)

Energetic hadrons incident on thick material also make a shower. If we scale by
nuclear absorption length instead of radiation length, arguments from the electromagnetic
shower are generally valid, too. Namely, the total length of all the tracks
is proportional to the incident energy and the size of the shower grows as $\ln E$,
etc. There are some characteristics peculiar to hadronic showers. We show typical
shower configurations of the hadronic and electromagnetic showers side by side
(Fig. 12.19) to demonstrate their similarity as well as their difference.

i) Absorption Length Usually the absorption length $\lambda$ is much longer than the radiation
length: $\lambda_{\rm Fe} = 131.9$, $\lambda_{\rm Pb} = 194$, $X_{0\rm Fe} = 13.84$, $X_{0\rm Pb} = 6.37$ (all in units
of g/cm$^2$). The hadronic shower grows slowly compared with the electromagnetic
shower. This fact can be used to separate them.

ii) e/h Compensation In nuclear reactions, a sizeable fraction of the energy disappears.
Some fraction is used to break nuclei (binding energy) and some is carried
away by neutrinos, which are the decay products of pions. Besides, slow neutrons,
which are produced in plenty at nuclear breakup, are hard to observe. Consequently,
the energy resolution of hadronic calorimeters is worse than that of electromagnetic
calorimeters. Furthermore, the secondary particles are mostly pions, of which
~1/3 are neutral and promptly decay to two photons. They produce electromagnetic
showers, which are localized relative to hadronic showers. If all the energy is
measured this is not a problem, but a practical calorimeter is usually of sampling
type and has a structure consisting of layers of absorbers with active detectors
sandwiched between them. The electromagnetic part of the shower may not be sampled
or, if it is, the measured energy is much higher because of its local nature. Both
effects enhance the statistical fluctuation, especially that of the first reaction. The
hadronic calorimeter has an intrinsic difference in the detection efficiency for the
electromagnetic and hadronic components, and the ratio of the two is dubbed the e/h
ratio. The performance can be improved by enhancing the detection efficiency for
neutrons and deliberately decreasing that for the electromagnetic component. As uranium
captures slow neutrons with subsequent photon emission, the efficiency can be improved if
it is used as the absorber material [134, 389]. However, as it is a nuclear fuel and
highly radioactive, a conventional choice is to use a material with high hydrogen
content (for example, organic scintillators) to enhance the detection efficiency for
slow neutrons and to optimize the ratio of the absorber thickness to that of the detector
material to adjust the e/h ratio at the expense of reduced energy resolution.
Balancing the responses of electron and hadron components improves linearity of
the energy response and is effective in measuring the so-called missing energy.

14) [135, 152, 3..., 346].

Figure 12.19 Monte Carlo simulation of the different development of hadronic and
electromagnetic cascades in the earth's atmosphere, induced by 100 GeV protons (a) and
photons (b). [Z (vertical) axis range: 0-30.1 km, X/Y axis range: ±5 km around shower
core.] Image by [345].



12.5 

Particle Detectors 



12.5.1 

Overview of Radioisotope Detectors 



Particle detectors derive information from light or electrical signals that are produced
by particles when they interact with material as they enter the detector. Historically
the first detector was a fluorescent board, with which Röntgen discovered
the X-ray. Rutherford used a ZnS scintillator to measure the $\alpha$-ray scattering by a
gold foil, which led to his celebrated atomic model. The measurement was a tedious
observation by the human naked eye using a microscope. Later, ionization chambers,
proportional counters and Geiger counters were introduced. Their signal is
electrical. Then, around 1940, there was a major step forward with the invention of
photomultipliers, which enabled amplification of fluorescent light combined with
fast electronics. Another breakthrough was made by G. Charpak when he invented
the multiwire proportional chamber (MWPC) in 1968, which laid the foundations
of modern detector technology.

A Wilson cloud chamber was used to discover the positron and strange particles,
emulsions to discover $\pi$ mesons, and bubble chambers to discover a great number
of resonances. They were of a type that covered the complete $4\pi$ solid angle. They
were powerful in that they could reconstruct the entire topology of reactions, including
many secondary particles, and played an important role historically. However,
because of their long response and recovery times, they became obsolete as
the need for rapid measurement and fast processing time increased.

Modern measuring devices use a combination of various detectors, including 
light-sensitive counters, gas chambers and radiation-hard solid-state detectors with 
computer processing, and achieve the functions of (1) complete-solid-angle cover- 
age, (2) the ability to reconstruct the whole topology and (3) high-speed processing. 

In the following, we first explain detectors classified according to their output sig- 
nals, then describe their usage depending on what one wants to measure. Signals 
are of two kinds, photonic and electronic. Detectors with light outputs are scintil- 
lation counters, Cherenkov counters and transition-radiation counters. Detectors 
with electric signals are semiconductor detectors and various gaseous chambers. 
Roughly speaking, light signals are faster than electric signals by at least an order 
of magnitude, but the latter excel ir 
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12.5.2 

Detectors that Use Light 



a) Scintillation Counters 

i) Organic Scintillators Fluorescent materials such as anthracene (C$_{14}$H$_{10}$), p-terphenyl
(C$_{18}$H$_{14}$), tetraphenyl butadiene (C$_{28}$H$_{22}$) and their mixtures are used as
light-emitting scintillators. They come as liquid scintillators, when dissolved in an
organic solvent such as toluene (C$_7$H$_8$), or solid plastic scintillators, when mixed
in plastic bases.

Charged particles ionize atoms in the scintillator as they pass through. Ionized
electrons excite molecules to excited energy levels and when they return to the
ground state they emit UV (ultraviolet) photons, which are generally outside the
sensitive regions (400-700 nm; 1 nm = $10^{-9}$ m) of photomultipliers. Therefore,
usually wavelength shifters (another type of fluorescent material) are mixed to generate
photons of longer wavelength (Fig. 12.20). Commonly used organic scintillators
produce light pulses with attenuation time $\sim 10^{-9}$ s and are suitable for fast



ii) Inorganic Scintillators (NaI, CsI, BiGe, BGO (bismuth germanate), etc.) Historically
NaI has been in long use but others share a common light-emitting mechanism.
A small amount of thallium is mixed in NaI (NaI-Tl); the thallium atoms become
photoactive centers, capture ionized electrons and emit light on falling from the excited
state to the ground state. One advantage of inorganic scintillators is a large light
output (> 10 times) compared to organic scintillators, which gives much better resolution
(a few percent at 1 MeV). Another advantage is their short radiation length
(2.6 cm with NaI, 1.12 cm with BGO), which is used to make a compact electromagnetic
calorimeter. Disadvantages are that they are slow (attenuation time $\sim 10^{-6}$ s)
and generally expensive.

Figure 12.20 Scintillation ladder depicting the operating mechanism of plastic scintillators.
Förster energy transfer is an energy transfer between two chromophores (part of
molecules responsible for its color) in proximity ($\sim 10^{-8}$ m) without radiation. Approximate
fluorescer concentrations and energy transfer distances for the separate subprocesses are
shown [311]. [Diagram labels: ionization excitation of base plastic; Förster energy transfer
($10^{-8}$ m); primary fluor (~1% wt/wt); $\gamma$ ($10^{-4}$ m); emit blue, ~400 nm; absorb blue
photon; photodetector.]

iii) Structure of Plastic Scintillators Light from scintillators is guided to photon
sensors (typically photomultipliers but photodiodes and other semiconductor devices
are also used) through transparent plastic light guides. Figure 12.21 shows
a few examples of the basic structure of plastic scintillators. Figure 12.21a is the
most primitive; (b) is an improved (called adiabatic) type, which has better light
collection efficiency. This was achieved by matching the areas of the light-emitting
surface and receiving surface by using bent rectangular strips. Liouville's theorem
of phase-space conservation works here. With the advent of collider detectors, a
compactified multilayer structure without dead space looking into the interaction
point has become important. Figure 12.21c uses optical fiber to achieve this at the
expense of light collection efficiency and guides light to far away photomultipliers
or other light-collecting devices.

A typical structure of a photomultiplier is shown in Fig. 12.22. When photons
hit the photocathode, an electron is emitted with probability 10-20% or higher and
hits the first dynode $D_1$, emitting a few more electrons. They are attracted and
accelerated by the higher electric potential of successive dynodes. The potential
is supplied by the accompanying circuit shown at the top of the figure. For each
incident electron, 3-4 electrons are emitted at every dynode, and the process is
repeated as many times as there are dynodes, thus multiplying the number of
electrons in cascade. By the time they reach the anode, the number of electrons is
multiplied to $(3\text{-}4)^{14} \sim 10^8$, producing a strong electric pulse.
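
The cascade multiplication can be made concrete with a few lines of code. This is a minimal sketch in which every dynode is assumed to have the same secondary-emission factor; the function names are illustrative.

```python
import math

def pmt_gain(secondary_emission, n_stages):
    """Overall gain of a photomultiplier with n_stages multiplication stages,
    each releasing `secondary_emission` electrons per incident electron."""
    return secondary_emission ** n_stages

def stages_needed(target_gain, secondary_emission):
    """Smallest number of stages that reaches at least target_gain."""
    return math.ceil(math.log(target_gain) / math.log(secondary_emission))

if __name__ == "__main__":
    for delta in (3, 3.5, 4):
        print(delta, "per stage ->", pmt_gain(delta, 14))
    print("stages for 1e7 at 4 per stage:", stages_needed(1e7, 4))
```

With 14 stages the gain ranges from about $5\times10^6$ at three electrons per stage to about $3\times10^8$ at four, which brackets the figure of $\sim10^8$ quoted above. The steep dependence on the per-stage factor is why the gain is sensitive to the applied voltage.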

Figure 12.21 Scintillation counters with primitive (fishtail) (a) and improved
(adiabatic) (b) light guides. (c) The use of optical fibers embedded in a groove as
light guides is mostly used for stacks of scintillators, typically in a calorimeter
where compactness takes precedence over light collection efficiency. PM =
photomultiplier.

Figure 12.22 Schematic layout of a photomultiplier and voltage supply taken from [298].

Plastic scintillation counters excel in time response and resolution and are used
to detect the arrival time of particles and to trigger other devices. They can also
measure the time interval that the particle takes to fly a certain length (TOF; time
of flight) and thereby determine the particle's velocity. Combined with the momentum
information this can be used to determine the mass, i.e. the particle species. A
typical plastic scintillator has a density of 1 g/cm$^3$. Therefore, when an energetic
charged particle passes through a scintillator of thickness 1 cm, it deposits
~ 1.8 MeV of energy in it. Assuming an average ionization energy of 30 eV, it creates
~ $6 \times 10^{4}$ ion pairs and produces ~ 10 000 photons. As the average energy of
fluorescence is $\langle h\nu \rangle \sim 3$ eV, about 1-2% of the energy is converted
to photons. Assuming a geometrical efficiency of 10% for the photons to reach the
photomultiplier and a quantum efficiency of ~ 20% for conversion to electrons, the
number of emitted photoelectrons is about 200, which has a statistical uncertainty of
10-20%, a typical resolution of plastic scintillators.
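
The light budget of the 1 cm scintillator can be followed step by step. This is a small sketch using only the round numbers quoted in the paragraph above (10 000 photons, 3 eV per photon, 10% geometrical efficiency, 20% quantum efficiency); the function names are hypothetical. It evaluates only the pure Poisson term, which comes out somewhat below the 10-20% quoted in the text.

```python
import math


def photoelectron_estimate(n_photons=10_000, geom_eff=0.10, quantum_eff=0.20):
    """Return (number of photoelectrons, relative Poisson fluctuation)."""
    n_pe = n_photons * geom_eff * quantum_eff
    return n_pe, 1.0 / math.sqrt(n_pe)


def light_conversion_fraction(n_photons=10_000, photon_ev=3.0, deposited_ev=1.8e6):
    """Fraction of the deposited energy that ends up as scintillation light."""
    return n_photons * photon_ev / deposited_ev


if __name__ == "__main__":
    n_pe, rel = photoelectron_estimate()
    print(f"photoelectrons: {n_pe:.0f}, Poisson fluctuation: {100 * rel:.1f}%")
    print(f"energy converted to light: {100 * light_conversion_fraction():.1f}%")
```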

Problem 12.2 

Assume that light signals with an attenuation time of 10 ns (1 ns = $10^{-9}$ s) are
10 ns long triangular pulses and are amplified by the photomultiplier given in
Fig. 12.22. Assuming a gain (amplification factor) of $10^{7}$ and R = 50 Ω
(RC ≪ 10 ns, where C is the stray capacitance of the circuit), what is the pulse
height (voltage) obtained from 200 photoelectrons?



b) Cherenkov Counter 

i) Basic concept The Cherenkov counter resembles the scintillation counter in that
it extracts light from a transparent material. It differs in that it does not emit light if
the particle velocity is below a threshold ($\beta = v/c = 1/n$; n = index of refraction),
and also in that the emission is not isotropic but directional. The response time
is also much faster, restricted only by the photomultiplier response. The emission
angle $\theta$ with respect to the particle direction is determined by

$$\cos\theta = \frac{1}{n\beta} \qquad (12.91)$$

The energy loss per unit length is given by Eq. (12.66). Denoting the frequency of
the Cherenkov light as $\nu$,

$$\frac{d^2E}{dx\,d\nu} = \frac{2\pi z^2\alpha}{c}\,h\nu\left(1-\frac{1}{\beta^2 n^2}\right) = \frac{2\pi z^2\alpha}{c}\,h\nu\,\sin^2\theta \qquad (12.92)$$

Here, ze is the charge of the incident particle. In terms of the wavelength it is
proportional to $d\lambda/\lambda^3$ and the spectrum is dominated by photons in the
violet to ultraviolet region. The number of photons per unit length is obtained by
dividing the above formula by $h\nu$. Taking n to be approximately constant over the
frequency range ($\nu_1 \sim \nu_2$) where photomultipliers are sensitive, we obtain

$$\frac{dN}{dx} = \frac{2\pi z^2\alpha}{c}\,\sin^2\theta\,\Delta\nu, \qquad \Delta\nu = |\nu_1-\nu_2| \qquad (12.93)$$

As an example, putting z = 1, $\beta$ = 1, n = 1.33 (water), dN/dx ~ 200 cm$^{-1}$.
Further setting $\lambda$ = 600 nm, dE/dx ~ 400 eV/cm. As the ionization loss is
~ 1.8 MeV/(g cm$^{-2}$), only 0.02% of the energy is used and the light emission
efficiency is ~ 1/100 that of a scintillator.
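
The water example can be reproduced with a few lines of code. The sketch below uses the standard Frank-Tamm photon yield, $dN/dx = 2\pi\alpha z^2 \sin^2\theta\,(1/\lambda_{\min} - 1/\lambda_{\max})$. The 350-550 nm sensitivity window of the photomultiplier is an assumption made here for illustration, not a number taken from the text.

```python
import math

ALPHA = 1.0 / 137.036   # fine-structure constant
HC_EV_NM = 1239.84      # h*c in eV nm


def cherenkov_angle(beta, n):
    """Emission angle in radians, or None below the threshold beta = 1/n."""
    cos_theta = 1.0 / (n * beta)
    if cos_theta > 1.0:
        return None
    return math.acos(cos_theta)


def photons_per_cm(beta, n, lam_min_nm=350.0, lam_max_nm=550.0, z=1):
    """Photons emitted per cm in the assumed wavelength window."""
    theta = cherenkov_angle(beta, n)
    if theta is None:
        return 0.0
    # (1/lam_min - 1/lam_max) converted from nm^-1 to cm^-1
    inv_window = (1.0 / lam_min_nm - 1.0 / lam_max_nm) * 1.0e7
    return 2.0 * math.pi * ALPHA * z**2 * math.sin(theta) ** 2 * inv_window


if __name__ == "__main__":
    n_ph = photons_per_cm(beta=1.0, n=1.33)
    photon_ev = HC_EV_NM / 600.0  # photon energy at 600 nm
    print(f"dN/dx = {n_ph:.0f} photons/cm")
    print(f"dE/dx = {n_ph * photon_ev:.0f} eV/cm")
```

With these assumptions the result is of the order of 200 photons/cm and 400 eV/cm, in line with the figures quoted in the text.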

ii) Threshold-Type Cherenkov Counter Counters of this type are used to identify
particle species by utilizing the fact that light is emitted only when $\beta > 1/n$. For
instance, when there is a mixed beam of electrons, muons and pions with fixed
momentum, the velocities are $\beta_e > \beta_\mu > \beta_\pi$ because
$m_e < m_\mu < m_\pi$. As the index of refraction of a gas varies with the pressure
(n - 1 is proportional to it), two Cherenkov counters can be arranged so that one of
them detects electrons only and the other electrons and muons but not pions. Then
the two counters in tandem can separate the three particle species.
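
The tandem arrangement can be illustrated with a short calculation. This is a sketch under stated assumptions: a 10 GeV/c beam and the two refractive indices below are chosen here purely for illustration, and the helper names are hypothetical.

```python
import math

# Particle masses in GeV/c^2
MASSES = {"e": 0.000511, "mu": 0.10566, "pi": 0.13957}


def beta(p, m):
    """Velocity v/c of a particle with momentum p (GeV/c) and mass m (GeV/c^2)."""
    return p / math.hypot(p, m)


def fires(p, m, n):
    """True if the particle is above the Cherenkov threshold beta > 1/n."""
    return beta(p, m) > 1.0 / n


def signature(p, m, n_a, n_b):
    """(counter A fires, counter B fires) for a two-counter tandem."""
    return fires(p, m, n_a), fires(p, m, n_b)


if __name__ == "__main__":
    p_beam = 10.0          # GeV/c, assumed beam momentum
    n_a = 1.0 + 3.0e-5     # counter A: electrons only (assumed setting)
    n_b = 1.0 + 8.0e-5     # counter B: electrons and muons (assumed setting)
    for name, mass in MASSES.items():
        print(name, signature(p_beam, mass, n_a, n_b))
```

Each species produces a distinct pair of responses: both counters fire for electrons, only counter B fires for muons, and neither fires for pions.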

iii) Differential-Type Cherenkov Counter Even if $\beta > 1/n$, the emission angle
depends on the velocity. By picking up light emitted only in the angle interval from
$\theta$ to $\theta + \Delta\theta$, particle identification is possible. Figure 12.23a
shows a differential-type Cherenkov counter called DISC [265], which was used to
identify particles in the hyperon beam (a 15 GeV secondary beam produced by a 24 GeV
primary proton beam at CERN). The beam enters from the left and goes through a gas
radiator. Emitted light is reflected back by a mirror, through a correction lens and
slit, reflected through 90° by another mirror and collected by photomultipliers. Only
light between $\theta$ and $\theta + \Delta\theta$ is detected by use of a slit. To
achieve $\Delta\beta/\beta = 5 \times 10^{-5}$, a correction lens made of silica gel is
used. Figure 12.23b shows a velocity curve of the Cherenkov counter. The chosen
velocity is varied by changing the pressure of the nitrogen gas.

iv) RICH (Ring Image Cherenkov Counter): In this type of counter the emitted light 
is mapped on a plane to produce characteristic ring patterns. An example used as 
part of the collider detector complex at DELPHI [119] is shown in Fig. 12.24. 



Figure 12.23 (a) A differential-type gas Cherenkov counter used to identify particles
in the hyperon beam [265]. (b) A velocity curve of the Cherenkov counter to identify
15 GeV/c particles produced by the 24 GeV primary proton beam. The velocity is varied
by changing the pressure of the nitrogen gas.

Figure 12.25 Cherenkov light detected by the RICH counter. (a,b) Cherenkov light from
the liquid radiator projected on a plane perpendicular to the incident particles.
(a) Photons emitted by a single particle. (b) Accumulated light from 246 cosmic ray
muons.



Functionally, it is two counters in one. The front section uses the liquid radiator
$\mathrm{C_6F_{14}}$ and the rear section uses the gas radiator $\mathrm{C_5F_{12}}$.
Cherenkov light from the gas radiator is reflected back by mirrors and detected by the
same drift tube detector that detects the light from the liquid radiator. The drift
tube (similar to the drift chamber to be described later) contains a gas mixture of
$\mathrm{CH_4}$ + $\mathrm{C_2H_6}$ + TMAE, where TMAE is
tetrakis(dimethylamino)ethylene. The signals are processed by computers. Figure 12.25
shows a pattern projected on a plane perpendicular to the incident particles.

Figure 12.25a is the response for a single particle and (b) is a collection of 
light signals obtained from 246 cosmic ray muons. The poor photon statistics of 
Cherenkov light is reflected in the small number of detected photons. 



c) Transition Radiation Counter The transition radiation counter uses light emitted
at the boundary of two materials with different dielectric constants. As described
before, the origin is dipole emission of polarized media induced by the passage of
a particle. As the emitted photon energy is proportional to
$\gamma = 1/\sqrt{1-\beta^2}$, it is most suitable for detection of extremely
relativistic ($\gamma > O(1000)$) particles. The differential emission spectrum is
given by Eq. (12.72). The emission angle of the photon is concentrated in the forward
region ($\theta < 1/\gamma$). The spectrum decreases rapidly above
$\omega > \gamma\omega_p$ and the lower cutoff is determined by the detector
condition. Taking the cutoff frequency $\omega_c = 0.15\gamma\omega_p$, the detected
number of photons per layer is given roughly by

$$N_\gamma(\omega > 0.15\gamma\omega_p) \sim 0.5\alpha \qquad (12.94)$$



As the number of photons emitted is extremely small, several hundred layers
with gaps between them are necessary for practical use. In a practical design, the
transition radiation counter consists of radiators and detectors. A dominant fraction
of soft X-rays is absorbed by the radiator itself. To avoid absorption of emitted
X-rays by the radiators, they have to be thin. But if they are too thin the intensity
is reduced by the interference effect of the formation zone ($Z_F \sim$ a few tens of
micrometers). Besides, for multiple boundary crossings interference leads to
saturation at $\gamma_{\rm sat} \sim 0.6\,\omega_p\sqrt{\ell_1\ell_2}/c$, where
$\ell_1$, $\ell_2$ are the thickness of the radiators and the spacing, respectively
[30, 155]. As the X-ray detectors are also sensitive to the parent particle's
ionization, sufficient light to surpass it is necessary. A practical design has to
balance the above factors. Radiators that have been used come in two kinds: a stack
of thin foils or a distribution of small-diameter straw tubes. Figure 12.26 shows
the latter type used in VENUS/TRISTAN [340]. The radiator consists of polypropylene
straw tubes with 180 μm diameter placed in He gas. With cutoff at 11 keV, it
achieved 80% efficiency for electrons with rejection factor greater than 10 against



Figure 12.26 Transition radiation counter. (a) Schematic layout of one used in [...]
radiation spectra [340, 341]. Note the pion's sig[...]






12.5.3 

Detectors that Use Electric Signals 



a) Gaseous Detectors

Basic Structure Electrons and ions can move at high speed in gas and when the 
proper potential is applied they can be extracted as electric signals. Three types of 
gaseous detectors - ionization, proportional, and Geiger tubes - were used first 
historically. They share the same structure but have different operating voltages 
(Fig. 12.27). A rare gas such as argon (Ar) or xenon (Xe) is confined in a cylindrical 
tube with a metal wire stretched at the central position. The reason for using rare 
gas is so that ionized electrons do not easily recombine with positive ions. The 
outer wall of the cylinder (cathode) is grounded and high voltage is applied to the 
central wire (anode). The strength of the electric field is proportional to 1/r. When 
a charged particle passes through the tube, electrons and positive ions are made 
along the track. The number of ion pairs is proportional to the energy loss of the 
particle. If no voltage is applied, they recombine and restore electric neutrality, but 
if a voltage is applied, electrons are attracted to the anode and positive ions are 
attracted to the cathode, creating an electric current. However, as positive ions are 
much heavier they are slow in reaching the cathode and the signals they create are 
usually discarded. 

Operation Mode When the applied voltage is low, exactly the same number of free
electrons as are created by ionization are collected by the anode. Detectors
operating in this region (ionization mode) are called ionization tubes (or chambers,
depending on the shape of the detector). The electric signal is small and used only
when a large enough number of electrons are created. With higher voltage, the
accelerated electrons knock out additional electrons from the gas molecules. By
making the radius of the anode wire small, an electric field with large gradient
is created near the anode, so that the secondary electrons can also ionize the
molecules and multiplication begins, leading eventually to a cascade. When the
voltage is not too high, the number of electrons collected at the anode is
proportional to the number produced by ionization (proportional mode). Detectors in
this range (proportional tubes) can attain an amplification factor of
$10^{3}$-$10^{5}$ by varying the voltage.

Figure 12.27 (a) Basic structure of a gaseous counter. (b) The number of collected
ions in various detection modes depending on the applied voltage.

With further increase of the applied voltage, the space charge effect due to the 
ion pairs distorts the electric field and the proportionality is lost (limited streamer 
mode). However, some designs take advantage of the high output, which elimi- 
nates the need for electronic amplifiers. The induced cascade is still local, but with 
even higher voltage it spreads all over the detector region (Geiger mode), giving 
a saturated output signal regardless of the input. The reason for the spreading of 
the cascades is that excited molecules emit light when going back to the original 
states and excite more molecules in the far distance. In this mode, a quenching 
gas to absorb light is needed to stop the discharge. The recovery time is long and 
it is not suitable for high-speed counting. Note that, depending on the inserted gas 
species, the region of limited proportionality may not appear. The relation between 
the voltage and the number of output electrons is depicted in Fig. 12.27b.

Ionization Chamber As the output is small in the ionization mode, it is used only
for special purposes, such as a calorimeter, where all the energy is deposited in
the detector. The use of liquid increases the density and enables the detector to
contain showers in a small volume and also increases the number of ionized electrons
per unit length. In liquids, molecules such as oxygen capture electrons and a
small fraction of such impurities prevents electrons from reaching the anode. For
this reason only rare gas liquids such as Ar and Xe are used. Even in this case,
the concentration of oxygen impurities has to be less than a few ppm (parts per
million = $10^{-6}$). There are many advantages in using it in high-energy
experiments, but the disadvantage is that the liquid state can be maintained only at
liquid nitrogen temperature (77 K) and therefore necessitates a cooling vessel
(cryostat), which makes a dead region. To avoid this, so-called warm liquids (organic
liquids referred to as TMP and TME) are being developed.

MWPC (Multiwire Proportional Chamber) The MWPC, invented by Charpak [95] 
in 1968, made a big step forward in the high-speed capability of track detection. Ca- 
pable of high speed and high accuracy, it has accelerated the progress of high-ener- 
gy physics and has found widespread application as an X-ray imager in astronomy, 
crystallography, medicine, etc. For this contribution, Charpak was awarded the No- 
bel prize in physics in 1992. The basic structure is depicted in Fig. 12.28a. Many 
wires are placed at equal intervals between two cathode plates and high voltage is 
applied between the wires (anodes) and the cathodes to be operated in proportional 
mode. The electric field is shaped as in Fig. 12.28b and each wire acts as an
independent proportional tube. When a particle traverses the plate perpendicularly,
ionized electrons are collected by nearby wires, thus achieving position accuracy
determined by the spacing of the anode wires.

Figure 12.28 (a) Basic structure of the MWPC. (b) Shape of the electric field [312].



A typical example has wire spacing 0.5-2 mm, plate separation 1.5-2 cm, electric
voltage 3-4 kV and radius of the anode wires 10-50 μm. The field is
$10^{4}$-$10^{5}$ V/cm and the amplification factor is ~ $10^{4}$. A space resolution
of 0.5 mm is obtained. For the gas in the chamber, Ar with 10% methane or so-called
magic gas [Ar 75% + isobutane ($\mathrm{C_4H_{10}}$) + Freon 13 ($\mathrm{CF_3Br}$)]
are commonly used.

Usually two MWPCs are used in combination to determine both x and y coordinates.
Sometimes more planes with u and/or v coordinates (slanted relative to
x, y) are used to eliminate multi-hit ambiguity. If strips of metal sheet instead of
planes are used as cathodes, a single plane of the MWPC can obtain x and y
simultaneously. The rise time of the MWPC is fast and the decay time is dictated by
the ion mobility and the size of the chamber. If the wire spacing is a few
millimeters, the decay time can be very fast (25-30 ns) and the MWPC can sometimes be
used to obtain time information. Each wire can cope with a very high counting rate of
~ $10^{4}$ s$^{-1}$ mm$^{-1}$.

i) Drift Chamber (DC) The basic structure of a drift chamber is given in Fig. 12.29.
In contrast to the proportional chamber, the field lines in the DC are horizontal
between the anode and cathode wires. Besides, a number of field wires with graduated
voltages are used to obtain a uniform electric field. Electrons produced by incident
particles drift towards the anode with constant velocity. The drift time together with
the time information about the particle passage obtained separately determines the
distance from the anode wire. The anode wires are in the proportional mode and
the field distribution near the anode is similar to that in the MWPC. The uniform
electric field has ~ 1 kV/cm extending 1-10 cm. As the drift velocity of the
electrons is 4-5 cm/μs and the accuracy of time measurement is ~ 2 ns, it can give
an accuracy of position determination ~ 100 μm. Drift chambers are often used
in a magnetic field to determine the momentum of particles. In a magnetic field,
electrons feel the Lorentz force and the voltage applied to the field wires is varied
to compensate it. As the operational mode is the same as the MWPC, the response
time and radiation hardness are similar. There are many variations of drift
chambers, some of which will be introduced later.

Figure 12.29 Basic structure of a drift chamber [77]. The field lines in the figure
are horizontal between the anode and cathode wires. A number of field wires with
graduated voltages are used to obtain a uniform electric field.
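
The conversion from drift time to position in a drift chamber is a one-line calculation. The following sketch assumes a constant drift velocity (the uniform-field idealization of the text) and uses hypothetical function names; the default numbers are those quoted above.

```python
def drift_distance_um(t_hit_ns, t0_ns, v_drift_cm_per_us=5.0):
    """Distance from the anode wire in micrometers.

    t_hit_ns : arrival time of the drifting electrons at the anode
    t0_ns    : particle passage time, measured separately
    A constant drift velocity is assumed; 1 cm/us equals 10 um/ns.
    """
    return (t_hit_ns - t0_ns) * v_drift_cm_per_us * 10.0


def position_resolution_um(sigma_t_ns=2.0, v_drift_cm_per_us=5.0):
    """Position uncertainty caused by the timing uncertainty alone."""
    return sigma_t_ns * v_drift_cm_per_us * 10.0


if __name__ == "__main__":
    print(drift_distance_um(t_hit_ns=400.0, t0_ns=0.0), "um drift distance")
    print(position_resolution_um(), "um position resolution")
```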

ii) GEM (Gas Electron Multiplier) With the advent of semiconductor microchip
technology, much smaller strips of conductive plates have become feasible.
GEM is one such technology among many similar detectors (micromegas detector,
etc., see [311]). Figure 12.30 shows the layout of the GEM detector. It has essentially
the same structure as the MWPC; however, the anode intervals are much smaller.
Multiplication of electrons is carried out not by a cylindrical electric field but by
a potential difference across the hole, and outcoming electrons are captured by nearby
anodes.




Figure 12.30 (a) Layout of a GEM detector. Electrons in the drift space are attracted
by the GEM, go through an amplification hole and are collected by nearby anodes.
(b) Schematic view and typical dimensions of the hole structure in the GEM
amplification cell. Electric field lines (solid) and equipotentials (dashed) are
shown. On application of a potential difference between the two metal layers,
electrons released by ionization in the gas volume above the GEM are guided into the
holes, where charge multiplication occurs in the high field [311].



b) SSDs (Solid State Detectors) When a small amount of an element with five 
valence electrons (P, As, Sb) is added to a semiconductor with four valence elec- 
trons such as silicon (Si) or germanium (Ge), excess electrons are easily separated 
to become free electrons (n-type semiconductors or n-SSD) and the SSD becomes 
slightly conductive. On the other hand, if an element with three valence electrons
(B, Al, Ga, In) is added, one electron is missing for each covalent bond and an
electron is borrowed from somewhere else, resulting in excess freely moving positive
holes (p-type semiconductor or p-SSD). The elements with five valence electrons
are called donors and those with three, acceptors. Free electrons and holes
are called carriers. When p-SSD and n-SSD are in plane contact with each other
(p-n junction), thermal motion causes electrons to diffuse into p-SSD and holes
into n-SSD. The diffused electrons (holes) annihilate holes (electrons) on the other
side, leaving both sides of the junction with ionized donors and acceptors. As these
ions are bound in lattices, there remain distributed static charges, namely electric
fields. The field works in the opposite direction to prevent conduction and
eventually balances with the diffusing force. Carriers are absent in this region,
which is called the depletion layer. The voltage across the depletion region is
called the diffusion voltage. The outside region is neutral and the potential is
nearly flat.

When one applies reverse bias, namely, positive (negative) voltage on the n (p) 
side of the junction (Fig. 12.31a), electrons and holes are attracted and extend the 
depletion region. No current flows. If, on the other hand, forward bias is applied 
(Fig. 12.31b), the diffusion potential is reduced, causing electrons on the n side and 
holes on the p side to diffuse beyond the depletion region, and a current flows. 

Figure 12.32 Schematic layout of [...]pled.

If a charged particle passes through the depletion region, the electrons and holes
created along the track are attracted by the diffusion voltage and a pulse current
flows. Pairs created outside the depletion region diffuse slowly, giving a slow
component that is mostly ignored. The p-n junction can be considered as a condenser
with electrodes placed at the two edges of the depletion layer. Let C be its
capacitance, $C_A$ the effective capacitance of an amplifier connected to the junction
to read out the electric signal, and Q the amount of all the charges collected. Then
the magnitude of the pulse is given by $Q/(C + C_A)$. A characteristic of a
semiconductor is that the energy levels of the impurity are very close to the
conduction band, and it takes on average only 3.0 eV (Ge) or 3.7 eV (Si) to make an
electron-hole pair. We have already stated that in ordinary matter it takes about
30 eV to ionize atoms. This means that the number of created pairs in the SSD is ten
times larger and the statistical accuracy is greatly enhanced. An ionization loss of
1.8 MeV can produce $1.8 \times 10^{6}/3.7 \sim 5 \times 10^{5}$ pairs with a
statistical error of $1/\sqrt{N} \sim 0.2\%$. If we consider the additional
geometrical loss of scintillation counters, the resolution is at least 10 times better
than that of NaI and approximately 100 times better than that of plastic scintillation
counters.
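
The statistical advantage of the semiconductor detector follows directly from the energy needed per created pair. The sketch below compares the numbers quoted in the text (3.7 eV per pair in Si, about 30 eV in ordinary matter) for a 1.8 MeV energy deposit, assuming pure Poisson statistics; the function name is illustrative. The Poisson term for silicon comes out near 0.14%, the same order as the figure quoted above.

```python
import math


def pair_statistics(deposited_ev, ev_per_pair):
    """Return (number of pairs, relative Poisson fluctuation 1/sqrt(N))."""
    n_pairs = deposited_ev / ev_per_pair
    return n_pairs, 1.0 / math.sqrt(n_pairs)


if __name__ == "__main__":
    deposit = 1.8e6  # eV, ionization loss used in the text
    for label, w in (("Si", 3.7), ("Ge", 3.0), ("ordinary matter", 30.0)):
        n, rel = pair_statistics(deposit, w)
        print(f"{label:>15}: N = {n:.2e}, 1/sqrt(N) = {100 * rel:.2f}%")
```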

If an SSD is made in the form of arrays of strips with a width of, say, 30 μm using
methods taken from the manufacture of integrated circuits (Fig. 12.32), it can
function as a high precision position detector. Applications of SSDs as high-quality 
detectors are plentiful. The disadvantage is that it is difficult to manufacture large 
detectors and they are prone to radiation damage. 



12.5.4 

Functional Usage of Detectors 

The information one wants to obtain using particle detectors is classified as follows. 

(1) Time of particle passage: Detectors are used to separate events that occur within
a short time interval or to determine the particle's velocity by measuring its passage
time at two different locations. The most widely used detectors are scintillation
counters with time resolution $\Delta t \sim$ 100 to 200 ps (1 ps = $10^{-12}$ s).

(2) Position of particles: Detectors are used to reconstruct tracks, the direction of
momentum and patterns of reactions. MWPCs (multiwire proportional chambers
with position resolution $\Delta x \sim$ 1 mm), DCs (drift chambers with
$\Delta x \sim$ 100 to 200 μm) and SSDs (solid state detectors with $\Delta x \sim$ 10
to 100 μm) are used. For some special experiments where detection of fast decaying
particles is at stake, emulsions are used to obtain $\Delta x \sim$ 0.1 to 0.5 μm.
When speed matters, hodoscopes, in which strips of scintillation counters are arranged
to make a plane, come in handy despite their reduced position resolution. They are
used for triggering (selecting desired events from frequently occurring reactions and
initiating data taking) or for crude selection of events.

(3) Energy-momentum (E, p) is obtained by dE/dx counters (which measure the
ionization loss in a thin material), range counters or calorimeters (which measure
the total energy deposited in a thick material). To obtain the momentum one measures
the curvature of a particle's circular motion in a magnetic field.

(4) Particle identification (PID) is carried out by measuring the mass of the
particle. With combined knowledge of the momentum p and velocity v, the mass value
can be extracted from $m = p/(v\gamma)$. A pattern of particle reactions contains much
valuable information. A long track without interaction may be identified as a $\mu$,
an electromagnetic shower as $\gamma$ or e (depending on the existence of a pre-track),
a V-shaped decay pattern with long (typically a few cm ~ 1 m) missing tracks as
$K^0$, $\Lambda$ and with short (typically 10 μm ~ 1 mm) missing tracks as $\tau$
leptons or B, C particles. In a many-body system, knowing the energies and momenta of
particles can give an invariant mass
($(mc^2)^2 = (\sum E_i)^2 - (\sum \mathbf{p}_i c)^2$) of the whole system.
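
The invariant mass of a many-body system mentioned in item (4) can be computed as follows. This is a minimal sketch in natural units (c = 1, energies in GeV, momenta in GeV/c); the four-vector layout `(E, px, py, pz)` and the function name are choices made here for illustration.

```python
import math


def invariant_mass(particles):
    """Invariant mass of a system of particles given as (E, px, py, pz)."""
    e_tot = sum(p[0] for p in particles)
    px = sum(p[1] for p in particles)
    py = sum(p[2] for p in particles)
    pz = sum(p[3] for p in particles)
    m2 = e_tot**2 - (px**2 + py**2 + pz**2)
    # Guard against tiny negative values caused by rounding.
    return math.sqrt(max(m2, 0.0))


if __name__ == "__main__":
    # Two back-to-back photons of 67.5 MeV each, as from a neutral pion at rest.
    photons = [(0.0675, 0.0, 0.0, 0.0675), (0.0675, 0.0, 0.0, -0.0675)]
    print(f"m = {invariant_mass(photons):.4f} GeV")
```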
In the following we list detectors classified by their function. 

Momentum Measurement A particle with momentum p makes a circular motion 
in a magnetic field, and from its curvature the momentum can be determined [see 
Eq. (12.11)]: 

$$p_T\,[\mathrm{GeV}/c] = 0.3\,B\,R\,[\mathrm{T\,m}] \qquad (12.95)$$

Note, however, that the measured momentum is the component ($p_T$) transverse to
the magnetic field. The total momentum can be calculated if the angle with respect
to the magnetic field is known. Let us assume that a track-measuring device
is placed in the magnetic field and n points of a track with total projected length L
within the magnet are measured. The momentum can be determined by measuring
a quantity called the sagitta s, which is defined as the distance from the middle
point of the arc to the line joining the two end points. For a not so large bending
angle $\theta$, where $\theta \simeq \sin\theta$ holds,

$$s = R\left(1-\cos\frac{\theta}{2}\right) \simeq \frac{R\theta^2}{8} = \frac{L^2}{8R} = \frac{0.3\,BL^2}{8\,p_T} \qquad (12.96)$$

From the formula, one sees that the momentum resolution improves as the magnetic
field and path length become larger and as the resolution of space measurement
improves. The momentum resolution is given by

$$\frac{\Delta p_T}{p_T} = \frac{\Delta R}{R} = \frac{\Delta s}{s} = \frac{8R}{L^2}\,\Delta s = \frac{8\,p_T}{0.3\,BL^2}\,\Delta s \qquad (12.97)$$
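
The sagitta method can be made concrete with numbers. The sketch below uses the relation $p_T = 0.3BR$ of Eq. (12.95) together with the small-angle sagitta $s = L^2/(8R)$; the field strength, track length and sagitta error in the example are assumptions chosen only for illustration.

```python
def bending_radius_m(pt_gev, b_tesla):
    """Radius of curvature from p_T [GeV/c] = 0.3 B [T] R [m]."""
    return pt_gev / (0.3 * b_tesla)


def sagitta_m(pt_gev, b_tesla, length_m):
    """Small-angle sagitta s = L^2 / (8 R)."""
    return length_m**2 / (8.0 * bending_radius_m(pt_gev, b_tesla))


def relative_pt_resolution(pt_gev, b_tesla, length_m, sagitta_error_m):
    """Delta p_T / p_T = Delta s / s for a given sagitta error."""
    return sagitta_error_m / sagitta_m(pt_gev, b_tesla, length_m)


if __name__ == "__main__":
    b, length, ds = 1.0, 1.0, 100e-6  # assumed: 1 T, 1 m, 100 um sagitta error
    for pt in (1.0, 10.0, 100.0):
        s_mm = 1e3 * sagitta_m(pt, b, length)
        res = 100 * relative_pt_resolution(pt, b, length, ds)
        print(f"pT = {pt:6.1f} GeV/c: sagitta = {s_mm:7.3f} mm, resolution = {res:5.2f}%")
```

The relative resolution grows linearly with the momentum, because the sagitta shrinks as 1/p_T while the measurement error stays fixed.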

One needs to measure at least three points to determine the sagitta and the
resolution improves as the number of measured points increases. If we denote the
resolution of a point measurement as $\Delta x$, the relation and the momentum
resolution are given by



In addition to this measurement error, one has to consider the error due to multiple
scattering (see Sect. 12.4.3).

Figure 12.33 shows a cylindrical drift chamber to determine momenta of secondary
particles built in the VENUS/TRISTAN detector [27]. The magnetic field is
along the beam direction and the wires are also strung in the same direction. A 
unit cell is made of one anode wire and eight field wires. The equipotentials of the 
electric field are circular and the position accuracy does not depend on the angle of 
incidence of the particle. 



Energy Measurement If a particle loses energy only by ionization, the range 
(Sect. 12.4.2) can determine the total kinetic energy of the particle. However, this 
method can be applied only to muons or low-energy hadrons below a few hundred 
MeV. The majority of particles interact in one way or another before they stop. A 
calorimeter can measure the total energy of practically all known particles except 
muons and neutrinos. It can measure the energy of neutral particles, too.
Calorimeters have become an indispensable element of modern high-energy collider
experiments. They are suitable for observation of jets, which have their origins in
energetic quarks and gluons, and by making a calorimeter a modular structure, 
it is possible to cover a large solid angle. The depth of a calorimeter grows only 
logarithmically with energy. 

i) Total Absorption Cherenkov Counter This is a Cherenkov counter of total
absorption type. A transparent material that contains a heavy metal having large
density and short radiation length is suitable for the purpose. Lead glass
($\rho \sim 5$ g/cm$^3$, $X_0 \sim 2$ cm), for instance, with a depth of 15-20 $X_0$
can contain almost all electromagnetic showers and can detect Cherenkov light, whose
output is proportional to the total length of the shower tracks. Thus it serves as an
electromagnetic calorimeter and can measure the total energy of an incident electron
or a gamma ray. Other materials that can be used include NaI, CsI, and BGO.

ii) Sampling Calorimeter For hadrons a total absorption counter made totally of 
active material is impractical because the size becomes too big. For economic rea- 
sons, a sandwich of iron or lead absorbers and active detectors is frequently used. 
As detectors plastic scintillators or liquid argon proportional chambers are used. 
The calorimeter is usually placed in the magnetic field, but it is better to place 
the photomultipliers outside the field. Figure 12.34a shows the basic structure of a 
calorimeter. Showers are sampled when they pass through the active detector. Ide- 
ally a separate counter should be used for each sampled signal, but in order to avoid 
having a large space occupied by light guides and photomultipliers, which may cre- 
ate a dead angle or space, light outputs are often transmitted to a wavelength-shifter 
and led to a photomultiplier through light guides. Figure 12.34b shows a configu- 
ration in which UV light is converted to blue light by the wavelength-shifter in the 
scintillator, which in turn hits a second wavelength-shifter after passing through 
the air gap and is converted to green light. Light with longer wavelength can trav- 
el further and thus can be guided along a long path to photomultipliers that are 
placed behind the absorber/radiator or where there is little effect of the magnetic 
field. Recently, optical fibers such as in Fig. 12.21 have also been used. As show- 
ers can produce a large number of electrons, ionization chambers can be used as 
detectors that can extract more accurate information. Here absorbers play a dou- 
ble role, being electrodes, too (Figure 12.34c). A practical design of a liquid argon 
calorimeter (that used in the ATLAS/LHC detector) is shown in Fig. 12.35. 

As the difference between the electromagnetic and hadronic calorimeters lies in
the use of radiation length versus interaction length, different materials and
sampling rates should be used. Usually lead is used for electromagnetic calorimeters
and iron for hadronic calorimeters. However, the structure can be the same. If
20 $X_0$ of lead is used for the electromagnetic calorimeter, this amounts to 0.65
interaction length. For this reason, electromagnetic calorimeters are usually placed
in front of hadronic calorimeters.

Figure 12.34 Structure of sampling calorimeters. (a) A calorimeter compactified to be
used in a collider detector. To avoid a dead angle and to make the solid angle as
large as possible, light is transported through a wavelength-shifter and light guide
to distant photomultipliers. (b) Conceptual and enlarged layout of the light
transmission mechanism of (a). (c) Liquid argon is used instead of scintillators as
photon detectors. Absorbers are immersed in liquid argon in a cryogenic vessel.

Figure 12.35 A module of the liquid argon calorimeter used in the ATLAS/LHC
experiment. (a) Accordion structure of the sampling layers with a shower simulation
superimposed. (b) Actual design of a module with solid angle coverage of
$\Delta\eta \times \Delta\phi$ (radian) = 0.025 × 0.0245. For [...] Eq. (12.100).
The [...] electrode made of thin copper and Kapton (polyimide) sheets is placed in the
middle of the lead layers. The total thickness is 22 radiation lengths. It has
attained an energy resolution of $\sigma/E = 10\%/\sqrt{E} + 0.3\%$ [32, 209].

iii) Energy Resolution of a Calorimeter The calorimeter uses a shower event, which
is a statistical phenomenon. The detected total track length is proportional to the
energy. If most of the shower can be contained in the calorimeter, the energy
resolution $\sigma$ is expected to be ~ $\sqrt{E}$ considering the statistical
(Poisson) nature of the shower. In the sampling calorimeter, the statistics become
worse as the absorber thickness d increases. The combined effects on the resolution
are expected to be

$$\frac{\sigma}{E} \propto \sqrt{\frac{d}{E}}$$

As the above argument shows, the energy resolution of the calorimeter is poor
at low energy (< 10 GeV). On the other hand, the momentum resolution using a
magnetic field worsens in proportion to the momentum. Namely, in the low-energy
region momentum measurement using a magnetic field is accurate,
and at high energies the calorimeter is a better method. Modern collider detectors
usually use both to cover all the energy regions.
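
The complementarity of the two methods can be illustrated by locating the energy at which their relative resolutions cross. The sketch below is only indicative: it takes the calorimeter resolution 10%/√E + 0.3% quoted for the ATLAS module in the caption of Fig. 12.35, and a tracker whose resolution grows linearly with momentum for an assumed field of 2 T, track length of 1 m and sagitta error of 100 μm. All tracker parameters are assumptions made here, and the particle is treated as ultrarelativistic (E ≈ p).

```python
import math


def calorimeter_resolution(e_gev, stochastic=0.10, constant=0.003):
    """Relative energy resolution sigma/E = a/sqrt(E) + b (E in GeV)."""
    return stochastic / math.sqrt(e_gev) + constant


def tracker_resolution(p_gev, b_tesla=2.0, length_m=1.0, sagitta_error_m=100e-6):
    """Relative momentum resolution from a sagitta measurement (assumed setup)."""
    return 8.0 * p_gev * sagitta_error_m / (0.3 * b_tesla * length_m**2)


def crossover_energy(lo=0.1, hi=1000.0, tol=1e-6):
    """Energy (GeV) above which the calorimeter is more precise, by bisection."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if tracker_resolution(mid) > calorimeter_resolution(mid):
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    print(f"crossover near {crossover_energy():.1f} GeV")
```

Below the crossover the magnetic measurement is the more precise one, and above it the calorimeter takes over, which is the qualitative statement made in the text.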



Particle Identification (PID) The identification of a particle species can be
achieved by knowing its mass. It can be determined if two variables out of v,
$E = m\gamma$, $p = m\beta\gamma$ are known. We have already shown that at fixed
momentum the Cherenkov counter can select velocities and can identify particles.

Figure 12.36 Particle identification using TOF method ($\Delta t$ = 200 ps,
L = 175 cm). Axis label: $m^2$ (GeV/c$^2$).

i) Time-of-Flight (TOF) Method Particle identification is done by measuring the 
velocity of particles by the most direct method, namely by measuring the time of 
flight between two counters. This method usually uses two scintillation counters, 
but any counters capable of fast timing can be used. Figure 12.36 shows data taken by TOF counters in VENUS/TRISTAN, which has $\Delta t = 200$ ps and a flight distance of 175 cm. The figure shows that particles of $p < 1$ GeV/$c$ can be identified with the above setup.
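
The separation power quoted above can be checked with a short calculation. The following is a minimal sketch, not the VENUS analysis itself: it assumes the standard relations $t = L/(\beta c)$ and $\beta = p/E$, and the function names and the rounded pion and kaon masses are chosen here for illustration only.

```python
import math

C_M_PER_NS = 0.299792458  # speed of light in metres per nanosecond

def flight_time_ns(p_gev, m_gev, length_m):
    """Flight time t = L / (beta c), with beta = p / E in units where c = 1."""
    energy = math.sqrt(p_gev**2 + m_gev**2)
    beta = p_gev / energy
    return length_m / (beta * C_M_PER_NS)

def mass_squared(p_gev, t_ns, length_m):
    """Invert the relation above: m^2 = p^2 (1 / beta^2 - 1), beta = L / (c t)."""
    beta = length_m / (C_M_PER_NS * t_ns)
    return p_gev**2 * (1.0 / beta**2 - 1.0)

M_PION, M_KAON = 0.13957, 0.49368   # masses in GeV/c^2 (rounded, illustrative)
L_FLIGHT = 1.75                     # flight distance in metres, as in the text
SIGMA_T_PS = 200.0                  # timing resolution in picoseconds, as in the text

def kaon_pion_dt_ps(p_gev):
    """Difference of kaon and pion flight times in picoseconds."""
    dt_ns = (flight_time_ns(p_gev, M_KAON, L_FLIGHT)
             - flight_time_ns(p_gev, M_PION, L_FLIGHT))
    return 1000.0 * dt_ns

for p in (0.5, 1.0, 2.0):
    dt = kaon_pion_dt_ps(p)
    print(f"p = {p:.1f} GeV/c: t_K - t_pi = {dt:7.1f} ps ({dt / SIGMA_T_PS:.1f} sigma)")
```

At 1 GeV/$c$ the kaon and pion flight times differ by roughly 0.6 ns, about three times the 200 ps resolution, which is consistent with the momentum limit stated above.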

ii) E/p Method The momentum of a particle is measured with a track-measuring device in the magnetic field and the energy is measured by electromagnetic calorimeters. Electrons give $E/p \simeq 1$, but hadrons (pions) give $E/p \ll 1$, because they hardly deposit any energy in the electromagnetic calorimeter as $\lambda$ (nuclear interaction length) $\gg X_0$ (radiation length). Usually the method is used to separate electrons from hadronic showers, but taking the shower profile further into consideration, a rejection factor $e/\pi$ of $10^3$-$10^4$ has been obtained without degrading the detection efficiency ($\varepsilon > 90\%$) [134, 389].

iii) Method of dE/dx As $dE/dx$ is sensitive only to $\beta$, knowing the particle momentum enables PID to be achieved. For this purpose $dE/dx$ has to be measured
accurately. The most advantageous detector for this method is the time projection 
chamber (TPC). It can measure p, dE/dx and space position of all particles si- 
multaneously and thus can reconstruct the topology of an entire reaction that has 
occurred in the chamber. It is a variation of the drift chamber and its structure is 
depicted schematically in Fig. 12.37. The chamber is a large-diameter cylindrical 
tube with a central partition that works as a cathode and two endcaps that work as 
anodes. It is placed in a solenoidal magnet and is used as a collider detector. The 
beam goes through the central axis of the TPC and the interaction point is at the 
center of the chamber. 

A TPC that was used for TOPAZ/TRISTAN experiments has an inner radius of 2.6 m, length of 3 m and the applied voltage is -43.25 kV [223]. All the secondary particles go through the chamber except at small solid angles along the beam, therefore it essentially covers $4\pi$ solid angle. A magnetic field is applied parallel to the cylinder axis, which is used to measure particle momenta. It is also designed to prevent transverse diffusion of ions so that they keep their $r$-$\phi$ position until they reach the anodes. Ions generated along tracks of the secondary particles drift parallel to the beam axis at constant velocity toward the endcap. The endcap consists of eight trapezoidal MWPCs, which can read the $r$ position of the ions. Pads at the back side of the MWPCs read the induced voltage capacitively and obtain the coordinate $\phi$ perpendicular to $r$. From the drift time of ions the $z$ coordinate is known. The signal strength at the anode is proportional to the ionization loss of the particles, and is sampled as many times as the number of wires, giving good statistical accuracy for the measurement of $dE/dx$. From the curvature of the tracks in the magnetic field, the momentum is determined and together with $dE/dx$ information this allows particle species to be identified. Figure 12.38 is a $dE/dx$ curve obtained by the TPC of the PEP4 experiment [311].

Figure 12.37 Illustration of the TPC used in TOPAZ/TRISTAN [223]. Inner radius 2.6 m, length 3 m. Ions from particle tracks are attracted away from the center of the TPC in the $z$ direction. At the end wall anode wires of eight trapezoidal MWPCs read the $r$ coordinate of the ions. At the back of the anodes cathode pads obtain the $\phi$ coordinate by reading the induced voltage. The arrival time gives the $z$ coordinate and the number of ions gives $dE/dx$ information. By reconstructing tracks and measuring their curvature, the momentum is determined and with $dE/dx$ the particle species can be identified.

Figure 12.38 $dE/dx$ curves as a function of momentum. As $dE/dx$ is a function of $\beta\gamma$ only, curves made by different particle species at fixed momentum $p = m\beta\gamma$ separate. Figure from [311].

As described above, the TPC is capable of reconstructing three-dimensional coor- 
dinates of all the tracks, momenta and dE/dx. The information obtained is compa- 
rable to that of bubble chambers and yet TPC can function at high speed. It is a very 
powerful detector suitable for electron-positron collider experiments. However, it 
is not without its limitations in that it takes 30-50 μs for ions to reach the endcaps,
and has difficulties in applications that require high-repetition data taking, such as 
in hadron collider experiments. 



12.6 Collider Detectors



a) Basic Concept The mainstream modern detector is a general-purpose collider 
detector, which is a combination of detectors with various functions. Many of them 
share a common basic structure as given in Fig. 12.39, where the basic elements 
and the particle's behavior in them are illustrated. A typical example, the ALEPH detector used in LEP/CERN [10], is shown in Fig. 12.40. It was used to measure $Z^0$ properties accurately and, together with other detectors at LEP, has been instrumental in establishing the validity of the Standard Model.




[Figure 12.39: schematic of the basic structure of a collider detector. Labeled components: Muon Chamber, Return Yoke, Hadron Calorimeter, Electromagnetic Calorimeter, Particle Identifying Detector, Track Chamber, Vertex Detector.]

We list a few characteristics common to all the general-purpose collider detectors.
(i) General purpose: They are made to respond to almost all (> 90%) kinds of conceivable reactions. Particle species that can be identified include $e$ (electrons and positrons), $\gamma$ and $\pi^0$, $\mu$, charged hadrons (mostly pions) and neutral hadrons (neutron, $K^0$, $\Lambda$, etc.).
(ii) Necessary conditions for a general-purpose detector are:



1. capability of identifying a particle's species and measuring its energy-momentum no matter in which direction the particle flies,

2. uniformity of the sensitivity and accuracy over the complete solid angle,

3. hermeticity: ideally the detector should have no dead angle, but in practice the beam pipe makes a dead zone in the forward and backward regions. At least the momentum and energy components of all the particles transverse to the beam direction should be measurable. This is important in order to measure the missing energy, which gives important information about the production of neutrinos or some of the suggested new particles.



As can be seen from Fig. 12.40, the detector is separated into two parts, a barrel region ($45^\circ < \theta < 135^\circ$) and an endcap region (a few degrees $< \theta < 45^\circ$). They have radial symmetry in the cross-sectional view perpendicular to the beam direction (Fig. 12.41) and also forward-backward symmetry with respect to the interaction point.16) The barrel and endcap parts have different geometrical structure and necessitate different approaches in designing them, but functionally it is desirable to have uniformity to avoid experimental errors.
We will describe a few of the important functions of the components.

16) [...] colliders like those at B-factories and HERA (e-p collider at [...]

Figure 12.40 Side view of the ALEPH detector.

Figure 12.41 Cross-sectional view of the ALEPH detector at LEP [10]. The ALEPH detector consists of (from the center outwards) 1. beam pipe, 2. silicon semiconductor vertex detector, [...] electromagnetic calorimeter, [...] superconducting coil, 8. hadron calorimeter, and 9. muon chamber.

b) Measurement of Interaction and Decay Points of Short-Lived Particles Reactions produced by beam particles colliding with residual gas in the beam pipe are unwanted events. To remove them it is necessary to determine the interaction point by tracing back several tracks. Track detectors inserted in the magnetic field can reconstruct the interaction point in principle, but if one wants to measure decay points of short-lived particles (charm and beauty particles), which have lifetimes $10^{-13}\ \mathrm{s} < \tau < 10^{-12}\ \mathrm{s}$ and fly a distance of only $l = c\tau\gamma$, i.e. tens or hundreds of micrometers, a specialized vertex (crossing points of tracks) detector is needed. SSD detectors are used, and resolutions of $\sim 10$ μm in the $r$-$\phi$ direction and $\sim 20$ μm in the $z$ direction were realized with the ALEPH detector. Figure 12.42 shows an example of decay-point measurements and Fig. 12.43 a lifetime curve of a short-lived particle.
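
The flight distances quoted above follow directly from $l = c\tau\gamma$. A minimal sketch, assuming $\beta \approx 1$ as the formula in the text does; the function name and the sample values of $\gamma$ are illustrative choices, not taken from the source:

```python
C_UM_PER_S = 2.99792458e14  # speed of light in micrometres per second

def decay_length_um(tau_s, gamma):
    """Mean flight distance l = c * tau * gamma (beta taken as 1)."""
    return C_UM_PER_S * tau_s * gamma

# Lifetimes spanning the range quoted in the text, for two illustrative boosts.
for tau in (1e-13, 1e-12):
    for gamma in (1.0, 5.0):
        length = decay_length_um(tau, gamma)
        print(f"tau = {tau:.0e} s, gamma = {gamma:.0f}: l = {length:7.1f} um")
```

The results range from a few tens of micrometers to around a millimeter, which is why a vertex resolution of order 10 μm is required.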



c) Momentum Analysis Momenta of charged particles can be determined with a combination of magnet and track chamber. A superconducting solenoidal coil is used to generate a strong magnetic field of 1-1.5 T (tesla). The direction of the magnetic field is usually chosen along the beam direction and thus only the transverse component of the momentum is measured, but with knowledge of the polar angle $\theta$ of the particle the total momentum can be reconstructed. The tracking chamber uses drift chambers of one type or another. By stretching wires along the beam direction a good spatial accuracy can be obtained for $r$-$\phi$ coordinates. The $z$ coordinates are usually obtained by layers of slanted wires stretched at a small angle relative to the main layers and their accuracy is not as good as that of $r$ or $\phi$. ALEPH used a TPC, and with $B = 1.5$ T, $\Delta s = 100$ μm obtained $\delta p_T/p_T^2 = 0.0009\ (\mathrm{GeV}/c)^{-1}$. $\Delta s = 100$ μm was obtained for particles having transverse track length $l = 1.4$ m. Because of the need to minimize multiple scattering, which limits the momentum accuracy, and also the need for a long track length, the tracking detector is usually placed in the central, inner part of the whole detector. The track chamber can also identify unstable neutral particles such as $K^0$ and $\Lambda$ by detecting their decay vertices.

Figure 12.42 Display of a two-jet event recorded by the ALEPH detector, in which a $B_s^0$ candidate has been reconstructed in the following mode: $B_s^0 \to D_s^- e^+ \nu_e$, $D_s^- \to \phi\pi^-$, $\phi \to K^+K^-$. (a) Overall view in the plane [...]. (b) Detailed view of the vertex detector, showing the hits in the two layers of silicon. (c) Zoom close to the interaction point (IP), showing the reconstructed tracks and vertices [106].

Figure 12.43 Lepton impact parameter distribution of $\Lambda\ell$ candidates. The solid curve is the probability function at the fitted value of the lifetime while the shaded areas show, one on top of the other, the background contributions [11].
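
The ALEPH figure quoted above can be cross-checked with the textbook sagitta relation. This is a rough sketch under stated assumptions: a single sagitta measurement with error $\Delta s$, the standard relation $p_T\,[\mathrm{GeV}/c] = 0.3\,B\,[\mathrm{T}]\,R\,[\mathrm{m}]$ for unit charge, and no multiple scattering. The function names are illustrative, and the formula is not claimed to be the one ALEPH used.

```python
K_GEV_PER_T_M = 0.299792458  # p_T [GeV/c] = K * B [T] * R [m] for unit charge

def sagitta_m(pt_gev, b_tesla, length_m):
    """Sagitta s = l^2 / (8 R) of a track of chord length l (small-angle approximation)."""
    radius = pt_gev / (K_GEV_PER_T_M * b_tesla)
    return length_m**2 / (8.0 * radius)

def dpt_over_pt2(delta_s_m, b_tesla, length_m):
    """Since s is proportional to 1/p_T, delta(p_T)/p_T^2 = 8 delta(s) / (K B l^2)."""
    return 8.0 * delta_s_m / (K_GEV_PER_T_M * b_tesla * length_m**2)

resolution = dpt_over_pt2(delta_s_m=100e-6, b_tesla=1.5, length_m=1.4)
print(f"delta(p_T)/p_T^2 = {resolution:.2e} (GeV/c)^-1")
print(f"sagitta of a 10 GeV/c track: {1000.0 * sagitta_m(10.0, 1.5, 1.4):.1f} mm")
```

With $B = 1.5$ T, $\Delta s = 100$ μm and $l = 1.4$ m this estimate gives about $9 \times 10^{-4}\ (\mathrm{GeV}/c)^{-1}$, in line with the quoted value.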

d) Energy Vectors Calorimeters are used to measure energies of charged as well as neutral particles and are usually placed outside the tracking chambers. They are generally modularized to obtain space point information. One module is usually designed geometrically to have a fixed interval of rapidity $\Delta\eta$ and azimuthal angle $\Delta\phi$ so that counting rate per module becomes uniform. The QCD hadron production rate is known to be roughly constant in rapidity $\eta$ or pseudorapidity $\eta'$, which is related to polar angle by

$$\eta = \frac{1}{2}\ln\frac{1+\beta_\parallel}{1-\beta_\parallel} \simeq \frac{1}{2}\ln\frac{1+\cos\theta}{1-\cos\theta} = -\ln\tan\frac{\theta}{2} \tag{12.100}$$
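
As a numerical illustration of Eq. (12.100), the sketch below evaluates the pseudorapidity in both forms and compares it with the rapidity of a relativistic particle. The function names are chosen here for illustration; the 45° angle is the barrel and endcap boundary mentioned earlier.

```python
import math

def pseudorapidity(theta_rad):
    """eta' = -ln tan(theta / 2)."""
    return -math.log(math.tan(theta_rad / 2.0))

def pseudorapidity_from_cos(theta_rad):
    """Equivalent form: (1/2) ln[(1 + cos theta) / (1 - cos theta)]."""
    c = math.cos(theta_rad)
    return 0.5 * math.log((1.0 + c) / (1.0 - c))

def rapidity(beta_parallel):
    """eta = (1/2) ln[(1 + beta_par) / (1 - beta_par)]."""
    return 0.5 * math.log((1.0 + beta_parallel) / (1.0 - beta_parallel))

theta = math.radians(45.0)
print(f"eta'(45 deg) = {pseudorapidity(theta):.4f}")
# For a particle with beta = 0.999 the rapidity is already close to eta'.
print(f"eta (beta = 0.999, 45 deg) = {rapidity(0.999 * math.cos(theta)):.4f}")
```

The barrel region $45^\circ < \theta < 135^\circ$ therefore corresponds to roughly $|\eta'| < 0.88$.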



In ALEPH, each module covers an area of 3 × 3 cm² or a solid angle of $0.8^\circ \times 0.8^\circ$ and the modules are arranged in a lattice. A position resolution of 1.5 mm in determining the shower generation point was obtained. By connecting this point with the interaction point the direction and energy (i.e. energy vector) of a particle can be measured. Calorimeters are physically and functionally divided into two parts, the electromagnetic calorimeter placed inside and the hadronic calorimeter placed outside the superconducting coil. The former is composed of a lead/scintillation counter sandwich, which has a total of 23 radiation lengths, whereas the latter is a sandwich of iron plates with 5-10 cm thickness and streamer tubes (gas tubes that work in streamer mode). The total thickness is 7.6 absorption lengths ($\lambda$), or $\sim 1.2$ m of iron. The iron absorber plays a double role as the return yoke of the magnetic flux created by the superconducting coil. As described, a particle's momentum is measured by the track chamber, its energy by the calorimeter and, using the $E/p$ method, electrons ($\gamma$'s too if there are no tracks) and hadrons are separated. In ALEPH, the contamination of hadrons in the electron signal was less than $10^{-3}$. The energy resolution was $\Delta E/E = (1.7 + 19/\sqrt{E\,[\mathrm{GeV}]})\%$ for the electromagnetic calorimeter and $\sim 60\%/\sqrt{E\,[\mathrm{GeV}]}$ for the hadronic calorimeter.

e) Particle Identification Depending on the physics aim of the detector, a particle identification device such as a transition radiation counter to separate $e$ from $\pi$ or a RICH counter to identify hadron species is usually inserted between the tracking chamber and the calorimeter. ALEPH's TPC serves a double role as a tracking chamber and a particle identification device.

f) Muon Identification The muon can be identified as a noninteracting long track. Therefore a track detector outside the magnet yoke, sometimes with additional absorbers totalling $> 10\lambda$, serves as the muon identifier. The only other particles that can penetrate this much absorber are neutrinos or as yet undiscovered weakly interacting particles. Backgrounds for muons directly produced in the reaction are decay muons from pions and false signals from pions, i.e. the small albeit non-negligible fraction of pions that survive the absorbers.

g) Missing Energy In the center of mass system of the colliding particles, the total 
sum of momenta should vanish. Missing energy is defined as the vector sum of all 
the energy vectors of charged as well as neutral particles but with the sign inverted. 
As each hadronic particle is not identified, they are assumed to be pions or zero- 
mass particles. Then the energy vector is the same as the momentum vector. When 
it is known that only one neutrino is missing, the missing energy vector is that of the neutrino. A reaction to produce $W$ ($u + \bar{d} \to W^+ \to e^+(\mu^+) + \nu$) was measured this way and the mass of $W$ was determined.

New particles that are predicted by theoretical models, for example supersym- 
metry (SUSY) particles, also give a missing energy accompanied by (an) energetic 
lepton(s) or mono-jet. The accurate measurement of the missing energy is an im- 
portant element of a general-purpose detector. This is why the hermeticity of the 
detector is important. Still, energies that escape to the beam pipe are sizeable and 
only the transverse component of the energy vectors can be determined. 
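
The definition above can be sketched in a few lines. This is an illustrative example only: particles are treated as massless, so the energy vector equals the momentum vector as in the text, and the input format (energy, polar angle, azimuthal angle) and function name are assumptions made here.

```python
import math

def missing_transverse_energy(particles):
    """Negative vector sum of the transverse energy vectors.

    `particles` is a list of (energy, theta, phi) tuples, with angles in radians.
    Returns the (x, y) components of the missing transverse energy.
    """
    sum_x = sum(e * math.sin(th) * math.cos(ph) for e, th, ph in particles)
    sum_y = sum(e * math.sin(th) * math.sin(ph) for e, th, ph in particles)
    return (-sum_x, -sum_y)

# A balanced back-to-back pair gives no missing energy ...
balanced = [(40.0, math.pi / 2, 0.0), (40.0, math.pi / 2, math.pi)]
print(missing_transverse_energy(balanced))

# ... while a lone lepton implies an unseen particle recoiling against it.
lone_lepton = [(40.0, math.pi / 2, 0.0)]
print(missing_transverse_energy(lone_lepton))
```

Only transverse components are used because, as noted above, energy escaping along the beam pipe makes the longitudinal sum unreliable.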

h) Luminosity Measurement In addition to the main detector, small luminosity 
counters made of a tracking chamber and a calorimeter are placed in the for- 
ward and backward regions and measure well-known reactions such as small-angle 
Bhabha ($e^-e^+$ elastic) scattering for electron colliders. As the cross section is well known and calculable, the counting rate gives the intensity of the incident beams, i.e. the luminosity. This is necessary to measure the absolute cross section of a reaction.

i) Summary A collider detector has all the properties described above, and can serve as a general-purpose detector. If the total energy $\sqrt{s}$ is known, reactions that can be induced can more or less be predicted and similarity in the designs of different groups is conspicuous. For instance, at the LEP/CERN collider, $\sqrt{s} = m_{Z^0} = 91$ GeV and four detectors, ALEPH, DELPHI, L3 and OPAL, were constructed. The major goals, to measure the mass, decay width and modes of $Z^0$ accurately, were common to all and all the detectors were capable of doing it. Therefore, from the
standpoint of testing the validity of the Standard Model, differences between the 
detector performances can be considered small. The difference, however, appears 
in the details of the detector component, depending on whether one wants to make 
precision measurements of leptonic modes or to carry out detailed tests of QCD or 
whatever. As a result of this, sensitivity to or the potential for in-depth investigation 
of a new phenomenon may be different. How much effort is put into apparently irrelevant functions of detectors strongly reflects the group's philosophy and insight.

12.7 Statistics and Errors

12.7.1 Basics of Statistics

Particle reactions are statistical phenomena. Detectors are devices that convert a particle's passage into signals that a human can recognize, and their performance is also dictated by statistics. We describe here basic concepts of statistical phenomena to understand the quality of measurements and the errors associated with them. Frequently used formulas that are stated in standard textbooks of statistics and probability are given here without proof. Proofs are given only when they are specially connected to particle measurements.

There are no measurements without errors. They come in two kinds, systematic and statistical. Systematic errors are associated with the setup of reference points of measurement and make measured values either overestimated or underestimated. When the source of the error is understood, it can be removed in principle. But some part of it always remains unexplained or uncontrollable. For instance, we say the peak of Mt. Chomolungma is 8848 m above sea level. The reference point is at sea level $h = 0$, but its actual value could be either $+h$ or $-h$ and we never know its exact value. However, by careful arrangement, it is possible to set an upper limit to it, i.e. $|h| < \varepsilon$, and $\varepsilon$ is referred to as the systematic error in the measurement of mountain heights. The systematic error has its origin in the setup of each experiment (or an experimenter's skill) and cannot be discussed in an unbiased way. Therefore, we only discuss statistical errors. Their origin inherently lies in the measured object itself and/or in the measuring devices. The measured value fluctuates irregularly (at random) around a true value. Repeated measurement can decrease the size of statistical errors, but reducing them below the systematic error is fruitless, so measurements are usually stopped when the statistical error becomes comparable to the systematic error.

The behavior of errors or their distribution depends on the circumstances. When 
the probability function of events is unknown, a normal distribution, to be de- 
scribed later, is assumed. With such a blind assumption, usually no serious mis- 
takes are induced in most circumstances because of 

1. the law of large numbers,

2. the central limit theorem, and

3. general properties of the normal distribution.

Law of Large Numbers Given a random variable $x$ with a finite expected value ($\mu$), if its values are repeatedly sampled (sample value $x_i$), as the number $n$ of these observations increases, their mean will tend to approach and stay close to the expected value. Namely,

$$\lim_{n\to\infty} \sum_{i=1}^{n} \frac{x_i}{n} \to \mu \tag{12.101}$$

Central Limit Theorem The central limit theorem states that as the sample size $n$ increases the distribution of the sample average of these random variables approaches the normal distribution with a mean $\mu$ and variance (to be described soon) $\sigma^2/n$ irrespective of the shape of the original distribution.

However, one has to be careful because very often one tends to put too much confidence in the omnipotence of the normal distribution.
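
Both statements can be illustrated with a small simulation. The sketch below is an assumption-laden toy: it draws from a uniform distribution on [0, 1) (mean 1/2, variance 1/12), which is about as non-Gaussian as a bounded distribution gets, and checks that the sample means scatter around 1/2 with variance close to $\sigma^2/n$. The sample sizes and the seed are arbitrary choices.

```python
import random
import statistics

def sample_means(n, trials, rng):
    """Return `trials` averages, each over n uniform random numbers in [0, 1)."""
    return [sum(rng.random() for _ in range(n)) / n for _ in range(trials)]

rng = random.Random(12345)           # fixed seed so the run is reproducible
n, trials = 50, 2000
means = sample_means(n, trials, rng)

expected_mean = 0.5                  # mean of the uniform distribution
expected_var = (1.0 / 12.0) / n      # sigma^2 / n with sigma^2 = 1/12

print(f"mean of sample means: {statistics.mean(means):.4f} (expected {expected_mean})")
print(f"variance of sample means: {statistics.pvariance(means):.6f} (expected {expected_var:.6f})")
```

A histogram of `means` would look close to a Gaussian even though the parent distribution is flat.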

We first define some terminology.
Average Value or Mean If sample events described by a variable $x$ are realized independently according to a distribution function $f(x)$, the average $\mu$ is defined by

$$\mu = \int dx\, x f(x) = E(x) \tag{12.102}$$

with normalization

$$\int dx\, f(x) = 1 \tag{12.103}$$

Generally $E(g(x))$ is defined by

$$E(g(x)) = \int dx\, g(x) f(x) \tag{12.104}$$

When the variable $x$ is not continuous but discrete, replace $f(x_i)$ by $f_i$ and take the sum instead of the integral ($\int dx \to \sum_i$). In the following we take $x$ as a representative of both continuous and discrete variables unless otherwise stated.
Variance This is a measure of dispersion of the variable $x$ around the mean $\mu$ and is defined by

$$\sigma^2 = E((x-\mu)^2) = \int dx\,(x-\mu)^2 f(x) \tag{12.105}$$

$\sigma = \sqrt{\sigma^2}$ is called the standard deviation. It is easy to verify that

$$E((x-\mu)^2) = E(x^2) - E(x)^2 = E(x^2) - \mu^2 \tag{12.106}$$

Correlation When there are two statistical variables $x$, $y$ with means $\mu_x$, $\mu_y$ and standard deviations $\sigma_x$, $\sigma_y$, their covariance and correlation $r$ are defined by

$$\mathrm{cov}(x, y) = E((x-\mu_x)(y-\mu_y)) \tag{12.107a}$$

$$r = \frac{\mathrm{cov}(x, y)}{\sigma_x \sigma_y} \tag{12.107b}$$

$r$ takes values $-1 \le r \le 1$. When there exists a linear relation such as $y = ax + b$, $|r| = 1$, and when $x$ and $y$ are completely independent, $r = 0$. "Independent" means the distribution function can be separated like $f(x, y) = g(x)h(y)$. Figure 12.44 shows some examples of correlation.

Figure 12.44 Examples of correlation.
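
The definitions of Eqs. (12.105) to (12.107) translate directly into code for a finite sample. The following sketch uses equal weights $1/n$ for each sample point; the helper names and the toy data are illustrative.

```python
import math

def mean(values):
    return sum(values) / len(values)

def covariance(xs, ys):
    """cov(x, y) = E((x - mu_x)(y - mu_y)) for equally weighted samples."""
    mu_x, mu_y = mean(xs), mean(ys)
    return sum((x - mu_x) * (y - mu_y) for x, y in zip(xs, ys)) / len(xs)

def correlation(xs, ys):
    """r = cov(x, y) / (sigma_x sigma_y); note that cov(x, x) is the variance."""
    return covariance(xs, ys) / math.sqrt(covariance(xs, xs) * covariance(ys, ys))

xs = [-2.0, -1.0, 0.0, 1.0, 2.0]
print(correlation(xs, [2.0 * x + 1.0 for x in xs]))   # exact linear relation
print(correlation(xs, [-3.0 * x + 2.0 for x in xs]))  # linear with negative slope
print(correlation(xs, [x * x for x in xs]))           # y fully determined by x
```

The last case gives $r = 0$ although $y = x^2$ is completely determined by $x$: independence implies $r = 0$, but the converse does not hold.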

Normal Distribution The normal distribution is also called the Gaussian distribution. It is the most frequently occurring function in all statistical phenomena. Generally, measured values and detector fluctuation (errors) can be considered as obeying the normal distribution. The functional form with mean $\mu$ and standard deviation $\sigma$ is given by

$$G(\mu, \sigma) = \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[-\frac{(x-\mu)^2}{2\sigma^2}\right] \tag{12.108}$$

Figure 12.45 shows its functional form. The area in the $1\sigma$ range ($\mu - \sigma < x < \mu + \sigma$) is 68.3%, $2\sigma$ 95.5%, and $3\sigma$ 99.7%.17) In the figure, 90% of the total area is shown as a white area with boundary at $|x - \mu| = 1.64\sigma$. It is worth remembering that the probability of the variable $x$ deviating more than one standard deviation is as large as about 32%.
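
These areas can be reproduced with the error function, since for a Gaussian the probability of $|x - \mu| < n\sigma$ is $\mathrm{erf}(n/\sqrt{2})$. A minimal sketch using only the Python standard library (the function name is an illustrative choice):

```python
import math

def area_within(n_sigma):
    """Probability that a Gaussian variable lies within n_sigma of its mean."""
    return math.erf(n_sigma / math.sqrt(2.0))

for n in (1.0, 2.0, 3.0, 1.64, 1.96):
    inside = area_within(n)
    print(f"|x - mu| < {n:4.2f} sigma: {100.0 * inside:6.2f}% inside, "
          f"{100.0 * (1.0 - inside):5.2f}% outside")
```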




Another measure of dispersion is the full width at half maximum (FWHM), which is often quoted instead of the standard deviation. This is the $x$ range where the probability exceeds half the maximum value. FWHM is denoted as $\Gamma$ and for the normal distribution is given by

$$\Gamma = 2\sigma\sqrt{2\ln 2} = 2.35\sigma \tag{12.109}$$

17) Conversely, the values of $n$ ($|x - \mu| < n\sigma$) that give 90%, 95% and 99% area are 1.64, 1.96 and [...]

Overlap of the Gaussian Distribution Let there be two independent variables $x_1$, $x_2$, both following Gaussian distributions with mean 0 and variance $\sigma_1^2$, $\sigma_2^2$, and measured as the combination

$$x = a x_1 + b x_2 \tag{12.110}$$

Then their overlap will obey the distribution

$$G(x) = \int dx_1\, dx_2\, \delta(x - a x_1 - b x_2)\, \frac{1}{\sqrt{2\pi}\,\sigma_1} \exp\left[-\frac{x_1^2}{2\sigma_1^2}\right] \frac{1}{\sqrt{2\pi}\,\sigma_2} \exp\left[-\frac{x_2^2}{2\sigma_2^2}\right] \tag{12.111}$$

Its Laplace transform is given by

$$\int dx\, e^{-\lambda x}\, G(x) = \exp\left[\frac{\lambda^2}{2}\left(a^2\sigma_1^2 + b^2\sigma_2^2\right)\right] \tag{12.112}$$

This is the Laplace transform of the Gaussian with the standard deviation

$$\sigma = \sqrt{a^2\sigma_1^2 + b^2\sigma_2^2} \tag{12.113}$$

Namely, the error of an overlap of two independent statistical phenomena is given by the square root of the weighted sum of the errors squared. If the difference instead of the sum is taken in Eq. (12.110), the error is still given by the same formula.

Poisson Distribution Uniformly distributed but independent events obey the Poisson distribution. Reactions and decays in particle physics, the number of ions per unit length created by passing charged particles, electrons emitted from a heated metal surface or the photocathode of photomultipliers, all of which are intimately connected with detector operations, obey the Poisson distribution. If we let the average occurrence of an event per unit time be $\mu$, then the probability that it occurs $r$ times in time $t$ is given by

$$P(\mu; r) = \frac{(\mu t)^r}{r!}\, e^{-\mu t} \tag{12.114}$$

This can be explained as follows. Let us assume that the event occurred $M$ times in a time interval $T$. Both $T$ and $M$ are taken to be large enough. Then the probability that $r$ events fall in a small time interval $t$ ($t \ll T$) is given by $p^r (1-p)^{M-r}$, where $p = t/T$. As there are ${}_M C_r$ ways to sample $r$ out of $M$ events, the probability that $r$ events occur in the time interval $t$ is given by the binomial distribution

$$P(M; r) = {}_M C_r\, p^r (1-p)^{M-r} = \frac{M(M-1)\cdots(M-r+1)}{r!}\, p^r (1-p)^{M-r} = \frac{1\cdot(1-1/M)\cdots\{1-(r-1)/M\}}{r!}\, (Mp)^r \left(1 - \frac{Mp}{M}\right)^{M-r} \tag{12.115}$$

Taking the limits $M \to \infty$, $T \to \infty$, keeping $Mp = Mt/T \to \mu t$ fixed, and using $\lim_{M\to\infty}(1 - x/M)^M = e^{-x}$, we obtain the Poisson distribution Eq. (12.114). The mean occurrence $\bar{r}$ of events in unit time interval and the variance of the Poisson distribution are given by

$$\bar{r} = \sum_r r P(r) = \mu, \qquad \sigma^2 = \sum_r (r-\mu)^2 P(r) = \mu \tag{12.116}$$

Figure 12.46 shows the Poisson distributions for several values of $\mu$. Note the distribution is not symmetric around $\mu$. If we equate $\mu = n$, when $n$ is large the Poisson becomes the Gaussian with mean $n$ and variance $n$. We might as well say it is a good approximation already at $n = 10$ (see Fig. 12.46). This is an example of the central limit theorem.

Figure 12.46 Poisson distribution: $(\mu^r/r!)\,e^{-\mu}$.
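
The limiting procedure and the statements about mean, variance and the Gaussian approximation can be checked numerically. The sketch below is illustrative: $\mu$ here stands for the expected number of events in the interval (the product $\mu t$ of the text), and the particular values of $M$, $\mu$ and the summation cutoff are arbitrary choices.

```python
import math

def poisson_pmf(mu, r):
    """P(r) = mu^r e^{-mu} / r!, with mu the expected number of events."""
    return mu**r * math.exp(-mu) / math.factorial(r)

def binomial_pmf(big_m, p, r):
    """Probability of r successes out of big_m trials with success probability p."""
    return math.comb(big_m, r) * p**r * (1.0 - p)**(big_m - r)

def gaussian_pdf(x, mu, sigma):
    return math.exp(-(x - mu)**2 / (2.0 * sigma**2)) / (math.sqrt(2.0 * math.pi) * sigma)

mu = 3.0
# Binomial with large M and M p = mu is close to the Poisson value.
print(binomial_pmf(10000, mu / 10000, 2), poisson_pmf(mu, 2))

# Mean and variance of the Poisson distribution (sum truncated at r = 59).
mean = sum(r * poisson_pmf(mu, r) for r in range(60))
var = sum((r - mean)**2 * poisson_pmf(mu, r) for r in range(60))
print(mean, var)

# At n = 10 the Gaussian with mean n and variance n is already close.
print(poisson_pmf(10.0, 10), gaussian_pdf(10.0, 10.0, math.sqrt(10.0)))
```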

Problem 12.3

When $n$ is large, show that the Poisson distribution with mean $n$, variance $n$ approaches the Gaussian distribution:

$$\lim_{n\to\infty} f(r) = \frac{n^r}{r!}\, e^{-n} \to \frac{1}{\sqrt{2\pi n}} \exp\left[-\frac{(r-n)^2}{2n}\right] \tag{12.117}$$

Hint: Use Stirling's formula

$$n! \simeq \sqrt{2\pi}\, n^{n+1/2}\, e^{-n} \tag{12.118}$$

and expand $\ln f(r)$ in a Taylor series.

Example of Application: Deciding the Upper Limit of the Probability Value when no Events Are Observed When one tries to determine the validity of a certain theoretical formula, for instance the lifetime of double beta decay, and ends up with no observation, the upper limit of the rate or the lower limit of the lifetime can be determined as follows. Let the decay rate be $\lambda$. The probability that no events are observed during the observation time $T$ is given by the Poisson distribution as

$$P(\lambda; 0) = e^{-\lambda T} \tag{12.119}$$

The equation can be interpreted as the probability of observing no events as a function of $\lambda$. Therefore, the probability $\lambda < \lambda_u$ is given by

$$P(0 \le \lambda < \lambda_u) = T \int_0^{\lambda_u} d\lambda\, e^{-\lambda T} = 1 - e^{-\lambda_u T} \tag{12.120}$$

$T$ is the normalization constant when $\lambda$ is taken as the variable. We call the above equality the confidence level (CL) of $\lambda$ lying in the interval $0 \le \lambda < \lambda_u$,

$$\therefore\ \lambda_u = -\frac{1}{T}\ln(1 - \mathrm{CL}) \tag{12.121}$$

For CL = 90%, $\lambda_u = 2.3/T$. Namely, when no events are observed, the upper limit with 90% confidence level is given by the rate with 2.3 as the number of events.
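
Equation (12.121) is simple enough to evaluate directly. A minimal sketch follows; the function name and the example observation time are illustrative, and the lifetime limit assumes $\tau = 1/\lambda$.

```python
import math

def rate_upper_limit(obs_time, confidence_level=0.90):
    """Upper limit on the rate when no events are seen: -ln(1 - CL) / T."""
    return -math.log(1.0 - confidence_level) / obs_time

obs_time = 2.0  # observation time in arbitrary units (illustrative value)
for cl in (0.90, 0.95, 0.99):
    lam_u = rate_upper_limit(obs_time, cl)
    print(f"CL = {cl:.2f}: N_u = lambda_u T = {lam_u * obs_time:.2f}, "
          f"lifetime lower limit = {1.0 / lam_u:.3f}")
```

The equivalent number of events, $\lambda_u T$, depends only on the confidence level and equals about 2.3 at 90%.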
Simultaneous Observation of Two Independent Phenomena Suppose one is measuring the lifetime of a particle. The counts one obtains include noise of the detector, which is also statistical but of different and independent origin:

$$n = n_1(\text{particle}) + n_2(\text{noise}) \tag{12.122}$$

The probability of $n$ events being observed is given by

$$P(\mu_1, \mu_2; n) = \sum_{n_1 + n_2 = n} P(\mu_1; n_1)\, P(\mu_2; n_2) = \sum_{n_1 + n_2 = n} \frac{\mu_1^{n_1}}{n_1!} e^{-\mu_1}\, \frac{\mu_2^{n_2}}{n_2!} e^{-\mu_2} = \frac{(\mu_1 + \mu_2)^n}{n!}\, e^{-(\mu_1 + \mu_2)} = P(\mu_1 + \mu_2; n) \tag{12.123}$$

12.7.2 Maximum Likelihood and Goodness of Fit

a) Maximum Likelihood Method The real value $\mu$ and its variance of a measured object can never be known, but a good guess can be obtained by measurements. Suppose there are $n$ independent samples $x_i$ ($i = 1 \ldots n$), which we call a statistical variable obeying a distribution function $f(x; a)$, where $a$ is a parameter to be determined. The likelihood function is defined by

$$L(x_1, \ldots, x_n; a) = f(x_1; a)\, f(x_2; a) \cdots f(x_n; a) \tag{12.124}$$

It can also be considered as the probability that gives the observed values $(x_1, x_2, \ldots, x_n)$. If the function $L$ is analytic, the value of $a$ can be obtained by solving

$$\frac{dL}{da} = 0 \quad \text{or} \quad \frac{dL^*}{da} = 0, \qquad L^* \equiv \ln L \tag{12.125}$$

The two equations are equivalent in obtaining $a$ and either one is used depending on the situation. When there is more than one parameter, simultaneous equations are obtained. When the solution of Eq. (12.125) is written as $\hat{a}$, it is not the true $a$ but an estimated value. An estimated variance is given by

$$\hat{\sigma}^2 = \int dx_1 \cdots dx_n\, (\hat{a} - a)^2\, L(x_1, \ldots, x_n; a) \tag{12.126}$$

This, however, is not necessarily calculable unless the analytical form of $f(x; a)$ is known. When the probability function is given by the Gaussian, the likelihood function is given by

$$L = \prod_{i=1}^{n} \frac{1}{\sqrt{2\pi}\,\sigma} \exp\left[-\frac{(x_i - \mu)^2}{2\sigma^2}\right] \tag{12.127}$$

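If we write the estimate of $\mu$ as $\hat{\mu}$ and that of $\sigma^2$ as $\hat{\sigma}^2$, $L$ is maximized by (with $L^* = \ln L$)

$$\frac{\partial L^*}{\partial \mu} = \sum_i \frac{x_i - \mu}{\sigma^2} = 0 \quad\to\quad \hat{\mu} = \frac{1}{n}\sum_i x_i = \bar{x} \tag{12.128a}$$

$$\frac{\partial L^*}{\partial \sigma^2} = -\frac{1}{2}\left[\frac{n}{\sigma^2} - \sum_i \left(\frac{x_i - \mu}{\sigma^2}\right)^2\right] = 0 \quad\to\quad \hat{\sigma}^2 = \frac{1}{n}\sum_i (x_i - \mu)^2 \tag{12.128b}$$

As the real $\mu$ is unknown, it is replaced by $\hat{\mu}$. Then the degrees of freedom are reduced by 1 and it is known that

$$s^2 = \frac{1}{n-1}\sum_i (x_i - \bar{x})^2 = \frac{n}{n-1}\,\hat{\sigma}^2 \tag{12.129}$$

gives a better estimate of the variance. If $n$ is sufficiently large, the difference is small. The arithmetic average $\bar{x}$ is an estimate of $\mu$, but is not the real $\mu$. This means another $n$ samples will give a different $\bar{x}$. In this sense, $\bar{x}$ itself is a statistical variable. Consequently, the variance of $\bar{x}$ can be defined:

$$\sigma^2(\bar{x}) = E((\bar{x} - \mu)^2) = E\left[\left(\frac{1}{n}\sum_i x_i - \mu\right)^2\right] = \frac{1}{n^2}\sum_i E((x_i - \mu)^2) = \frac{\sigma^2}{n} \tag{12.130}$$

Namely $\sigma(\bar{x}) = \sqrt{\sigma^2(\bar{x})} = \sigma/\sqrt{n}$ gives the error associated with the arithmetic mean $\bar{x}$. $\sigma(\bar{x})$ is referred to as the standard error of the mean. As is clear from the above derivation, the equality $\sigma(\bar{x}) = \sigma/\sqrt{n}$ holds independently of the functional form of the distribution.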
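
For a concrete sample the three quantities above are easy to compute side by side. The sketch below uses a small made-up data set; the function name is illustrative, and the sample variance uses $\bar{x}$ in place of the unknown $\mu$.

```python
import math

def gaussian_estimates(samples):
    """Return (mean, ML variance, unbiased variance, standard error of the mean)."""
    n = len(samples)
    x_bar = sum(samples) / n
    sum_sq = sum((x - x_bar)**2 for x in samples)
    var_ml = sum_sq / n              # Eq. (12.128b) with mu replaced by x_bar
    var_unbiased = sum_sq / (n - 1)  # Eq. (12.129)
    std_error = math.sqrt(var_unbiased / n)   # sigma / sqrt(n)
    return x_bar, var_ml, var_unbiased, std_error

data = [4.8, 5.1, 5.3, 4.9, 5.4]   # hypothetical measurements
x_bar, var_ml, var_unbiased, std_error = gaussian_estimates(data)
print(f"mean = {x_bar:.3f}, ML variance = {var_ml:.4f}, "
      f"unbiased variance = {var_unbiased:.4f}, standard error = {std_error:.4f}")
```

With only five samples the two variance estimates differ by 25%, which shows why the $n - 1$ correction matters for small $n$.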

The reader is reminded here that $\hat{\sigma}^2$ is an estimate of $\sigma^2$, which is the real variance of the variable $x$, and not an estimate of the variance of $\bar{x}$. $\sigma$ decides the fluctuation of the variable $x_i$ distribution and does not change no matter how many samples are taken. $\hat{\sigma}^2$ approaches $\sigma^2$ as $n$ increases. Likewise $\bar{x}$ approaches the true $\mu$ as $n$ increases, and its standard deviation is given by $\sigma/\sqrt{n}$, which approaches 0 as $n$ increases.

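Now, note that at a point $\partial L^*/\partial\mu = 0$,

$$\frac{\partial^2 L^*}{\partial \mu^2} = -\frac{n}{\sigma^2} = -\frac{1}{\sigma^2(\bar{x})} \tag{12.131}$$

Equation (12.126) is a general expression, but when it cannot be calculated analytically one applies the central limit theorem and assumes the same formula as for the Gaussian distribution is valid. Namely,

$$\hat{\sigma}^2(\hat{a}) = \left[-\frac{\partial^2 L^*}{\partial a^2}\right]^{-1}_{a = \hat{a}} \tag{12.132}$$

Note, however, this is generally an approximation and valid only when the statistics are large. When more than one parameter is involved, an error matrix $U$ can be defined as the inverse of

$$(U^{-1})_{ij} = -\frac{\partial^2 L^*}{\partial a_i\, \partial a_j} \tag{12.133}$$

Then it can be shown that the diagonal elements $\sigma_i^2$ are equal to the variance of $\hat{a}_i$ and the nondiagonal elements are covariances. Namely

$$U = \begin{pmatrix} \sigma_1^2 & \mathrm{cov}(1,2) & \mathrm{cov}(1,3) & \cdots \\ \mathrm{cov}(1,2) & \sigma_2^2 & \mathrm{cov}(2,3) & \cdots \\ \vdots & & \ddots & \end{pmatrix} \tag{12.134}$$

We note that the estimate determined by the maximum likelihood method is invariant under transformation. That is, if $b = g(a)$, the best estimate of $b$ is given by $\hat{b} = g(\hat{a})$.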
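
The second-derivative recipe can be tried on a case where the answer is known in closed form. The sketch below is an illustration chosen here, not an example from the text: it assumes an exponential decay-time distribution $f(t; \tau) = e^{-t/\tau}/\tau$, for which $\hat{\tau}$ is the mean decay time and the curvature of $L^*$ gives $\sigma(\hat{\tau}) = \hat{\tau}/\sqrt{n}$. The decay times are made up.

```python
import math

def log_likelihood(tau, times):
    """L* = ln L for f(t; tau) = exp(-t / tau) / tau."""
    return sum(-math.log(tau) - t / tau for t in times)

def second_derivative(func, x, h=1e-4):
    """Central finite-difference estimate of the second derivative of func at x."""
    return (func(x + h) - 2.0 * func(x) + func(x - h)) / h**2

times = [0.5, 1.2, 0.3, 2.1, 0.9, 1.7, 0.4, 1.1]   # hypothetical decay times
n = len(times)

tau_hat = sum(times) / n   # solves dL*/dtau = 0 for the exponential distribution
curvature = second_derivative(lambda tau: log_likelihood(tau, times), tau_hat)
sigma_tau = math.sqrt(-1.0 / curvature)

print(f"tau_hat = {tau_hat:.4f}")
print(f"sigma from curvature = {sigma_tau:.4f}, tau_hat / sqrt(n) = {tau_hat / math.sqrt(n):.4f}")
```

For a sample this small the Gaussian approximation behind the recipe is rough, so the symmetric error should be read as indicative only.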



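b) Propagation of Errors One often wants to calculate another quantity using the obtained sample variables. Here we are interested in the error of $u = f(x, y)$ given $(x, y)$ with standard deviations $\sigma_x$, $\sigma_y$. The variance $\sigma_u^2$ of $u$ is given by

$$\sigma_u^2 = E((u - \bar{u})^2) \tag{12.135}$$

In the first approximation, $\bar{u} = f(\bar{x}, \bar{y})$. How the variable $u$ moves around $\bar{u}$ can be obtained by the Taylor expansion

$$u - \bar{u} \simeq (x - \bar{x})\frac{\partial f}{\partial x} + (y - \bar{y})\frac{\partial f}{\partial y} \tag{12.136}$$

Inserting the square of Eq. (12.136) in Eq. (12.135), we obtain

$$\sigma_u^2 = E\left[(x - \bar{x})^2\left(\frac{\partial f}{\partial x}\right)^2 + (y - \bar{y})^2\left(\frac{\partial f}{\partial y}\right)^2 + 2(x - \bar{x})(y - \bar{y})\frac{\partial f}{\partial x}\frac{\partial f}{\partial y}\right] \tag{12.137}$$

Using Eqs. (12.105) and (12.107), we obtain

$$\sigma_u^2 = \sigma_x^2\left(\frac{\partial f}{\partial x}\right)^2 + \sigma_y^2\left(\frac{\partial f}{\partial y}\right)^2 + 2\,\mathrm{cov}(x, y)\,\frac{\partial f}{\partial x}\frac{\partial f}{\partial y} \tag{12.138}$$

Namely, the variance of $u$ is the sum of the squares of its argument errors multiplied by their derivatives. The third term vanishes if the variables $x$ and $y$ are independent, but if there is correlation between the two, the propagation error is modified greatly from the sum of squares. In the following we ignore correlations.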
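
Equation (12.138) can be applied to any function once the partial derivatives are available. The following sketch approximates them by central finite differences, which is a convenience assumed here rather than anything prescribed by the text; the function name and step size are illustrative.

```python
import math

def propagate_error(func, x, y, sigma_x, sigma_y, cov_xy=0.0, step=1e-6):
    """Standard deviation of u = func(x, y) according to Eq. (12.138)."""
    dfdx = (func(x + step, y) - func(x - step, y)) / (2.0 * step)
    dfdy = (func(x, y + step) - func(x, y - step)) / (2.0 * step)
    variance = (sigma_x * dfdx)**2 + (sigma_y * dfdy)**2 + 2.0 * cov_xy * dfdx * dfdy
    return math.sqrt(max(variance, 0.0))   # guard against tiny negative rounding errors

# Sum of two independent variables: errors add in quadrature.
print(propagate_error(lambda x, y: x + y, 1.0, 2.0, 3.0, 4.0))
# Product of two independent variables.
print(propagate_error(lambda x, y: x * y, 2.0, 3.0, 0.1, 0.2))
# Difference of two fully correlated variables: the error cancels.
print(propagate_error(lambda x, y: x - y, 1.0, 1.0, 0.5, 0.5, cov_xy=0.25))
```

The last call shows how strongly a correlation can change the result compared with the plain sum of squares.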

Example 12.1 

±y 

(12.139) 

x + ° Y 

which reproduces Eq. (12.113). 

Consider an experiment with a neutron target. This is achieved by subtracting
proton target data from deuteron target data. Let $n_1$, $n_2$ be the number of events
from each experiment measured in the time intervals $t_1$, $t_2$. One wants to decide
the best balance in dividing the time into $t_1$ and $t_2$ with the total time $T = t_1 + t_2$
fixed. The counting rates are $r_1 = n_1/t_1$, $r_2 = n_2/t_2$ and $r_1 - r_2$ is the desired
number. As the distribution is Poissonian, the variance $\sigma^2$ of n is $\sigma^2 = n$ and the
error $\delta n = \sqrt{n}$. Using Eq. (12.139),

    $\sigma^2 = \sigma_{r_1}^2 + \sigma_{r_2}^2 = \dfrac{(\delta n_1)^2}{t_1^2} + \dfrac{(\delta n_2)^2}{t_2^2} = \dfrac{n_1}{t_1^2} + \dfrac{n_2}{t_2^2} = \dfrac{r_1}{t_1} + \dfrac{r_2}{t_2}$    (12.140)

We differentiate Eq. (12.140) with respect to $t_1$, using $t_2 = T - t_1$, put it equal to 0,
and obtain $t_1/t_2 = \sqrt{r_1/r_2}$.



Example 12.3: Asymmetry of Angular Distribution

Asymmetry of a distribution is a frequently measured quantity. Let R be the number
of counts in the interval $0 < \cos\theta < 1$ and L in $-1 < \cos\theta < 0$. We want to
estimate the error of its asymmetry

    $A = \dfrac{R - L}{R + L}$    (12.143)

    $\dfrac{\partial A}{\partial R} = \dfrac{2L}{(R + L)^2}, \qquad \dfrac{\partial A}{\partial L} = \dfrac{-2R}{(R + L)^2}$    (12.144)

As both L and R obey the Poisson distribution, $\sigma_R^2 = R$, $\sigma_L^2 = L$. Put $N = R + L$:

    $\sigma_A^2 = \dfrac{4L^2 R + 4R^2 L}{N^4} = \dfrac{4RL}{N^3} = \dfrac{1 - A^2}{N}$

When A is small, $\sigma_A \simeq 1/\sqrt{N}$.


The error has to be smaller than the value of the asymmetry itself. Let us define 
figure of merit (FOM) by 

FOM = — (12.14 

The number of events N to obtain FOM is



Let us further assume one wants at least FOM > 3. Then the required number of 
events is 

    $N = \begin{cases} 30\,000 & A = 0.01 \\ 9 & A = 0.5 \end{cases}$    (12.150)

The experimental time to achieve the desired FOM depends greatly on the value of 
the asymmetry itself. 
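
The error on the asymmetry can be checked numerically. The sketch below propagates the Poisson errors of R and L through Eq. (12.144) and compares the result with the closed form $(1 - A^2)/N$. The function name and the counts R = 600, L = 400 are hypothetical values chosen only for illustration.

```python
import math

def asymmetry_with_error(R, L):
    """Return (A, sigma_A) for counts R and L; an illustrative helper."""
    N = R + L
    A = (R - L) / N
    dA_dR = 2 * L / N ** 2        # Eq. (12.144)
    dA_dL = -2 * R / N ** 2       # Eq. (12.144)
    # Poisson statistics: variance of R is R, variance of L is L.
    variance = R * dA_dR ** 2 + L * dA_dL ** 2
    return A, math.sqrt(variance)

A, sigma_A = asymmetry_with_error(600, 400)
closed_form = math.sqrt((1 - A ** 2) / (600 + 400))
print(A, sigma_A, closed_form)
```

The two error estimates agree, as they must, since $4RL = N^2(1 - A^2)$.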
12.7.3

Least Squares Method 



Consider a function $y = f(x; a_1, a_2, \dots, a_m)$ having x as a variable. We want to
determine the values of the m parameters $a_j$. Assume a measurement was carried out at n points
$x_i$ (i = 1, . . . , n) and measured values $y_i$ and their errors $\sigma_i$ have been obtained.
The least squares method minimizes the following quantity:

    $S = \sum_{i=1}^{n} \left[\dfrac{y_i - f(x_i; a_1, \dots, a_m)}{\sigma_i}\right]^2$    (12.151)


In order to obtain a meaningful result the number n must be larger than the number
of parameters to be determined. If the distribution is Gaussian with the mean
$f(x_i; a_1, \dots, a_m)$, it is easy to prove the result agrees with that given by the
maximum likelihood method. In this case S is called the chi square ($\chi^2$) distribution.
Consequently, it is often said that the least squares method minimizes the $\chi^2$
distribution. If the $y_i$ distribution is not Gaussian, the statement is not exact, but is
not far from the truth because of the central limit theorem. Note we have assumed
no errors in determining $x_i$ in Eq. (12.151). If errors have to be assigned,

    $\sigma_i^2 = \sigma_{y_i}^2 + \left(\dfrac{\partial f}{\partial x}\right)^2_{x = x_i} \sigma_{x_i}^2$    (12.152)

has to be used instead of $\sigma_{y_i}^2$. To obtain the best fit, S has to be minimized; the
conditions

    $\dfrac{\partial S}{\partial a_j} = 0 \qquad (j = 1, \dots, m)$    (12.153)

give associated equations to be solved. When analytical solutions are not available,
numerical calculations using computers have to be carried out. With similar arguments
as before, the error matrix U of $a_j$ is given by

    $(U^{-1})_{jk} = \dfrac{1}{2}\dfrac{\partial^2 S}{\partial a_j \partial a_k}$    (12.154)

If S is nonlinear, computer calculation is the only way. In this case, the error matrix 
has to be solved numerically, too. A way to determine the variance is again guided 
by relations of the Gaussian distribution. When the distribution is one-dimensional 
Gaussian, we have 



Taylor expanding S around the minimum, we obtain 



    $S(a) \simeq S(\hat{a}) + \dfrac{1}{2}(a - \hat{a})^2 \dfrac{\partial^2 S}{\partial a^2} = S(\hat{a}) + \dfrac{1}{\sigma^2}(a - \hat{a})^2$

Therefore, we have the equality

    $S(\hat{a} + \sigma) = S(\hat{a}) + 1$    (12.157)




Generalizing the relation, we may define the standard error $\sigma$ as the deviation of
a from $\hat{a}$ that gives S the value $S_{min} + 1$. Equation (12.155) also shows how to obtain
$\sigma$ numerically. When there are two parameters to be determined, a closed contour
where $S = S_{min} + 1$ gives an error region within $1\sigma$ in the $a_1$-$a_2$ plane.
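
The $S_{min} + 1$ rule can be illustrated with a one-parameter fit. The following sketch assumes a straight line through the origin, y = a x, for which S is exactly quadratic in a; the data points and errors are invented for illustration and do not come from any experiment.

```python
import math

# Hypothetical measurements y_i +- sig_i at points x_i (illustration only).
xs = [1.0, 2.0, 3.0, 4.0]
ys = [2.1, 3.9, 6.2, 7.8]
sig = [0.2, 0.2, 0.3, 0.3]

def S(a):
    """Least squares sum of Eq. (12.151) for the model f(x; a) = a * x."""
    return sum(((y - a * x) / s) ** 2 for x, y, s in zip(xs, ys, sig))

# Setting dS/da = 0 gives the analytic minimum for this linear model.
w = sum((x / s) ** 2 for x, s in zip(xs, sig))
a_hat = sum(x * y / s ** 2 for x, y, s in zip(xs, ys, sig)) / w

# One-parameter form of Eq. (12.154): 1/sigma^2 = (1/2) d^2S/da^2 = w.
sigma_a = 1 / math.sqrt(w)

delta_S = S(a_hat + sigma_a) - S(a_hat)
print(a_hat, sigma_a, delta_S)
```

For a model linear in its parameter the increase `delta_S` equals 1 up to rounding; for a nonlinear model the same scan of S around its minimum would define $\sigma$ numerically.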

$\chi^2$ Distribution The chi square ($\chi^2$) distribution is a theoretical probability
distribution in inferential statistics, e.g., in statistical significance tests. The $\chi^2$
distribution with n degrees of freedom using x (x > 0) as its variable is given by

    $K_n(x) = \dfrac{x^{n/2 - 1} e^{-x/2}}{2^{n/2}\,\Gamma(n/2)}$    (12.158)

where $\Gamma(x)$ is the Gamma function. The function's form is given in Fig. 12.47.

Suppose the distribution function of a certain variable $\mu$ is given by $F(\mu, a_j)$,
where $a_j$ are m parameters one wants to determine by an experiment. Let $y_i$ be
measured points of $\mu$ obeying the normal distribution $G(\mu_i, \sigma_i^2)$. Then $\chi^2$ defined
by

    $\chi^2 = \sum_{i=1}^{n} \dfrac{(y_i - \mu_i)^2}{\sigma_i^2}$    (12.159)


obeys Eq. (12.158). The $\chi^2$ distribution is referred to as the goodness of fit and
is a measure of how well the determined parameters $a_j$ reproduce the measured
values. The measured n values $y_i - \mu_i$ (i = 1, . . . , n) are independent variables,
but if the average $\bar{y}$ is substituted for $\mu_i$, $y_i - \bar{y}$ are n - 1 independent parameters
and n - 1 should be used as the degrees of freedom in applying the $\chi^2$ test.
When there are m parameters to be determined, the number of degrees of freedom
becomes n - 1 - m. The confidence level CL is given by

    $CL(S) = \int_S^{\infty} dx\, K_n(x)$    (12.160)





Figure 12.47 $\chi^2$ distribution: $K_n(x) = (x^{n/2-1} e^{-x/2})/(2^{n/2}\Gamma(n/2))$ and the reduced $\chi^2$ is equal
to $\chi^2/n$ for n degrees of freedom. The curves are given for given values of CL.
Usually, taking CL = 5% and when $\chi^2 < S$, the fit is approved as good. As a
rough estimate, if $\mu$ is considered as a good representative, $(y_i - \mu)^2$ should take the
approximate value $\sigma^2$ and so $\chi^2/(n - 1) \simeq 1$ is expected. If the value of $\chi^2$ is too far
away, the original theoretical assumption has to be reconsidered.
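
To make the confidence level concrete, the following sketch evaluates $K_n(x)$ of Eq. (12.158) and the tail integral of Eq. (12.160) with the standard library only. The function names are illustrative, and the tail is obtained from the power series of the regularized incomplete gamma function, an implementation choice made here that is adequate for moderate values of S but loses precision for very large ones.

```python
import math

def chi2_pdf(x, n):
    """K_n(x) of Eq. (12.158) for n degrees of freedom."""
    return x ** (n / 2 - 1) * math.exp(-x / 2) / (2 ** (n / 2) * math.gamma(n / 2))

def confidence_level(S, n, terms=200):
    """Integral of K_n(x) from S to infinity, i.e. 1 - P(n/2, S/2)."""
    a, x = n / 2, S / 2
    # Series: P(a, x) = x^a e^(-x) * sum_k x^k / Gamma(a + k + 1).
    term = 1.0 / math.gamma(a + 1)
    total = term
    for k in range(1, terms):
        term *= x / (a + k)
        total += term
    return 1.0 - x ** a * math.exp(-x) * total

# For n = 2 the tail is exp(-S/2), so S = 2 ln 20 corresponds to CL = 5%.
cl_two = confidence_level(2 * math.log(20), 2)
# For n = 1 the familiar 5% point lies near S = 3.84.
cl_one = confidence_level(3.841, 1)
print(cl_two, cl_one)
```

With these helpers, a fit with n degrees of freedom and measured $\chi^2$ can be compared directly with a chosen threshold such as CL = 5%.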



Part Two A Way to the Standard Model 



13 
Spectroscopy 



Spectroscopy originally meant the use of a prism to disperse visible light and an- 
alyze a complex wave form as a function of its wavelengths. The concept was ex- 
panded to include any measurement of observables as a function of wavelength 
or oscillation frequency. As the frequency is equivalent to energy in quantum me- 
chanics, collecting and sorting of energy levels in any phenomenon is also called 
spectroscopy. It is an indispensable procedure in science to elucidate the under- 
lying basic rules from observation of seemingly complex phenomena. The role of 
the periodic table in understanding atomic structure and radiation spectra in clar- 
ifying the nature of quantum mechanics come to mind immediately. In a similar 
vein, collecting the masses of hadrons, including both elementary particles and 
resonances, which were thought of as dynamic processes of interacting elementary 
particles, and determining their spin, parity and other quantum numbers were es- 
sential procedures in reaching the quark structure of the hadrons. Later on, we will 
deal with level structures of quark composites, but here we follow history to learn 
a heuristic way to reach the truth. The first step is to collect facts. The second step 
is to sort them out, find a regularity and see if they can be reproduced by a simple 
model. The third step is to elucidate the underlying law that governs the dynamics 
and reach the truth. 1)

Discoveries in the pre-accelerator era were made using isotopes and cosmic rays. 
That was the age of hunting particles, exciting but very much dependent on luck. 
With the advent of large accelerators - Cosmotron (3.3 GeV, 1953-1968), Bevatron
(6.2 GeV, 1954-1971), PS (28 GeV, 1959-) and AGS (33 GeV, 1960-) - the age of
harvesting particles had arrived. It became possible to plan and prepare experiments
in an organized way, controlling the particle's conditions. 

In this chapter, we describe how to extract information on hadrons ($\pi$, nucleon,
K, $\rho$, $\omega$, etc.) and resonances and determine their properties. This is the first step
described above. We start from the basic structure of the interactions between pions
and nucleons, which are representatives of the meson and baryon families of
the hadron. Analysis of hadron interactions exemplified by $\pi$-N scattering is essential
in order to understand their dynamics. By analyzing their behavior we can
extract resonances and determine their spin-parity as well as other features. Classification
of resonances and construction of a model are the second step, which will
be discussed in the next chapter. A brief description of low-energy nuclear forces
and the high-energy behavior of hadronic cross sections is given in the last part of
this chapter.

1) This is Sakata's syllogism that prevailed in his school, whose pupils proposed IOO symmetry [212]
(flavor SU(3) of the Sakata model [339]), a predecessor of the quark model, lepton-baryon
symmetry, the fourth quark [271] and the Kobayashi-Maskawa model [239].



13.1 

Pre-accelerator Age (1897-1947) 

The history of particle physics as a research discipline may be said to have begun 
with Fermi's (1934) and Yukawa's (1935) proposal of the weak and strong (meson) 
theories. However, the first discovery of an elementary particle as we know it today 
dates back to as early as 1897, when J.J. Thomson discovered the electron, although 
the early history of the particle is more closely related to atomic structure and the 
quantum concept of particle-wave duality. The photon was recognized as a particle 
following Einstein's light quantum hypothesis in 1905 and subsequent demonstration
of Compton scattering in 1925. The discovery of the neutron in 1932 showed
that the nucleus consists of particles. It triggered Fermi's and Yukawa's applications
of field theory to the weak and strong forces, incorporating particle creation
and annihilation. In their theories, new particles (the neutrino and the meson) are 
claimed as indispensable players in the dynamical phenomena. Actually, the neu- 
trino was proposed by Pauli in 1930 to reconcile apparent violations of energy and 
angular momentum conservation in beta decays. Fermi included it and applied the 
quantized field theory invented by Heisenberg and Pauli to beta decay. Yukawa's 
treatment in his first paper was semiclassical but his idea of a quantum of a force 
carrier was revolutionary and remains one of the prime axioms in modern particle 
theory. 

It was the beginning of a new era in which people no longer hesitated to claim
new particles to explain newly discovered phenomena. Until then, the change of
the concept of matter aroused by the development of quantum mechanics had
been so violent that people were more willing to abandon fundamental laws than to 
introduce dubious new particles. For instance, Bohr, who represented mainstream 
thinking at the time, opted to discard energy conservation in beta decay rather 
than introduce unknown new particles. That he did not like Yukawa's idea either 
was indicated in his comment "You like new particles so much" when Yukawa 
presented the new theory to him on his visit to Japan in 1937. 

Electron The first particle recognized as an elementary particle was the electron. It 
was known for a while as the cathode ray, which is emitted from a hot filament in a 
vacuum (low pressure to be exact) glass tube when high voltage is applied between 
an anode plate and the cathode. In fact, Hertz, who proved the existence of the 
electromagnetic wave experimentally, knew it. He erroneously concluded that the 
cathode ray was not charged. It was only after vacuum technology improved that the 
Figure 13.1 The apparatus with which
J.J. Thomson discovered the electron. A high 
voltage is applied between the cathode and 
the anode plate. The emitted cathode ray goes 
through a hole in the anode, is collimated by 



rescent point. When a transverse electric field 
is applied across two plates in the middle (de- 
flector), the cathode ray can be deflected [366]. 



deflection of the cathode ray was detected by J.J. Thomson in 1897 in the apparatus 
shown in Fig. 13.1 [366]. 

The cathode ray, after going through collimators, is deflected by a transverse elec- 
tric field. From the direction of the deflection the charge can be determined. The 
beam can also be deflected by a magnetic field created by two coils applied perpen- 
dicularly to both the beam and the electric field. By adjusting the field strength to 
compensate the deflection by the electric field, the velocity of the particles can be 
measured. The value of e/m can be determined by the deflection angle created by
the electric field. Its value was more than a thousand times larger than that of the
hydrogen ion (the proton) measured in electrolysis. Thus Thomson concluded that the
cathode ray consisted of negative particles with mass much smaller than any known
matter. An accurate value of the electric charge was obtained by Millikan's oil drop 
experiment and the mass was determined. 

Atomic Structure Thomson proposed an atomic model that consists of a positive
charge cloud of size $\sim 10^{-8}$ cm in which electrons are embedded like the seeds
of a watermelon. The other model to be contrasted with Thomson's was Nagaoka's
saturnian model [284], in which electrons circulate around a positive nucleus. The
models could be tested by making an $\alpha$-particle scatter on matter. The $\alpha$-particle
was discovered by Becquerel, and by 1911 it was known to have charge +2e and
a mass four times that of hydrogen. Since the $\alpha$-particle is much heavier than the
electron, it would not be deflected by electrons. Deflection by the positive charge 
would also be weak in the Thomson model because of its extended distribution. 
Rutherford noticed that these characteristics of the Thomson model did not agree 
with observations. He further conjectured that if the Thomson model were right, 
large-angle scattering is possible only as an accumulation of small scattering an- 
gles and the distribution would be Gaussian. On the other hand, if the charge is 
concentrated in a small region and the target mass is large, when the $\alpha$-particle hits
the atom head on it will be repelled back. If the collision is not head-on but within
a small transverse distance (impact parameter), the deflection would still be large.
Figure 13.2 shows trajectories (magnified 200 billion times) that an incoming
particle makes for different impact parameters under the influence of the repulsive
Coulomb force.

Following Rutherford's suggestion, Geiger and Marsden carried out an experi- 
ment using the detector shown in Fig. 13.3 and obtained the angular distribution 
shown in Fig. 13.2, which agreed with Rutherford's formula given in Eq. (6.46). The 
formula was calculated classically assuming the scattering pattern in Fig. 13.2 and
a uniform impact parameter distribution. Rutherford was fortunate because it is
a rare example of the classical treatment producing the same result as quantum
mechanics.

Figure 13.2 (a) Trajectories of incoming particles with different impact parameters
scattered by the Coulomb force of the fixed scattering center. Consider it as a
$2 \times 10^{11}$ times magnified picture of an $\alpha$-particle scattered by an atom. When the
impact parameter is small, the $\alpha$-particle is deflected through a large angle.
(b) Data fit well with the Rutherford scattering formula. The dashed line indicates
the predicted scattering distribution for the Thomson model [158].

Figure 13.3 Side (a) and plan (b) view of Geiger and Marsden's apparatus: an $\alpha$
source, scattering gold foil, and a microscope whose front end is coated with
fluorescent material (ZnS) are contained in a metal cylinder. The source is
concentrated radon and $\alpha$-particles are incident on the foil at a right angle
through a slit. The source and the foil are fixed to the axis, but the cylinder
together with the microscope can be rotated around the axis so that the scattering
angle of the $\alpha$-particles can be varied. The measurement was carried out by
flickers of the fluorescence being observed by the human eye and counted [158].

The expected curve according to the Thomson model is depicted as a dashed
line. As we discuss later in Chap. 17 or more simply according to the Heisenberg
uncertainty principle, a measure of the minimum distance that the projectile can
penetrate into the target is given by the de Broglie wavelength of its momentum
transfer $Q = 2p\sin(\theta/2) \lesssim p$. Since the data agreed with a formula that assumes
a point target, the size of the nucleus is at most the inverse of the $\alpha$-particle
momentum. Assuming kinetic energy of 4 MeV, which gives $p \simeq 170$ MeV/c, one can
estimate the nuclear size to be at most $r \sim \hbar c/(170\ \mathrm{MeV}) \sim 10^{-13}$ cm, which is
$10^5$ times smaller than an atom. This experiment unambiguously established the
fact that the atom has a structure with positive charge concentrated in a region
$r < 10^{-13}$ cm and a surrounding electron cloud extending to $10^{-8}$ cm.
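
The numbers in this estimate are easy to reproduce. The sketch below is only an illustration: the constants ($\hbar c \simeq 197.327$ MeV fm and an $\alpha$-particle mass of about 3727.38 MeV/$c^2$) are standard values supplied here, not taken from the text.

```python
import math

HBARC_MEV_FM = 197.327    # hbar * c in MeV fm (assumed standard value)
M_ALPHA_MEV = 3727.38     # alpha-particle mass in MeV/c^2 (assumed standard value)

T = 4.0                                     # kinetic energy in MeV
p = math.sqrt(T * (T + 2 * M_ALPHA_MEV))    # relativistic momentum in MeV/c
r_fm = HBARC_MEV_FM / p                     # upper bound on the size, in fm
r_cm = r_fm * 1e-13                         # 1 fm = 1e-13 cm

print(p, r_fm, r_cm)
```

The momentum comes out close to 170 MeV/c and the length scale close to $10^{-13}$ cm, in agreement with the estimate above.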



Neutron In 1932, Irene and Frederic Joliot-Curie noticed that neutral particles
with strong penetrating power are emitted when $\alpha$-particles from polonium collide
with a beryllium target. Unfortunately, they mistook them for energetic gamma
rays. Chadwick, knowing Rutherford's idea of the possible existence of a neutral
heavy particle, a possible composite of the proton and the electron, proved in 1932
that the mass of the neutral particle of the Joliot-Curies was as large as that of the
proton. His apparatus is shown in Fig. 13.4. The mass measurement was done as
follows. In Newtonian mechanics, if a ball having mass m collides elastically with
another ball with the same mass at rest, the incident ball stops and the target moves on with
the same velocity as that of the incident ball. If the target ball has mass 14 m, the
velocity that the target obtains is 2/15 that of the incident ball. These numbers can
be obtained by a simple calculation using the laws of conservation of momentum and energy.
Placing a paraffin foil, which contains hydrogen, or a nitrogen ($m_N = 14 m_H$) gas
chamber in the path of the neutrons, the maximum velocities of the recoiling hydrogen
or nitrogen nuclei are those obtained from head-on collisions. The velocity can be
determined from the ionization loss of the particles, which is converted to electric
current, amplified and automatically recorded. It was the most advanced instrument
at the time. The ratio of the maximum recoil velocities determined by ionization loss,
nitrogen to hydrogen, was 2/15, confirming that the mass of the neutron was equal to
that of the proton.

Figure 13.4 Chadwick's apparatus for detecting the neutron. The neutron is knocked out
from Be by the $\alpha$-particle emitted from polonium. Various materials are placed in front
of the ionization chamber on the right, which measures the ionization loss of charged
particles scattered by the neutron [94].
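
The kinematics behind Chadwick's argument can be sketched in a few lines. For a head-on elastic collision of a projectile of mass m with a target of mass M at rest, Newtonian mechanics gives a recoil velocity 2m/(m + M) times the incident velocity. The function names below are illustrative, and masses are expressed in units of the hydrogen mass.

```python
from fractions import Fraction

def max_recoil_velocity(m_proj, m_target, v=1):
    """Target velocity after a head-on elastic collision (target at rest)."""
    return Fraction(2 * m_proj, m_proj + m_target) * v

def projectile_mass(m1, m2, ratio):
    """Solve v1/v2 = (m + m2)/(m + m1) = ratio for the projectile mass m."""
    return (m2 - ratio * m1) / (ratio - 1)

v_hydrogen = max_recoil_velocity(1, 1)     # equal masses
v_nitrogen = max_recoil_velocity(1, 14)    # target 14 times heavier

# Inverting the measured velocity ratio hydrogen/nitrogen = 15/2:
m_neutron = projectile_mass(1, 14, Fraction(15, 2))
print(v_hydrogen, v_nitrogen, m_neutron)
```

Run in reverse, a measured ratio of 15/2 between the hydrogen and nitrogen recoil velocities fixes the projectile mass at one hydrogen mass, which is the logic of the measurement described above.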

Positron Anderson's discovery of the positron (1933) has already been described 
in Chap. 4 in explaining the concept of the antiparticle. We only mention that the 
discovery was made by using a cloud chamber, which was invented by Wilson in 
1911. It played an important role also in discovering the muon and pion, as well 
as strange particles later. It is a sealed box containing supercooled, supersaturated 
water or alcohol vapor. When a charged particle goes through the chamber, it ion- 
izes the matter along its path. The ions act as condensation nuclei, around which 
a mist will form, creating a track just like an airplane contrail. The velocity and the 
momentum can be measured by the particle's ionization losses and the curvature 
of the track in the magnetic field. 

Muon and Pion The discovery by Anderson and Neddermeyer in 1937 [293] of
a particle with mass $\sim 200\,m_e$ in cosmic rays aroused a considerable amount of
excitement in view of Yukawa's prediction of the existence of a nuclear force carrier.
However, the particle had a long life, interacting weakly with matter, which cast
doubt on its role as the strong force carrier. It took a long and confusing time before
the issue was finally settled by Powell's demonstration of sequential decay ($\pi \to
\mu \to e$) on emulsion photographs (Fig. 13.5), thus establishing the pion as the
strong force carrier predicted by Yukawa. As to the muon, however, although very
instrumental in clarifying the properties of the weak interaction, its very existence,
i.e. its role, remains a mystery to this day.

Figure 13.5 Emulsion photograph of $\pi - \mu - e$ decay. A pion decays into a muon
and the muon decays into an electron ($\mu \to e + \nu_e + \nu_\mu$) [80].



Muon Neutrino The existence of the second neutrino was inferred from the muon
decay in $\mu \to e + \nu_e + \nu_\mu$, because the energy spectrum of the produced electron
was not monochromatic but showed a continuous spectrum. That $\nu_\mu$ might be
different from $\nu_e$, which appears in beta decay, was suspected from the absence
of $\mu \to e + \gamma$, which will be treated in more detail in the discussion of the weak
interaction in Chap. 15.
Strange Particles Strange particles were also discovered in cloud chambers ex-
posed to cosmic rays, but their detailed properties were investigated using accelera- 
tors. The pursuit of their properties and role in particle physics led to the discovery 
of the quark model, which will be explained in the next chapter. 



13.2 

Pions, nucleons and their interactions are the basis and also the best studied sub- 
jects of all hadrons. This is because the pion is the lightest hadron and an intense 
pion beam can be easily made by bombarding protons on matter. We start by de- 
termining the spin-parity of the pion. 

Spin of $\pi^+$ To determine the spin of $\pi^+$, the principle of detailed balance was
applied to the reaction

    $p + p \rightleftarrows \pi^+ + D$    (13.1)

Denoting the spin of $\pi$, D, p as s, $s_D$ and $s_p$, respectively, and the momentum of
$\pi$, p in their system as k, p, Eq. (9.146) is written as

    $\dfrac{d\sigma(pp \to \pi D)}{d\sigma(\pi D \to pp)} = 2\,\dfrac{k^2 (2s + 1)(2s_D + 1)}{p^2 (2s_p + 1)^2} = \dfrac{3k^2}{2p^2}(2s + 1)$    (13.2)

The factor 2 in front of the equation is a correction factor for two identical particles
in the final pp system. We used the fact that $s_D = 1$, $s_p = 1/2$ to derive the
last equation. There is a factor 3 difference in the cross section when the spin is 1
compared to s = 0 and it is easy to distinguish them experimentally.

Knowing $\sigma(pp \to \pi D) = 4.5 \pm 0.8$ mb at $T_p = 340$ MeV or $T_\pi = 21.4$ MeV [91]
gives the predictions $\sigma(\pi D \to pp) = 3.0 \pm 1.0$ mb if s = 0 and $1.0 \pm 0.3$ mb if
s = 1. The measurement showed that $\sigma = 3.1 \pm 0.3$ mb at $T_\pi = 25$ MeV [100, 126].
The spin of the pion was determined to be 0.

Parity of $\pi^-$ Let us consider a process in which a slow pion stops in deuterium
(D), is absorbed by it and then dissociates into two neutrons:

    $\pi^- + D \to n + n$    (13.3)

The stopping $\pi^-$ is trapped by the Coulomb force to fall in an orbit of the deuterium,
drops down to the S-state and reacts. Therefore the orbital angular momentum is
0 before the reaction. Assuming the orbital angular momentum in the final state
as L and using the parity conservation law [Eq. (9.73)]:

    $P_\pi P_D = (P_n)^2 (-1)^L$    (13.4)

Deuterium is a bound state of the proton p and the neutron n and is known to
have spin 1 mostly in the S ($\ell = 0$) state with a small mixture of D ($\ell = 2$) state:
$P_D = P_p P_n (-1)^{\ell}$. As we discussed before, if $P_p P_n = 1$ is assumed, the above
equation leads to

    $P_\pi = (-1)^L$    (13.5)

To determine L, we use the angular momentum conservation. As $s_n = 1/2$, the
total spin of the two neutron system is either 0 or 1. Taking into account $s_\pi = 0$,
the initial value of the angular momentum is $J_i = s_D = 1$. The final value is a
vector sum of s and L. Then

    $s = 0 \to J_f = L = 1 \to L = 1$
    $s = 1 \to J_f = L - 1,\ L,\ L + 1 \to L = 2, 1, 0$    (13.6)

We need one more condition to determine the parity. We note that the final state
is a two-fermion state and has to be antisymmetric under permutation. Since the
wave function is a product of both the spin and spatial wave functions, if the spin
wave function is symmetric then the spatial wave function must be antisymmetric
and vice versa. As the s = 0 (1) state is antisymmetric (symmetric), L has to be even
(odd) correspondingly:

    $s = 0 \to L = \mathrm{even}$
    $s = 1 \to L = \mathrm{odd}$    (13.7)

The only allowed value is L = 1, therefore

    $P_\pi = (-1)^L = -1$    (13.8)

If the reaction (13.3) exists, the pion's parity has to be negative. If the
parity is positive the reaction does not happen. The reaction has been observed and
the parity of the pion ($\pi^-$ to be exact) is determined to be negative.
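
The selection of the final state can be verified by brute force. The sketch below enumerates the spin s and orbital angular momentum L of the two-neutron system, keeps the combinations that can couple to J = 1 and that are antisymmetric under exchange, and evaluates $(-1)^L$. The function name and the cutoff `L_max` are illustrative choices.

```python
def allowed_states(J=1, L_max=4):
    """List (s, L) of an identical two-fermion state compatible with total J."""
    states = []
    for s in (0, 1):
        for L in range(L_max + 1):
            # Angular momentum coupling: |L - s| <= J <= L + s.
            couples_to_J = abs(L - s) <= J <= L + s
            # Antisymmetry: singlet (s = 0) needs even L, triplet (s = 1) odd L,
            # which is the same as requiring L + s to be even.
            antisymmetric = (L + s) % 2 == 0
            if couples_to_J and antisymmetric:
                states.append((s, L))
    return states

states = allowed_states()
parities = [(-1) ** L for _, L in states]
print(states, parities)
```

Only the state with s = 1 and L = 1 survives, so the parity of the final state, and hence of the pion, is negative.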

Spin and Parity of $\pi^0$ The neutral pion decays into two photons with nearly 100%
probability. This fact alone excludes the pion's spin from being 1. In the $\pi^0$ rest
frame, the two $\gamma$'s are emitted back to back. Taking the z-axis along the photon
momentum, and writing RL when the photon moving in the z direction is right
handed ($s_z = 1$) and the other moving in the $-z$ direction is left handed ($s_z = 1$), a
total of four states (RL, LR, RR, LL) are conceivable (Fig. 13.6). As $L_z \sim (x \times p) \cdot
p = 0$, $J_z$ includes only spin components. RL, LR states, which have $J_z = \pm 2$,
cannot be realized unless $J \ge 2$. $J_z$ of RR, LL states is 0, which does not rule out
having J = 1. But if J = 1, the wave function should behave as the eigenfunction
of $(J, J_z) = (1, 0)$, namely $Y_{10}(\theta, \phi) \sim \cos\theta$, which gives $-1$ at $\theta = \pi$. But 180°
rotation of RR or LL is equivalent to particle exchange of 1 and 2, which has to give
+1 by their bosonic nature. This is a contradiction, hence J = 1 is not allowed.

Next we assume the spin of $\pi^0$ is zero and derive its parity. The parity transformation
changes the direction of the momentum but not that of the spin. Therefore
R and L are exchanged, which, in turn, means RR and LL are exchanged. Then the
parity eigenstate must be of the form

    $|P\rangle = \dfrac{RR \pm LL}{\sqrt{2}}, \qquad P = \pm 1$    (13.9)
Figure 13.6 Polarization states of two photons in $\pi^0$ CM. The wavy lines denote the photon
momentum and the bold arrows denote their spin orientation. (a) RL, (b) LR have $J_z = \pm 2$
while (c) RR, (d) LL have $J_z = 0$.

Writing the photon polarization vectors as $\epsilon_x$, $\epsilon_y$, we can express the right-handed
and left-handed states as



This means that if P = +1, the polarization of the two photons is parallel, while
the polarization of the negative parity states is orthogonal. Since the direction of
the polarization is that of the electric field, electron-positron pairs produced in
$\gamma \to e^+ + e^-$ tend to be emitted in the direction of the electric field. Among the
$\pi^0$ decay reactions, there is a mode called Dalitz decay

    $\pi^0 \to (e^+ + e^-) + (e^+ + e^-)$    (13.12)

which can happen via inner bremsstrahlung. Therefore an experiment to measure
the angular distribution of the two electron pairs can reveal the parity of $\pi^0$. A
theoretical calculation gives $1 \pm A\cos 2\phi$, where $\phi$ is the angle between the two
planes that the pairs make and $A \simeq 0.48$. Measurement gives $P_\pi = -1$.

Tensor Method It is instructive to rederive the previous result using another
method, the tensor method. It is complementary to the partial wave analysis and
in many cases simpler and intuitive. We try to construct the S-matrix, at least part
of it, using materials available in the problem. We note first that the S-matrix is a
scalar because of rotational invariance. In the decay $\pi^0 \to 2\gamma$, if $\pi^0$ has spin 0,
the wave function has only one degree of freedom and is a scalar. But if $\pi^0$ has spin
1, it has three degrees of freedom, hence the S-matrix has to be a scalar product
of the pion polarization vector $\eta$ with another vector. The same argument applies
to any incoming or outgoing particles. Therefore the amplitude must be linear in
$\epsilon_1$, $\epsilon_2$ and also $\eta$ if the pion is a vector. The materials to construct the S-matrix are
the momenta ($k_1$, $k_2$) and polarization vectors ($\epsilon_1$, $\epsilon_2$) of the photon. The problem
simplifies if we choose the working frame properly, namely, the pion rest frame
and the Coulomb gauge, $k_1 = -k_2 = k$ and $\epsilon_1 \cdot k = \epsilon_2 \cdot k = 0$. The amplitudes
that can be made of these materials are listed in Table 13.1, where $f(k^2)$, $g(k^2)$
represent the remainder of the amplitude.

Table 13.1 Tensor analysis of $\pi^0 \to \gamma\gamma$ decay amplitude.

$J^P(\pi^0)$    Amplitude                                                  Polarization direction
$0^+$           $(\epsilon_1 \cdot \epsilon_2)\, f(k^2)$                   $\epsilon_1 \parallel \epsilon_2$
$0^-$           $(k \cdot \epsilon_1 \times \epsilon_2)\, f(k^2)$          $\epsilon_1 \perp \epsilon_2$
$1^-$           $(\eta \cdot k)(\epsilon_1 \cdot \epsilon_2)\, g(k^2)$     $\epsilon_1 \parallel \epsilon_2$
$1^+$           $(\eta \cdot \epsilon_1 \times \epsilon_2)\, g(k^2)$       $\epsilon_1 \perp \epsilon_2$

Note: $k \times (\epsilon_1 \times \epsilon_2) = \epsilon_1 (k \cdot \epsilon_2) - \epsilon_2 (k \cdot \epsilon_1) = 0$

Under the parity transformation, $k \to -k$ and the pion wave function also
changes its sign depending on its intrinsic parity. Since the final state is composed
of two identical bosons, the amplitude has to be symmetric under exchange of the
two particles, i.e. $k \leftrightarrow -k$ and $\epsilon_1 \leftrightarrow \epsilon_2$. The $1^\pm$ amplitudes do not satisfy this
requirement and are forbidden. The remaining choice is between $0^+$ and $0^-$, which
must be determined experimentally.

While the possibility of $s \ge 2$ is not excluded, it is unlikely considering the fact
that the three members of the pion ($\pi^+$, $\pi^0$, $\pi^-$) have almost the same mass. They
constitute an isospin triplet (to be discussed later), and are considered to share the
same properties. If $s \ge 2$, the angular distribution of $\pi^0 \to 2\gamma$ would be anisotropic,
which is not observed experimentally. In high-energy reactions, a great number
of pions are produced and their multiplicity is expected to be proportional to their
degrees of freedom (2s + 1). But it is the same for all $\pi^\pm$, $\pi^0$, so we may assume
they have the same spin and parity. Today, pions are considered as composites of
u, d quarks and $\bar{u}$, $\bar{d}$ antiquarks, $\pi^+ \sim -u\bar{d}$, $\pi^0 \sim u\bar{u} - d\bar{d}$, $\pi^- \sim d\bar{u}$. Their total
spin is s = 0, the relative orbital angular momentum $\ell = 0$ and they constitute
$J^P = 0^-$ mesons.

Problem 13.1 

Prove the following statements assuming parity invariance. 

1. The decay 0+ ->• + is allowed, but 0~ ->• + is not, where the final state 
is composed of two identical particles. Experimentally, $\eta$(548) is known to have
spin 0 and

which tells us that $\eta$ has $J^P = 0^-$.

0+ -► 0~ + 0~ + 0~ is forbidden. 

0~ -»■ 0+ + y , 0~ + y are forbidden. 

0~ -► 0~ + 0~ + 1~ is allowed. 

l~ — ► — + 1 — is allowed. 

Let slow antiprotons stop in hydrogen. Prove that the process p 4 

forbidden. 



G Parity As was shown in Section 9.3.4, the transformation of a particle doublet
to an antiparticle doublet is obtained by the successive operation of C and $e^{-i\pi I_2}$,
a rotation about the axis $I_2$ by angle $\pi$. It is referred to as the G operation:

    $G = C\,e^{-i\pi I_2}$

The fact that the G-operated state transforms in the same way under isospin
rotation means that the G and isospin operations are commutable. Let us operate
with G on the pion fields. The C operation interchanges $\pi = (\pi_1 + i\pi_2)/\sqrt{2}$ and
$\pi^\dagger = (\pi_1 - i\pi_2)/\sqrt{2}$, which means $\pi_2 \to -\pi_2$. The charge symmetry operation
is a rotation around the y-axis and changes $\pi_1, \pi_3 \to -\pi_1, -\pi_3$. The net effect of
the G transformation is to change the sign of all the pions:

    $G\pi = -\pi$    (13.15)

This means a system of n pions has G parity $(-1)^n$. Unlike C parity, G parity can
be defined for a charged system and is more convenient. However, since the G
operation includes a rotation in isospin space, the applicability is limited to the
strong interaction. In the electromagnetic interaction, G parity is not conserved.
Conversely it can be used to differentiate one from the other. Especially useful is
an application to the neutral $I_3 = 0$ system.

■ Theorem 13.1 

The G parity of a system consisting of a particle $q$ and an antiparticle $\bar q$ is given by
$(-1)^{S+L+I}$.

Proof: As the isospin mathematics is the same as that of the angular momentum,
the eigenvalue of $e^{-i\pi I_2}$ has the same form as that of $e^{-i\pi L_y}$. As the system is neutral
and has $I_3 = 0$, the eigenfunction is $Y_{I0} \sim P_I$ (the Legendre polynomial of $I$th
order). As $P_I(-x) = (-1)^I P_I(x)$, and rotation by $\pi$ around the y-axis changes
$\theta \to \pi - \theta$,

$$e^{-i\pi I_2}\,Y_{I0} = (-1)^I\,Y_{I0} \tag{13.16}$$

On the other hand, the C operation on a neutral system yields $C = (-1)^{L+S}$ [see
Eq. (9.177)]. Consequently, the G parity of a neutral system is $G = (-1)^{L+S+I}$. □

Examples of the G Parity Selection Rule

(1) $\pi$ production in the $\bar p n$ system: The system has $I = 1$. For slow $\bar p$ stopping in
deuterium, the reaction takes place in the S ($L = 0$) state. Then an even (odd)
number of pions are produced in the decay from the $^3S_1$ ($^1S_0$) states.

(2) $\rho \to 2\pi$:

$$\rho^\pm(1^-) \to \pi^\pm\pi^0 \quad\therefore\ G_\rho = + \ \to\ \rho \not\to 3\pi \tag{13.17a}$$

$$\omega(1^-) \to \pi^+\pi^-\pi^0 \quad\therefore\ G_\omega = - \ \to\ \omega \not\to 2\pi \tag{13.17b}$$
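The selection rule in example (1) can be checked mechanically. The following is a minimal sketch, assuming only Theorem 13.1, $G = (-1)^{L+S+I}$, and the $n$-pion G parity $(-1)^n$ of Eq. (13.15); the function names are hypothetical and chosen for illustration.

```python
def g_parity(L, S, I):
    """G parity of a particle-antiparticle system, (-1)^(L+S+I) (Theorem 13.1)."""
    return 1 if (L + S + I) % 2 == 0 else -1


def pion_g_parity(n):
    """G parity of an n-pion state, (-1)^n, following Eq. (13.15)."""
    return 1 if n % 2 == 0 else -1


def allowed_pion_numbers(L, S, I, n_max=6):
    """Pion multiplicities (2..n_max) compatible with G-parity conservation."""
    g = g_parity(L, S, I)
    return [n for n in range(2, n_max + 1) if pion_g_parity(n) == g]


# Antiproton-neutron annihilation at rest: I = 1, L = 0.
from_3s1 = allowed_pion_numbers(L=0, S=1, I=1)  # spin triplet
from_1s0 = allowed_pion_numbers(L=0, S=0, I=1)  # spin singlet
print(from_3s1)
print(from_1s0)
```

The triplet state yields only even pion numbers and the singlet only odd ones, as stated in the text.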

Problem 13.2 

Main decay modes of the $\eta$ meson ($m_\eta = 548$ MeV, $\Gamma_\eta = 1.3$ keV) are

$$\begin{aligned}
\eta &\to \pi^+ + \pi^- + \pi^0 & 22.73 \pm 0.34\% \\
&\to \pi^0 + \pi^0 + \pi^0 & 32.56 \pm 0.23\% \\
&\to 2\gamma & 39.31 \pm 0.20\% \\
&\to \pi^+ + \pi^- + \gamma & 4.60 \pm 0.16\%
\end{aligned} \tag{13.18}$$

All the decays are via electromagnetic interaction. Why? Considering C, P, J selection
rules, show that $J^{PC}$ of $\eta$ is $0^{-+}$ if $J < 2$.



13.3 

$\pi N$ Interaction

13.3.1 

Isospin Conservation 

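Charge Independence of $\pi N$ Coupling As the pion is a spin scalar in real space,
the simplest Lorentz invariant interaction can be expressed as

$$\mathcal{H}_{\rm INT} \sim i g\,\bar\psi\gamma^5\psi\,\pi \ ^{2)} \tag{13.19}$$

In isospin space, an obvious invariant that can be made from the two vectors $\bar\psi\boldsymbol{\tau}\psi$
and $\boldsymbol{\pi}$ is $\bar\psi\,\boldsymbol{\tau}\cdot\boldsymbol{\pi}\,\psi$. The interaction Hamiltonian is now

$$\begin{aligned}
\mathcal{H}_{\rm INT} &= i g\,\bar\psi\gamma^5\,\boldsymbol{\tau}\cdot\boldsymbol{\pi}\,\psi \\
&= i g\,[(\bar n p + \bar p n)\pi_1 + i(\bar n p - \bar p n)\pi_2 + (\bar p p - \bar n n)\pi_3] \\
&= i g\,[\sqrt{2}\,(\bar p n\,\pi^+ + \bar n p\,\pi^-) + (\bar p p - \bar n n)\pi^0]
\end{aligned} \tag{13.20a}$$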

2) $(f/m_\pi)\,\bar\psi\gamma^\mu\gamma^5\psi\,\partial_\mu\pi$ is equally valid, but in the low-energy limit it is equivalent to the
pseudoscalar interaction of Eq. (13.19) with $f = g(m_\pi/2m_p)$ [refer to the low-energy
approximation of the Dirac bilinears in Eq. (4.2)].

$$\pi^- = \frac{\pi_1 + i\pi_2}{\sqrt{2}}, \qquad \pi^+ = \frac{\pi_1 - i\pi_2}{\sqrt{2}}, \qquad \pi^0 = \pi_3 \tag{13.20b}$$

where in the second and third lines we have omitted the space-time variables as
they play no role in the following discussion. The last line of Eq. (13.20a) means
that the strengths of the interactions

$$p \leftrightarrow n + \pi^+, \qquad n \leftrightarrow p + \pi^-, \qquad p \leftrightarrow p + \pi^0, \qquad n \leftrightarrow n + \pi^0 \tag{13.21}$$

are proportional to

$$\sqrt{2} : \sqrt{2} : 1 : -1 \tag{13.22}$$
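The step from the Cartesian components $\pi_1, \pi_2, \pi_3$ to the charge states in Eq. (13.20a) can be verified numerically. The sketch below assumes the assignment $\pi^\pm = (\pi_1 \mp i\pi_2)/\sqrt{2}$, $\pi^0 = \pi_3$, which is the one that reproduces the last line of Eq. (13.20a); the numerical values of the field components are arbitrary test inputs, not data.

```python
import math

# Pauli (isospin) matrices tau_1, tau_2, tau_3 in the (p, n) basis.
TAU = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def tau_dot_pi(pi1, pi2, pi3):
    """Return the 2x2 matrix tau . pi for real Cartesian components."""
    comps = (pi1, pi2, pi3)
    return [[sum(TAU[k][r][c] * comps[k] for k in range(3)) for c in range(2)]
            for r in range(2)]


pi1, pi2, pi3 = 0.3, -1.1, 0.7            # arbitrary illustrative values
pi_plus = (pi1 - 1j * pi2) / math.sqrt(2)
pi_minus = (pi1 + 1j * pi2) / math.sqrt(2)
pi_zero = pi3

matrix = tau_dot_pi(pi1, pi2, pi3)
# Rows/columns ordered (p, n): element [0][1] multiplies pbar n, etc.
expected = [[pi_zero, math.sqrt(2) * pi_plus],
            [math.sqrt(2) * pi_minus, -pi_zero]]
max_diff = max(abs(matrix[r][c] - expected[r][c])
               for r in range(2) for c in range(2))
print(max_diff)
```

The off-diagonal entries carry the factor $\sqrt{2}$ and the diagonal entries the factors $+1$ and $-1$, which are the relative strengths quoted in Eq. (13.22).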

The charge independence condition Eq. (9.193), $V_{pp} = V_{nn} = V_{pn}$, is derived from
the above relation. The argument goes like this. The strength of the potential
between nucleons is expressed in Fig. 13.7.

Figure 13.7 The strengths of (a) $V_{pp}$, (b) $V_{nn}$ and (c, d) $V_{pn}$.

Fig. 13.7a   Exchange of $\pi^0$     $V_{pp} \propto g^2[(1)\times(1)] = g^2$

Fig. 13.7b   Exchange of $\pi^0$     $V_{nn} \propto g^2[(-1)\times(-1)] = g^2$

Fig. 13.7c   Exchange of $\pi^0$     $V_{pn} \propto g^2[(1)\times(-1)] = -g^2$

Fig. 13.7d   Exchange of $\pi^\pm$   $V_{pn} \propto g^2[(\sqrt{2})\times(\sqrt{2})] = 2g^2$

Since pp and nn are both $I = 1$ states and symmetric under exchange of the
particles 1 and 2, the $I = 1$ part of the potential becomes

$$V_{pn}(c) + V_{pn}(d) = g^2 = V_{pp} = V_{nn} \tag{13.23}$$

which shows the charge independence of the nuclear force.



Isospin Exchange Force In Section 6.8 we saw that when two distinguishable
fermions scatter by exchanging a scalar meson, the scattering matrix is given by
Eq. (6.157)

$$T_{fi} = (2\pi)^4\delta^4(p_1 + p_2 - p_3 - p_4)\,\bar u(p_3)\Gamma u(p_1)\,\frac{g^2}{q^2 - \mu^2}\,\bar u(p_4)\Gamma u(p_2) \tag{13.24}$$

where $\Gamma = 1$ ($i\gamma^5$) for a (pseudo)scalar meson exchange. In the nonrelativistic
limit, the scalar meson propagator becomes $-g^2/(\boldsymbol{q}^2 + \mu^2)$, which is the Fourier
transform of an attractive Yukawa potential. We now extend the discussion to
include the isospin degrees of freedom. The interaction including the isospin is given
in Eq. (13.20):

$$\mathcal{H}_{\rm INT} = g\,\bar\psi\,\Gamma\,\boldsymbol{\tau}\cdot\boldsymbol{\pi}\,\psi = g\sum_i \bar\psi\,\Gamma\tau_i\,\psi\,\pi_i \tag{13.25}$$

where $\psi$ is now a $2\times 1$ matrix in the isospin space. Then Eq. (13.24) is modified to

$$T_{fi} \sim \bar u(p_3)\Gamma\tau_{1,i}\,u(p_1)\,\frac{g^2}{q^2 - \mu^2}\,\bar u(p_4)\Gamma\tau_{2,i}\,u(p_2) \tag{13.26}$$

where we have omitted the $\delta$-function. We see the only modification due to
inclusion of the isospin is

$$\frac{g^2}{q^2 - \mu^2} \ \to\ \frac{g^2\,\boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2}{q^2 - \mu^2} \tag{13.27}$$

where we have omitted the isospin wave function and expressed the scattering
matrix as a $2\times 2$ matrix in isospin space. The subscripts 1, 2 were attached to denote
that the $\tau$ matrix acts only on particle 1 or 2.

Now, a system of two identical fermions must be odd under permutation of
particles 1 and 2. Since we have included the isospin degree of freedom, regarding the
nucleon (p, n) as identical but in different states, the permutation must be done on
all three kinds of wave functions:

$$(1 \leftrightarrow 2) \sim (\text{isospin}) \times (\text{spin}) \times (\text{spatial wave function}) \tag{13.28}$$

Since the two nucleons both have $I = 1/2$, the total isospin is either 1 or 0 and
symmetric or antisymmetric accordingly. The isospin wave functions are given by

$$I = 1:\quad \begin{cases} |pp\rangle & I_3 = +1 \\ \frac{1}{\sqrt{2}}\,(|pn\rangle + |np\rangle) & I_3 = 0 \\ |nn\rangle & I_3 = -1 \end{cases} \tag{13.29a}$$

$$I = I_3 = 0:\quad \frac{1}{\sqrt{2}}\,(|pn\rangle - |np\rangle) \tag{13.29b}$$

The value of $\langle I|\boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2|I\rangle$ is

$$\langle I|\boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2|I\rangle = 2\left[I(I+1) - \tfrac{3}{4} - \tfrac{3}{4}\right] = \begin{cases} 1 & (I = 1) \\ -3 & (I = 0) \end{cases} \tag{13.30}$$

The deuteron, a bound state of (p n), has $I = 0$, $S = 1$ and only the even orbital
angular momentum $L = 0, 2, \ldots$ states are allowed, which is indeed the case. It has
a dominant S ($L = 0$) state with a small mixture of D ($L = 2$) state.
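The isospin factor $\boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2$ can be evaluated directly on the wave functions of Eq. (13.29). This is a small sketch using explicit $4\times 4$ matrices in the two-nucleon basis (pp, pn, np, nn); the helper names are illustrative assumptions.

```python
import math

TAU = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]


def kron(a, b):
    """Kronecker product of two 2x2 matrices (basis order pp, pn, np, nn)."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]


def tau1_dot_tau2():
    """Sum over i of tau_i (particle 1) x tau_i (particle 2)."""
    total = [[0j] * 4 for _ in range(4)]
    for t in TAU:
        prod = kron(t, t)
        for r in range(4):
            for c in range(4):
                total[r][c] += prod[r][c]
    return total


def expectation(op, state):
    """<state| op |state> for a normalized 4-component state."""
    value = sum(complex(state[r]).conjugate() * op[r][c] * state[c]
                for r in range(4) for c in range(4))
    return value.real


s = 1 / math.sqrt(2)
op = tau1_dot_tau2()
triplet_pp = expectation(op, [1, 0, 0, 0])   # |pp>, I = 1
triplet_0 = expectation(op, [0, s, s, 0])    # (|pn> + |np>)/sqrt(2), I = 1
singlet = expectation(op, [0, s, -s, 0])     # (|pn> - |np>)/sqrt(2), I = 0
print(triplet_pp, triplet_0, singlet)
```

The result agrees with the closed form $2[I(I+1) - 3/2]$: all $I = 1$ members give the same value, and the $I = 0$ state gives the opposite sign with three times the magnitude.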
For $I = 0$, $\boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2 < 0$ and the sign of the potential is reversed. Namely, the
scalar meson exchange gives a repulsive force and cannot make a bound state if
the isospin degree of freedom is taken into account. However, if the exchanged
meson is a pseudoscalar, Eq. (13.26) gives an attractive potential and can explain
the existence of a nuclear bound state. Since it takes some extra algebra to show
that the pseudoscalar exchange indeed gives an attractive force, explicit calculations
will be deferred to Section 13.6.

The appearance of the isospin exchange force ($\sim \boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2$) is due to the exchange
of isospin carried by the pion. Whenever the force carrier has a quantum number
that is described by $SU(N)$, $N \ge 2$, a similar exchange force term appears. The
color degree of freedom carried by the gluon induces a color exchange force in
$SU(3)$, and it played a crucial role in disentangling the hadron mass spectrum
and in proving the color charge to be the source of the strong force, which will be
discussed in detail in Section 14.6.

33 Resonance ($\Delta(1232)$) In the early days before the discovery of quarks, hadrons,
including pions and nucleons, were considered elementary and their dynamics was
extensively studied through their reactions. Figure 13.8 shows one example of such
investigations. One immediately notes a rich structure in their behavior and many
resonances have been identified. Today, they are recognized as superpositions of
excited energy levels of quark composites. But discovery does not happen in a day.
Given this kind of data, it is necessary to isolate levels, identify their properties
and classify them to make a Mendeleev periodic table of hadrons. Namely, finding
regularities among the many levels is the first step in reaching a deeper
understanding of the structure. This is the process called hadron spectroscopy. Before
launching a systematic investigation, we pick out the most conspicuous resonance
at $M = 1232$ MeV, which used to be called the 33 resonance, and apply the algebra
of isospin symmetry to show how it works. There are eight combinations in $\pi N$
scattering, but we pick out three easily accessible reactions in the following:

$$\pi^+ + p \to \pi^+ + p \tag{13.31a}$$

$$\pi^- + p \to \pi^- + p \tag{13.31b}$$

$$\pi^- + p \to \pi^0 + n \tag{13.31c}$$

Expressing only the isospin indices, we write the scattering amplitude as

$$\langle f|T|i\rangle = \sum_{I,I_3}\sum_{I',I_3'} \langle f|I\,I_3\rangle\langle I\,I_3|T|I'\,I_3'\rangle\langle I'\,I_3'|i\rangle \tag{13.32}$$

If charge independence is valid, the total isospin $I$ and $I_3$ are conserved. The
difference of the $I_3$ is that of the electric charge, and so $I_3$ conservation follows from
electric charge conservation, too. Just like rotational invariance in real space, the
scattering amplitude can be decomposed into partial amplitudes of definite isospin.

Figure 13.8 Resonance structure in $\pi^\pm p$ scattering data (horizontal axis: mass of the
$\pi p$ system in MeV). N denotes $I = 1/2$ and $\Delta$ denotes $I = 3/2$; the number in
parentheses is the mass. D, F, G, and H stand for the $L = 2, 3, 4$ and 5 states. D15 means
$L = 2$, $I = 1/2$, $J = 5/2$. Some resonances are overlaps of two or more states. N(1680) may be
an overlap of N(1675) D15 and N(1680) F15. Similarly, $\Delta(1920)$ may be an overlap of
$\Delta(1920)$ P33 and $\Delta(1930)$ D35. N(2200) may also be an overlap of N(2190) G17 and
N(2220) H19 [156].



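The Wigner-Eckart theorem tells us that

$$\langle I\,I_3|T|I'\,I_3'\rangle = \delta_{II'}\delta_{I_3I_3'}\langle I|T|I\rangle \equiv \delta_{II'}\delta_{I_3I_3'}\,T(I) \tag{13.33}$$

The transformation matrices $\langle I\,I_3|i\rangle$ can be obtained from the Clebsch-Gordan
coefficient table (see Appendix E). Let us first express the $|I\,I_3\rangle$ states in terms of
the $|\pi N\rangle$ states:

$$|\tfrac32, \tfrac32\rangle = |\pi^+ p\rangle \tag{13.34a}$$

$$|\tfrac32, \tfrac12\rangle = \sqrt{\tfrac13}\,|\pi^+ n\rangle + \sqrt{\tfrac23}\,|\pi^0 p\rangle \tag{13.34b}$$

$$|\tfrac32, -\tfrac12\rangle = \sqrt{\tfrac23}\,|\pi^0 n\rangle + \sqrt{\tfrac13}\,|\pi^- p\rangle \tag{13.34c}$$

$$|\tfrac32, -\tfrac32\rangle = |\pi^- n\rangle \tag{13.34d}$$

$$|\tfrac12, \tfrac12\rangle = \sqrt{\tfrac23}\,|\pi^+ n\rangle - \sqrt{\tfrac13}\,|\pi^0 p\rangle \tag{13.35a}$$

$$|\tfrac12, -\tfrac12\rangle = \sqrt{\tfrac13}\,|\pi^0 n\rangle - \sqrt{\tfrac23}\,|\pi^- p\rangle \tag{13.35b}$$

Solving the above equations in terms of $|I\,I_3\rangle$, we obtain $^{3)}$

$$|\pi^+ p\rangle = |\tfrac32, \tfrac32\rangle \tag{13.36a}$$

$$|\pi^- p\rangle = \sqrt{\tfrac13}\,|\tfrac32, -\tfrac12\rangle - \sqrt{\tfrac23}\,|\tfrac12, -\tfrac12\rangle \tag{13.36b}$$

$$|\pi^0 n\rangle = \sqrt{\tfrac23}\,|\tfrac32, -\tfrac12\rangle + \sqrt{\tfrac13}\,|\tfrac12, -\tfrac12\rangle \tag{13.36c}$$

Using the Wigner-Eckart theorem

$$\langle \pi^+ p|T|\pi^+ p\rangle = \langle \tfrac32\,\tfrac32|T|\tfrac32\,\tfrac32\rangle = T(\tfrac32) \tag{13.37a}$$

$$\langle \pi^- p|T|\pi^- p\rangle = \tfrac13\,T(\tfrac32) + \tfrac23\,T(\tfrac12) = \tfrac13\left[T(\tfrac32) + 2T(\tfrac12)\right] \tag{13.37b}$$

$$\langle \pi^0 n|T|\pi^- p\rangle = \tfrac{\sqrt2}{3}\,T(\tfrac32) - \tfrac{\sqrt2}{3}\,T(\tfrac12) = \tfrac{\sqrt2}{3}\left[T(\tfrac32) - T(\tfrac12)\right] \tag{13.37c}$$

The three amplitudes (or all eight amplitudes) are written in terms of two
independent amplitudes $T(3/2)$ and $T(1/2)$. Combining the three equations in Eq. (13.37),

$$-\langle \pi^+ p|T|\pi^+ p\rangle + \langle \pi^- p|T|\pi^- p\rangle + \sqrt{2}\,\langle \pi^0 n|T|\pi^- p\rangle = 0 \tag{13.38}$$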

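The three amplitudes make a triangle in the complex plane. If we express the
corresponding cross sections by $d\sigma^+$, $d\sigma^-$, $d\sigma^0$, Eq. (13.38) can be rewritten as

$$\left|\sqrt{d\sigma^-} - \sqrt{2d\sigma^0}\right| \le \sqrt{d\sigma^+} \le \sqrt{d\sigma^-} + \sqrt{2d\sigma^0} \tag{13.39a}$$

$$\left|\sqrt{2d\sigma^0} - \sqrt{d\sigma^+}\right| \le \sqrt{d\sigma^-} \le \sqrt{2d\sigma^0} + \sqrt{d\sigma^+} \tag{13.39b}$$

$$\left|\sqrt{d\sigma^+} - \sqrt{d\sigma^-}\right| \le \sqrt{2d\sigma^0} \le \sqrt{d\sigma^+} + \sqrt{d\sigma^-} \tag{13.39c}$$

As the cross sections are proportional to the square of the amplitude, they can be
expressed as

$$d\sigma^+ \propto \left|T(\tfrac32)\right|^2 \tag{13.40a}$$

$$d\sigma^- \propto \tfrac19\left|T(\tfrac32) + 2T(\tfrac12)\right|^2 \tag{13.40b}$$

$$d\sigma^0 \propto \tfrac29\left|T(\tfrac32) - T(\tfrac12)\right|^2 \tag{13.40c}$$

If $|T(3/2)| \gg |T(1/2)|$

$$d\sigma^+ : d\sigma^- : d\sigma^0 = 9 : 1 : 2 \tag{13.41}$$

If $|T(3/2)| \ll |T(1/2)|$

$$d\sigma^+ : d\sigma^- : d\sigma^0 = 0 : 2 : 1 \tag{13.42}$$

3) The Clebsch-Gordan coefficients mak… vertically instead of horizontally, the in…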
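The two limiting ratios follow from Eqs. (13.37) and (13.40) by simple arithmetic, which the sketch below reproduces. The isospin amplitudes `t32` and `t12` are free complex inputs chosen for illustration; nothing here is fitted to data.

```python
import math


def amplitudes(t32, t12):
    """Amplitudes of Eq. (13.37) in terms of T(3/2) and T(1/2)."""
    a_plus = t32                               # pi+ p -> pi+ p
    a_minus = (t32 + 2 * t12) / 3              # pi- p -> pi- p
    a_cex = math.sqrt(2) / 3 * (t32 - t12)     # pi- p -> pi0 n
    return a_plus, a_minus, a_cex


def cross_sections(t32, t12):
    """Relative cross sections (dsigma+, dsigma-, dsigma0), Eq. (13.40)."""
    return tuple(abs(a) ** 2 for a in amplitudes(t32, t12))


pure_32 = cross_sections(1.0, 0.0)   # only I = 3/2 contributes
pure_12 = cross_sections(0.0, 1.0)   # only I = 1/2 contributes
print([x / pure_32[1] for x in pure_32])
print([x / pure_12[2] for x in pure_12])

# The triangle relation (13.38) holds for arbitrary complex amplitudes.
a_plus, a_minus, a_cex = amplitudes(0.4 + 0.9j, -0.2 + 0.3j)
triangle = -a_plus + a_minus + math.sqrt(2) * a_cex
print(abs(triangle))
```

Dividing by the smallest nonzero entry gives 9 : 1 : 2 for pure $I = 3/2$ and 0 : 2 : 1 for pure $I = 1/2$, and the triangle sum vanishes to rounding accuracy.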



The above relations use only isospin conservation; no other restrictions apply
and consequently they are valid for any value of dynamic variables such as the
scattering angle or the energy as long as the comparison is made under the same
conditions. The data of a conspicuous resonance $\Delta(1232)$ at the total energy $\sqrt{s} =$
1100 to 1300 MeV or the kinetic energy of the pion $100 < T_\pi < 300$ MeV are shown in
Fig. 13.9. The measured total cross sections at the peak are

$$210\ \text{mb} : 24\ \text{mb} : 48\ \text{mb} \ ^{4)} \tag{13.43}$$



which shows that $\Delta(1232)$ has $I = 3/2$. On the other hand, $\Delta(1232)$ is observed
in the $\pi^+ p$, $\pi^+ n$, $\pi^- p$, and $\pi^- n$ channels having electric charge (++, +, 0, $-$),
making a quartet, and $I = 3/2$ can be inferred without the above demonstration. But
the fact that the cross sections were just as predicted is strong evidence that the
hypothesis of charge independence is valid in $\pi N$ reactions or more generally in
the strong interaction.

Other Tests Other evidence is given by the absence of

$$d + d \to {}^4\mathrm{He} + \pi^0$$

The deuteron and helium are isosinglets ($I = 0$), and the pion an isotriplet ($I = 1$),
so the above reaction, if it exists, violates isospin conservation.

Problem 13.3

Helium $^3$He and tritium $^3$H make an isospin doublet. Show that

$$\frac{d\sigma(pd \to {}^3\mathrm{H}\,\pi^+)}{d\sigma(pd \to {}^3\mathrm{He}\,\pi^0)} = 2$$

Problem 13.4

Show that

$$\frac{d\sigma(pp \to d\pi^+)}{d\sigma(np \to d\pi^0)} = 2$$



Problem 13.5 

$(K^+, K^0)$, $(\Sigma^+, \Sigma^0, \Sigma^-)$ make $I = 1/2$ and 1 multiplets. Writing the amplitudes

for 



Problem 13.6

$(K^+, K^0)$, $(\Xi^0, \Xi^-)$ make $I = 1/2$ doublets. Consider the reactions

$$K^- n \to K^0\Xi^-, \qquad K^- p \to K^+\Xi^-, \qquad K^- p \to K^0\Xi^0$$

and show the relation among the amplitudes

$$f(K^0\Xi^-) = f(K^+\Xi^-) + f(K^0\Xi^0)$$

Problem 13.7

$\Lambda$ is an isosinglet particle. Show that

$$d\sigma(\pi^+ n \to K^+\Lambda^0) = d\sigma(\pi^- p \to K^0\Lambda^0)$$



Fermi-Yang Model If we make an isotriplet from two isodoublets $(p, n)$ and
$(-\bar n, \bar p)$ we have

$$|1, 1\rangle = -p\bar n, \qquad |1, 0\rangle = \frac{1}{\sqrt{2}}\,(p\bar p - n\bar n), \qquad |1, -1\rangle = n\bar p \tag{13.52}$$

These states have baryon number 0, isospin $I = 1$ and the same quantum number
as the pions $(\pi^+, \pi^0, \pi^-)$. Therefore the idea that the pions are bound states of a
nucleon and an antinucleon was proposed by Fermi and Yang [139]. This was the
predecessor of the Sakata model, which extended the $SU(2)$ Fermi-Yang model to
$SU(3)$, taking $(p, n, \Lambda)$ as the three basic constituents of all the hadrons, which in
turn developed into the successful quark model.

13.3.2 

Partial Wave Analysis 

A scattering process can be decomposed into a sum of contributions with definite
angular momentum $J$, which is called partial wave expansion. Even if a resonance
is observed at a certain value of $J$, there are contributions from other partial waves,
which may not form resonances. Generally they are slowly varying backgrounds as
a function of the energy. If the energy is high and many inelastic channels
consisting of more than one $\pi$ and a nucleon are open, the contribution of the resonance
may be only a small fraction and may not appear as a sharp peak. To find
resonances in such a case, a partial wave analysis taking into account other
contributions is necessary. The starting point is the partial wave expansion formula derived
in Chap. 9. We reproduce them here:

$$\frac{d\sigma}{d\Omega} = \left|f_{\lambda_3\lambda_4,\lambda_1\lambda_2}(\theta,\phi)\right|^2 \tag{13.53a}$$

(13.53b)

$$f_{\lambda_3\lambda_4,\lambda_1\lambda_2}(\theta,\phi) = \frac{1}{2ip}\sum_J (2J+1)\,\langle\lambda_3\lambda_4|S^J - 1|\lambda_1\lambda_2\rangle\, d^J_{\lambda\mu}(\theta)\,e^{i(\lambda-\mu)\phi}, \quad \lambda = \lambda_1 - \lambda_2,\ \mu = \lambda_3 - \lambda_4 \tag{13.53c}$$

$$P\,|JM;\lambda_1\lambda_2\rangle = \eta_1\eta_2(-1)^{J - s_1 - s_2}\,|JM;-\lambda_1,-\lambda_2\rangle \tag{13.55a}$$

$$\langle -\lambda_3,-\lambda_4|S^J|-\lambda_1,-\lambda_2\rangle = \eta_S\,\langle\lambda_3\lambda_4|S^J|\lambda_1\lambda_2\rangle \tag{13.55b}$$

$$\eta_S = \eta_1\eta_2\eta_3\eta_4(-1)^{s_3 + s_4 - s_1 - s_2} \tag{13.55c}$$

$$\langle -\lambda_3,-\lambda_4|f(\theta,\phi)|-\lambda_1,-\lambda_2\rangle = \eta_S\,\langle\lambda_3\lambda_4|f(\theta,\pi-\phi)|\lambda_1\lambda_2\rangle \tag{13.55d}$$

where all the kinematical variables are to be evaluated in the center of mass frame.
$P$ is the parity operator and $\lambda_i$, $\eta_i$ are the helicity and intrinsic parity of the particle $i$
respectively. As the nucleon and the pion have spin 1/2 and 0 respectively, we set
$\lambda_1 = 0$, $\lambda_2 = \pm 1/2$. In the following we write $\pm$ instead of $\pm 1/2$ for simplicity.
According to Eq. (13.55b), we have

$$S^J_{++} = S^J_{--}, \qquad S^J_{+-} = S^J_{-+} \tag{13.56}$$

Denoting the orbital angular momentum as $L$, we have $L = J \pm 1/2$. Considering
that $\pi$ has intrinsic parity $-1$, the parity transformation formula (13.55a) becomes

$$P\,|JM;\pm\rangle = -(-1)^{J-1/2}\,|JM;\mp\rangle \tag{13.57}$$

where $J$, $M$ denote the angular momentum and its component along the z-axis. Then,
eigenstates of the parity operator can be constructed as

$$|J-\rangle = \frac{|JM;+\rangle + |JM;-\rangle}{\sqrt{2}}, \qquad |J+\rangle = \frac{|JM;+\rangle - |JM;-\rangle}{\sqrt{2}} \tag{13.58}$$

and their parity is given by

$$P\,|J\pm\rangle = \pm(-1)^{J-1/2}\,|J\pm\rangle = -(-1)^{J\pm 1/2}\,|J\pm\rangle \tag{13.59}$$

5) The relation with the scattering matrix is given by Eq. (6.35):

$$S_{fi} = \delta_{fi} - iT_{fi} = \delta_{fi} - i(2\pi)^4\delta^4\Big(\sum p_i - \sum p_f\Big)\,M_{fi} \tag{13.54}$$

Equation (13.59) means $|J\pm\rangle$ corresponds to $L = J \pm 1/2$. According to Eq. (13.56),
there are only two independent amplitudes for a fixed $J$. Two $L$'s with the same $J$
state differ by 1 and hence correspond to states of different parity. If parity is
conserved, they do not mix. Therefore, partial matrices made of the states $|J\pm\rangle$ are
diagonalized. From the unitarity of the S-matrix, we have

$$S_{J-} = e^{2i\delta_{J-}}, \qquad S_{J+} = e^{2i\delta_{J+}} \tag{13.60}$$

where $\delta_J$ is referred to as the phase shift. When inelastic channels are open, the
unitarity condition changes. If we are considering only the elastic scattering channel,
the effect appears as attenuation of the flux. It can be taken into account by
considering a complex phase shift that has a positive imaginary part. We define

$$e^{2i\delta_J} = e^{2i(\mathrm{Re}\,\delta_J + i\,\mathrm{Im}\,\delta_J)} \equiv \eta_J\,e^{2i\,\mathrm{Re}\,\delta_J} \tag{13.61}$$

Then, using Eqs. (13.58), we have

$$\frac{\langle JM;+|S - 1|JM;+\rangle}{2ip} = \frac{S^J_{++} - 1}{2ip} = \frac{e^{2i\delta_{J-}} + e^{2i\delta_{J+}} - 2}{4ip} = \frac{f_{J-} + f_{J+}}{2} \tag{13.62a}$$

$$\frac{\langle JM;+|S - 1|JM;-\rangle}{2ip} = \frac{S^J_{+-}}{2ip} = \frac{e^{2i\delta_{J-}} - e^{2i\delta_{J+}}}{4ip} = \frac{f_{J-} - f_{J+}}{2} \tag{13.62b}$$

$$f_{J\pm} = \frac{1}{p}\,e^{i\delta_{J\pm}}\sin\delta_{J\pm} \tag{13.62c}$$

which defines the partial scattering amplitudes. Inserting Eqs. (13.62) in
Eq. (13.53c) and rewriting $d^J_{\lambda\mu}$ using the formula in Eq. (E.22), we obtain

$$f_{++} = \sum_J (f_{J-} + f_{J+})\cos\frac{\theta}{2}\left[P'_{J+1/2}(\theta) - P'_{J-1/2}(\theta)\right] \tag{13.63a}$$

$$f_{+-} = \sum_J (f_{J-} - f_{J+})\sin\frac{\theta}{2}\left[P'_{J+1/2}(\theta) + P'_{J-1/2}(\theta)\right]e^{i\phi} \tag{13.63b}$$

$$P_0(x) = 1, \qquad P_1(x) = x, \qquad P_2(x) = \frac{3x^2 - 1}{2}, \tag{13.63c}$$

$$P_3(x) = \frac{5x^3 - 3x}{2}, \ \ldots \tag{13.63d}$$

where the $P_L(\theta)$'s are Legendre polynomials in $x = \cos\theta$ and the prime denotes
differentiation with respect to $\cos\theta$. The other two amplitudes can be
obtained from Eq. (13.55d). Without loss of generality we can set $\phi = 0$ and we have

$$f_{--} = f_{++}, \qquad f_{-+} = -f_{+-} \tag{13.64}$$

In ordinary experiments, helicity states are not observed. In this case, the average
of the initial and sum of the final helicity states are taken to give

$$\frac{d\sigma}{d\Omega} = \frac12\sum_{\lambda,\lambda'}\left|f_{\lambda'\lambda}\right|^2 = |f_{++}|^2 + |f_{+-}|^2 \tag{13.65}$$

Polarization of the Nucleon If the target nuclei are unpolarized, nucleons in the
final states of $\pi N$ scattering are polarized perpendicular to the reaction plane. The
reasoning is as follows. For fixed z-axis, there are two spin degrees of freedom
in both the initial and final states. Consequently the scattering amplitude can be
expressed as a $2\times 2$ matrix. Any $2\times 2$ matrix can be expanded in terms of a unit
matrix and Pauli's matrices $\boldsymbol{\sigma}$. The rotational invariance of the interaction restricts
them to be scalars. Since the only available physical quantities in the center of mass
frame are the initial momentum $\boldsymbol{p}_i$ and final momentum $\boldsymbol{p}_f$, the scalars that can be
made are limited to

$$1, \qquad \boldsymbol{\sigma}\cdot\boldsymbol{p}_i, \qquad \boldsymbol{\sigma}\cdot\boldsymbol{p}_f, \qquad \boldsymbol{\sigma}\cdot\boldsymbol{p}_i\times\boldsymbol{p}_f \tag{13.66}$$

$\boldsymbol{\sigma}\cdot\boldsymbol{p}_i$ and $\boldsymbol{\sigma}\cdot\boldsymbol{p}_f$ change sign under the parity operation and hence do not contribute
in parity-conserving reactions. Only the terms 1 and $\boldsymbol{\sigma}\cdot\boldsymbol{p}_i\times\boldsymbol{p}_f$ are allowed and the
former does not change the polarization of the nucleon. From the latter, we know
that the polarization generated by the scattering is perpendicular to the scattering plane.

If we set $\phi = 0$, the scattering occurs in the x-z plane and the polarization is
in the y direction. If we adopt the z-axis along the final momentum direction, the
eigenstates of $\sigma_y$ can be constructed from the helicity eigenstates in the following way:

$$|\!\uparrow\rangle = \frac{1}{\sqrt{2}}\,(|+\rangle + i|-\rangle), \qquad |\!\downarrow\rangle = \frac{1}{\sqrt{2}}\,(|+\rangle - i|-\rangle) \tag{13.67}$$

The probability of the spin being upward ($s_y = 1/2$) is given by

$$\frac{d\sigma(\uparrow)}{d\Omega} = \frac12\sum_i\left|\langle\uparrow|f|i\rangle\right|^2 = \frac12\sum_i\left|\frac{f_{+i} - i f_{-i}}{\sqrt{2}}\right|^2 = \mathrm{Im}(f_{+-}^* f_{++}) + \frac12\left(|f_{++}|^2 + |f_{+-}|^2\right) \tag{13.68}$$

As the probability of the spin being downward can be calculated similarly, the
degree of polarization is given by

$$P = \frac{d\sigma(\uparrow) - d\sigma(\downarrow)}{d\sigma(\uparrow) + d\sigma(\downarrow)} = \frac{2\,\mathrm{Im}(f_{+-}^* f_{++})}{|f_{++}|^2 + |f_{+-}|^2} \tag{13.69}$$

The polarization gives information independent from that given by the scattering
cross section. For instance, as is described in the following, the angular
distribution of scattering cross sections having the same $J$ but different $L$ is the same and
cannot be distinguished for different $L$ or equivalently different parity states. Then
a polarization measurement can resolve the degeneracy.
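To see how the polarization formula (13.69) behaves, the sketch below evaluates it for a mixture of an $S_{1/2}$ wave ($f_{1/2-}$) and a $P_{3/2}$ wave ($f_{3/2-}$), using Eqs. (13.63) at $\phi = 0$ with $P'_1 = 1$ and $P'_2 = 3\cos\theta$. The amplitude values are arbitrary inputs chosen for illustration only.

```python
import math


def helicity_amplitudes(theta, s_wave, p_wave):
    """f++ and f+- for S1/2 plus P3/2 waves, from Eq. (13.63) at phi = 0."""
    c = math.cos(theta)
    f_pp = math.cos(theta / 2) * (s_wave + p_wave * (3 * c - 1))
    f_pm = math.sin(theta / 2) * (s_wave + p_wave * (3 * c + 1))
    return complex(f_pp), complex(f_pm)


def polarization(theta, s_wave, p_wave):
    """Degree of polarization, Eq. (13.69)."""
    f_pp, f_pm = helicity_amplitudes(theta, s_wave, p_wave)
    numerator = 2 * (f_pm.conjugate() * f_pp).imag
    denominator = abs(f_pp) ** 2 + abs(f_pm) ** 2
    return numerator / denominator


single_wave = polarization(1.0, 0.0, 1.0)             # one partial wave only
same_phase = polarization(1.0, 1.0, 0.5)              # two waves, relatively real
out_of_phase = polarization(math.pi / 2, 1.0, 1.0j)   # 90 degree relative phase
print(single_wave, same_phase, out_of_phase)
```

A single partial wave, or two waves with the same phase, gives no polarization; a relative phase between the waves is needed, which is why polarization data carry information that the cross section alone does not.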




Differential Cross Section When only a specific $J$ contributes, the unpolarized
cross section is given by

$$\frac{d\sigma}{d\Omega} = \left(J + \tfrac12\right)^2|f_{J\pm}|^2\left[\left(d^J_{1/2,1/2}\right)^2 + \left(d^J_{-1/2,1/2}\right)^2\right] = 2\pi(2J+1)\,|f_{J\pm}|^2\,I_J(\theta) \tag{13.70}$$

where we used Eq. (E.18) and defined $I_J(\theta)$ as a normalized angular distribution
such that integration over the $4\pi$ solid angle gives 1. When $J = 1/2$, $f_{J-}$ corresponds
to $L = 0$ and $f_{J+}$ to $L = 1$. If only a single partial wave contributes, we have, using
Eqs. (13.63),

$$\begin{aligned}
S_{1/2}&: \quad f_{++} = f_{1/2-}\cos(\theta/2), \quad f_{+-} = f_{1/2-}\sin(\theta/2) \\
P_{1/2}&: \quad f_{++} = f_{1/2+}\cos(\theta/2), \quad f_{+-} = -f_{1/2+}\sin(\theta/2)
\end{aligned} \tag{13.71}$$

Using the notation commonly used in spectroscopy, $^{6)}$ the differential cross sections
are expressed as

$$S_{1/2}: \quad \frac{d\sigma}{d\Omega} = |f_{++}|^2 + |f_{+-}|^2 = |f_{1/2-}|^2 \tag{13.72a}$$

$$P_{1/2}: \quad \frac{d\sigma}{d\Omega} = |f_{1/2+}|^2 \tag{13.72b}$$
Similarly using Eq. (E.22)

(13.72c)
(13.72d)

(13.72e)

From the above formula and also from Eq. (13.70), we see that the angular
distributions are solely functions of $J$ and are independent of parity or $L$ for the same $J$.
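The statement that the angular distribution depends only on $J$ can be checked numerically from Eqs. (13.63) and (13.65). The sketch below evaluates $|f_{++}|^2 + |f_{+-}|^2$ for a single partial wave of unit strength, with Legendre derivatives taken with respect to $\cos\theta$ and computed by the standard recurrences; the function names are illustrative assumptions.

```python
import math


def legendre_and_derivative(n, x):
    """Return (P_n(x), dP_n/dx) using the standard recurrence relations."""
    if n == 0:
        return 1.0, 0.0
    p_prev, p_curr = 1.0, x        # P_0, P_1
    d_prev, d_curr = 0.0, 1.0      # P_0', P_1'
    for k in range(1, n):
        p_next = ((2 * k + 1) * x * p_curr - k * p_prev) / (k + 1)
        d_next = d_prev + (2 * k + 1) * p_curr   # P'_{k+1} = P'_{k-1} + (2k+1) P_k
        p_prev, p_curr = p_curr, p_next
        d_prev, d_curr = d_curr, d_next
    return p_curr, d_curr


def single_wave_distribution(two_j, sign, theta):
    """|f++|^2 + |f+-|^2 for one wave f_{J,sign} = 1 (Eq. 13.63, phi = 0).

    two_j is 2J (odd); sign = -1 means L = J - 1/2, sign = +1 means L = J + 1/2.
    """
    x = math.cos(theta)
    d_up = legendre_and_derivative((two_j + 1) // 2, x)[1]   # P'_{J+1/2}
    d_lo = legendre_and_derivative((two_j - 1) // 2, x)[1]   # P'_{J-1/2}
    f_pp = math.cos(theta / 2) * (d_up - d_lo)
    f_pm = -sign * math.sin(theta / 2) * (d_up + d_lo)
    return f_pp ** 2 + f_pm ** 2


angles = [0.1 * k for k in range(32)]
p32 = [single_wave_distribution(3, -1, t) for t in angles]   # P3/2
d32 = [single_wave_distribution(3, +1, t) for t in angles]   # D3/2
expected = [1 + 3 * math.cos(t) ** 2 for t in angles]
print(max(abs(a - b) for a, b in zip(p32, expected)))
print(max(abs(a - b) for a, b in zip(d32, expected)))
```

Both $J = 3/2$ waves give $1 + 3\cos^2\theta$, and the same function returns an isotropic distribution for $J = 1/2$.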

13.3.3 

Resonance Extraction 

Reactions between pions ($\pi$) and nucleons (denoted as N, not to be confused with
n to denote neutrons) are among the best investigated phenomena. This is because
the $\pi$ meson is light and an intense pion beam can easily be made by bombarding
any material with protons with energy higher than 300 MeV. Resonances, which
appear as large peaks of the scattering cross sections, were of immediate interest.
With the advent of (then) high-energy intense accelerators, together with the
invention of a powerful detector, the bubble chamber (see Fig. 14.2 for a more detailed
description), many hadronic resonances were discovered and investigated in the
1960s. Although the resonances as well as the pion and the nucleon (collectively
referred to as hadrons) are actually composites of quarks, we will stick mostly to
the terminology used in spectroscopy in this chapter. The composite nature of the
hadrons will be treated in the next chapter. Kaons and hyperons Y (baryons that
carry strangeness quantum number) were also investigated in this era. Apart from
selection rules arising from the strangeness, kinematical structures of $KN$ or $\pi Y$
are similar to those of $\pi N$. Therefore, we mostly concentrate on $\pi N$ phenomena.
Resonances in the $\pi N$ system denoted as $N^*, N^{**}, \ldots$ are dynamical
enhancements of many particles in scattering states. Like those of atoms and nuclei, higher
resonances quickly descend to lower energy levels, eventually settling in the ground
state as a nucleon and mesons. $^{7)}$ Namely

$$\pi + N \to N^{**} \to N^* + n\pi \to N + n'\pi \tag{13.73}$$

6) S, P, D, F, ... denote $L = 0, 1, 2, 3, \ldots$

7) … Weakly decaying particles (K, $\Lambda$, $\Sigma$, …), which have lifetimes of $10^{-10}$ to $10^{-8}$ s, are
regarded as stable in strong interactions.

Because of baryon number conservation, $N^{**}$ and $N^*$ have the baryon number 1.
It is a number assigned to the nucleon, hyperon and their excited states. In
contrast, mesons have baryon number 0. A general pattern of $\pi N$ reactions is depicted
in Fig. 13.10. Hadronic resonances typically have widths ($\Gamma$) of 10 to 100 MeV, which
means their lifetimes are $\tau \sim \hbar/\Gamma \sim 10^{-22}$ to $10^{-23}$ s. $^{8)}$ At flight velocity near the
speed of light their flight distance before decay is at most $\sim 10^{-15}$ m,
comparable to hadron size and far below observable limits. What is observed is debris of
decayed resonances. However, energy-momentum conservation helps us to
reconstruct their mass. We define the invariant mass $M$ of a many-particle system by



$$M^2 = (p_1 + p_2 + \cdots)^2 = (E_1 + E_2 + \cdots)^2 - (\boldsymbol{p}_1 + \boldsymbol{p}_2 + \cdots)^2 \tag{13.74}$$

where the $p_i$'s are four-component momenta of particles. Therefore, one method to
find the resonances is to pick up $m\pi$ or $m\pi + N$ out of an n-particle final state
in the multi-pion production reaction, plot the number of appearances as a
function of their invariant mass and see if a peak or peaks appear. In the special case
where resonances can be formed by an incoming $\pi$ (particle 1) and a target nucleon
(particle 2) (Fig. 13.10c), the invariant mass is given by

$$M^2 = (p_1 + p_2)^2 = s = (E_1 + m_2)^2 - \boldsymbol{p}_1^2 = (m_1 + m_2)^2 + 2m_2T_1 \tag{13.75}$$

Here we assumed the target is at rest and used $\boldsymbol{p}_2 = 0$. A plot of scattering cross
section as a function of the invariant mass immediately indicates the existence of
a resonance as a peak or bump (see Fig. 13.8) and its mass can be determined by
Eq. (13.75). To sort out many resonances, their quantum numbers, including spin,
parity and isospin, have to be determined.

8) … because of Lorentz time dilation, but we are considering only the order of magnitude.
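Equations (13.74) and (13.75) translate directly into a few lines of code. The sketch below assumes rounded particle masses (charged pion 139.57 MeV, proton 938.27 MeV) and natural units with energies and momenta in MeV; the function names are illustrative.

```python
import math

M_PION = 139.57     # MeV, charged pion (rounded)
M_PROTON = 938.27   # MeV (rounded)


def invariant_mass(four_momenta):
    """Invariant mass of a list of (E, px, py, pz) four-momenta, Eq. (13.74)."""
    e = sum(p[0] for p in four_momenta)
    px = sum(p[1] for p in four_momenta)
    py = sum(p[2] for p in four_momenta)
    pz = sum(p[3] for p in four_momenta)
    return math.sqrt(e ** 2 - px ** 2 - py ** 2 - pz ** 2)


def sqrt_s_fixed_target(t_kin, m_beam=M_PION, m_target=M_PROTON):
    """Total CM energy for beam kinetic energy t_kin on a target at rest, Eq. (13.75)."""
    return math.sqrt((m_beam + m_target) ** 2 + 2 * m_target * t_kin)


# Pion with 190 MeV kinetic energy hitting a proton at rest.
t_pi = 190.0
e_pi = t_pi + M_PION
p_pi = math.sqrt(e_pi ** 2 - M_PION ** 2)
mass_from_vectors = invariant_mass([(e_pi, 0.0, 0.0, p_pi),
                                    (M_PROTON, 0.0, 0.0, 0.0)])
mass_from_formula = sqrt_s_fixed_target(t_pi)
print(mass_from_vectors, mass_from_formula)
```

Both routes give about 1232 MeV, so a pion kinetic energy just below 200 MeV on a stationary proton sits on the $\Delta(1232)$ peak.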

Resonance Behavior Let us look at the behavior of a resonance when it is a
dominant contribution to the scattering amplitude. Since the resonance is an unstable
intermediate state, we may treat it as a particle that has energy with negative
imaginary part $\Gamma/2$ [see Eq. (2.14)]. When a resonance intermediate state $|R\rangle$ dominates,
perturbation expansion of the scattering amplitude in quantum mechanics gives

$$T_{fi} \sim \sum_n \frac{\langle f|H_I|n\rangle\langle n|H_I|i\rangle}{E_0 - E_n} \simeq \frac{\langle f|H_I|R\rangle\langle R|H_I|i\rangle}{E_0 - (E_R - i\frac{\Gamma}{2})} \sim \frac{\sqrt{\Gamma_f\Gamma_i}}{E_0 - (E_R - i\frac{\Gamma}{2})} \tag{13.76}$$

Here $\Gamma_f \sim |\langle f|H_I|R\rangle|^2$, $\Gamma_i \sim |\langle i|H_I|R\rangle|^2$ are partial decay widths to the states
$|f\rangle$, $|i\rangle$ and the total decay width is given by

$$\Gamma = \sum_f \Gamma_f \tag{13.77}$$

The elastic scattering matrix for a single partial wave can be expressed as

$$S = e^{2i\delta} = \frac{\cot\delta + i}{\cot\delta - i} \tag{13.78}$$

Let us expand $\cot\delta$ in a Taylor series in the neighborhood of $E_R$, which gives $\delta = \pi/2$,

$$\cot\delta = \cot\delta\big|_{E_R} + (E - E_R)\,\frac{d\cot\delta}{dE}\bigg|_{E_R} + \cdots \tag{13.79}$$

At $E = E_R$ the real part of the denominator vanishes, and we can put

$$\frac{d\cot\delta}{dE}\bigg|_{E_R} = -\frac{d\delta}{dE}\bigg|_{E_R} \equiv -\frac{2}{\Gamma} \tag{13.80a}$$

The scattering matrix can be approximated as

$$S \simeq \frac{E - E_R - i(\Gamma/2)}{E - E_R + i(\Gamma/2)} \tag{13.80b}$$

which results in the scattering amplitude

$$f(\theta) \sim \frac{e^{i\delta}\sin\delta}{p} = -\frac{1}{p}\,\frac{(\Gamma/2)}{E - E_R + i(\Gamma/2)} \tag{13.81}$$

This has the desired form to reproduce a resonant shape of the scattering cross
section. In other words, the resonance occurs where the phase shift $\delta_J$ of a partial
wave becomes $\pi/2 \pm n\pi$. As can be seen from the definition of $\Gamma$ [Eq. (13.80a)],
if the phase shift as a function of energy increases rapidly in the vicinity of $\pi/2$, $\Gamma$
becomes small, giving a sharp peak.
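The link between the width and the speed of the phase shift can be made concrete with a short numerical check of Eqs. (13.80) and (13.81). The resonance parameters below ($E_R = 1232$ MeV, $\Gamma = 120$ MeV) are illustrative round numbers, not fitted values.

```python
import cmath
import math


def bw_s_matrix(e, e_r, gamma):
    """Breit-Wigner form of the elastic S-matrix, Eq. (13.80b)."""
    return (e - e_r - 0.5j * gamma) / (e - e_r + 0.5j * gamma)


def phase_shift(e, e_r, gamma):
    """Phase shift delta in [0, pi) such that S = exp(2 i delta)."""
    return (cmath.phase(bw_s_matrix(e, e_r, gamma)) / 2) % math.pi


def sin2_delta(e, e_r, gamma):
    """|e^{i delta} sin(delta)|^2, the energy dependence of Eq. (13.81) times p^2."""
    return math.sin(phase_shift(e, e_r, gamma)) ** 2


E_R, GAMMA = 1232.0, 120.0          # MeV, illustrative values
delta_at_peak = phase_shift(E_R, E_R, GAMMA)
h = 0.01
slope = (phase_shift(E_R + h, E_R, GAMMA) - phase_shift(E_R - h, E_R, GAMMA)) / (2 * h)
half_height = sin2_delta(E_R + GAMMA / 2, E_R, GAMMA)
print(delta_at_peak, slope, 2 / GAMMA, half_height)
```

The phase shift passes through $\pi/2$ at $E_R$ with slope $2/\Gamma$, and the amplitude squared falls to half its peak value at $E_R \pm \Gamma/2$, so a faster phase motion means a narrower peak.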

Unitary Limit For each partial wave, the range of values of $S_J = e^{2i\delta_J}$ is constrained
to $|e^{2i\delta_J}| \le 1$ and we have the following unitary limit for the scattering amplitude:

$$|f_J| = \left|\frac{e^{2i\delta_J} - 1}{2ip}\right| \le \frac{1}{p} \tag{13.82}$$

This is equivalent to saying that the probability should not exceed 1. The cross
section of a partial wave for an unpolarized target in the vicinity of the resonance is
given by

$$\frac{d\sigma}{d\Omega} = \frac{2\pi(2J+1)}{p^2}\,\frac{(\Gamma/2)^2}{(E - E_R)^2 + (\Gamma/2)^2}\,I_J(\theta) \tag{13.83}$$

For inelastic scattering ($i \ne f$) it becomes [see Eq. (13.76)]

$$\frac{d\sigma}{d\Omega} = \frac{2\pi(2J+1)}{p^2}\,\frac{(\Gamma_i/2)(\Gamma_f/2)}{(E - E_R)^2 + (\Gamma/2)^2}\,I_J(\theta) \ ^{9)} \tag{13.84}$$

In particular, for $J = 3/2$

$$\frac{d\sigma}{d\Omega} = \frac{8\pi}{p^2}\,\frac{(\Gamma_i/2)(\Gamma_f/2)}{(E - E_R)^2 + (\Gamma/2)^2}\,\frac{1 + 3\cos^2\theta}{8\pi} \tag{13.86}$$

From Eq. (13.83), we see that the resonance occurs at $E = E_R$ and when no inelastic
channels are open it can reach the unitary limit.
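Integrating Eq. (13.83) over the solid angle at $E = E_R$ gives a peak cross section of $2\pi(2J+1)/p^2$ for $\pi N$ scattering. The sketch below puts numbers into this expression at $\sqrt{s} = 1232$ MeV, assuming rounded masses and the conversion constants $\hbar c = 197.327$ MeV fm and 1 fm$^2$ = 10 mb; the helper names are illustrative.

```python
import math

HBARC = 197.327     # MeV fm
M_PION = 139.57     # MeV (rounded)
M_PROTON = 938.27   # MeV (rounded)


def cm_momentum(sqrt_s, m1, m2):
    """Center-of-mass momentum of a two-body system with total energy sqrt_s."""
    s = sqrt_s ** 2
    return math.sqrt((s - (m1 + m2) ** 2) * (s - (m1 - m2) ** 2)) / (2 * sqrt_s)


def unitary_limit_mb(j, p):
    """Peak elastic cross section 2*pi*(2J+1)/p^2 in millibarn (p in MeV/c)."""
    wavelength_sq_fm2 = (HBARC / p) ** 2
    return 2 * math.pi * (2 * j + 1) * wavelength_sq_fm2 * 10.0   # 1 fm^2 = 10 mb


p_peak = cm_momentum(1232.0, M_PION, M_PROTON)
limits = {j: unitary_limit_mb(j, p_peak) for j in (0.5, 1.5, 2.5)}
print(p_peak)
print(limits)
```

With $p \approx 227$ MeV/c the limit is roughly 95, 190 and 285 mb for $J = 1/2, 3/2, 5/2$, so a measured peak of about 200 mb singles out $J = 3/2$.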

Spin and Isospin of $\Delta(1232)$ $\Delta(1232)$, denoted as $\Delta$ in this chapter, appeared in
the $\pi N$ scattering cross section as the first prominent peak in Fig. 13.8.
Its isospin is 3/2 since four charged states ($\Delta^{++}, \Delta^+, \Delta^0, \Delta^-$) were observed.
In order to determine its angular momentum $J$, we compare the total cross section
with the unitary limit. As it occurs around $T \sim 180$ MeV, which is low enough to be
able to ignore the inelastic channel contribution, we may approximate $\Gamma_i = \Gamma_f =
\Gamma$. Then the cross section should take the value of the unitary limit at the resonance

$$\sigma = \frac{2\pi(2J+1)}{p^2} \tag{13.87}$$

Figure 13.11a [264] shows that the unitary limit is reached almost completely,
which determines the spin of $\Delta(1232)$ to be $J = 3/2$. Then, the angular
distribution should be that of $J = 3/2$ given by Eq. (13.86). The data [31] (Fig. 13.11b) at
$T = 170$ MeV confirm this. As previously explained, the angular distribution at the
resonance alone cannot distinguish the $L$ value or parity of $\Delta$,

9) … averaged $\pi N$ scattering. More generally, for incident particles with spins $s_a$ and $s_b$, the
resonance formula is given by

$$\sigma = \frac{4\pi(2J+1)}{(2s_a+1)(2s_b+1)}\,\frac{1}{p^2}\,\frac{(\Gamma_i/2)(\Gamma_f/2)}{(E - E_R)^2 + (\Gamma/2)^2} \tag{13.85}$$

$$P_\Delta = P_\pi P_p(-1)^L = (-1)^{L+1} \tag{13.88}$$

but considering the interference effect with other partial waves, it can be
determined. Classically, the orbital angular momentum corresponds to $\boldsymbol{L} = \boldsymbol{r}\times\boldsymbol{p}$,
where $\boldsymbol{r}$ is restricted by the nuclear force range ($|\boldsymbol{r}| < R$), which is known to
be $\sim 10^{-13}$ cm. At the $\Delta$ region, the pion momentum is $\sim 300$ MeV/c, and
only $L = 0, 1$ are significant. Thus we are led to conjecture that $\Delta$ is $P_{3/2}$ with $L = 1$.
To confirm this, let us consider how the angular distribution is affected if there is a
small $L = 0$ background, which is expected to vary slowly. Using Eqs. (13.63), the
angular distribution including both S ($L = 0$) and P ($L = 1$) wave contributions is
expressed as

$$d\sigma \sim |S|^2 + |P|^2(1 + 3\cos^2\theta) + 4\,\mathrm{Re}(S^*P)\cos\theta \tag{13.89a}$$

On the other hand, if we assume S ($L = 0$) and D ($L = 2$), the expression becomes

$$d\sigma \sim |S|^2 + |D|^2(1 + 3\cos^2\theta) + 4\,\mathrm{Re}(S^*D)(3\cos^2\theta - 1) \tag{13.89b}$$

From Eqs. (13.89), we see that the interference term gives $\cos\theta$ dependence and
produces asymmetry for the assignment $\Delta$ = P, but no asymmetry for D. In
general, asymmetric terms arise from interferences between waves of opposite parity.
The resonance amplitude is expressed as

$$f_J \sim \frac{1}{E - E_R + i(\Gamma/2)} = \frac{E - E_R - i(\Gamma/2)}{(E - E_R)^2 + (\Gamma/2)^2} \tag{13.90}$$

and its functional form is described in Fig. 13.12. Assuming the S wave does not
produce a resonance in this region, its amplitude is a slowly varying function. Then
the interference term with the real part of the resonance amplitude changes its sign
below and above the resonance energy. Angular distributions at $T_\pi = 98, 310$ MeV
shown in Fig. 13.11b behave exactly as expected for the existence of $\cos\theta$
interference, which vanishes at the resonance but produces opposite asymmetries below
and above the resonance, confirming our previous conjecture that $L = 1$. In
summary, the spin parity of $\Delta$ is determined to be $J^P = (3/2)^+$.
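The sign flip of the forward-backward asymmetry across the resonance can be illustrated with Eq. (13.89a) and the resonance amplitude of Eq. (13.81). In the sketch below the S wave is taken as a small real constant and the P wave as a Breit-Wigner amplitude with illustrative parameters ($E_R = 1232$ MeV, $\Gamma = 120$ MeV); these inputs are assumptions for demonstration, not a fit.

```python
def resonance_amplitude(e, e_r, gamma):
    """Energy dependence of the resonant wave, -(Gamma/2)/(E - E_R + i Gamma/2)."""
    return -(gamma / 2) / (e - e_r + 0.5j * gamma)


def angular_distribution(cos_theta, s_wave, p_wave):
    """S plus P3/2 angular distribution of Eq. (13.89a), up to normalization."""
    s_wave, p_wave = complex(s_wave), complex(p_wave)
    interference = 4 * (s_wave.conjugate() * p_wave).real
    return (abs(s_wave) ** 2
            + abs(p_wave) ** 2 * (1 + 3 * cos_theta ** 2)
            + interference * cos_theta)


def forward_backward_asymmetry(e, s_wave=0.1, e_r=1232.0, gamma=120.0):
    """(forward - backward)/(forward + backward) at cos(theta) = +1 and -1."""
    p_wave = resonance_amplitude(e, e_r, gamma)
    forward = angular_distribution(+1.0, s_wave, p_wave)
    backward = angular_distribution(-1.0, s_wave, p_wave)
    return (forward - backward) / (forward + backward)


below = forward_backward_asymmetry(1180.0)
at_peak = forward_backward_asymmetry(1232.0)
above = forward_backward_asymmetry(1290.0)
print(below, at_peak, above)
```

The asymmetry vanishes at the peak, where the resonant amplitude is purely imaginary, and has opposite signs on the two sides; which side is forward-peaked depends on the sign of the background wave.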
Figure 13.11 (a) The production cross section of $\Delta(1232)$ reaches its unitary limit
$\sigma = 8\pi/p^2$ [264]. (b) Angular distributions of the $\pi N$ system in the neighborhood
of $\Delta(1232)$ (horizontal axis: CM angle in degrees). At the resonance peak
($T_\pi = 180$ MeV, $s = (1232\ \text{MeV})^2$) the angular distribution is $\sim 1 + 3\cos^2\theta$. Below
and above the resonance, the angular distribution is asymmetric due to interference
with the S-wave amplitude (see text) [23, 31, 91…].

Figure 13.12 Resonance amplitude … and background amplitude.



13.3.4 

Argand Diagram: Digging Resonances 



Not all resonances are as conspicuous as $\Delta(1232)$. Some of them appear as peaks
and bumps, but many are hidden behind conspicuous resonances or overwhelming
backgrounds that come from inelastic channels. To disentangle many overlapping
resonances and backgrounds, it is necessary to make a detailed partial wave
analysis as a function of energy and angles. If it is assumed that the maximum
orbital angular momentum is limited by $L_{\max} \sim pR$, $R \sim 1/m_\pi$, the scattering
amplitude can be expressed as a finite number of partial waves. When the cross
section as well as polarization data are fitted as functions of energy and angle, every
partial wave scattering amplitude ($A_J$) can be determined, at least in principle. Let
us examine how these partial amplitudes behave. If inelastic channels are open that
damp the elastic waves, their effect can be taken into account by introducing elasticity
as defined in Eq. (13.61). Then the partial-wave scattering amplitude can be written

$$A_J = \frac{\eta_J\,e^{2i\delta_J} - 1}{2i} \tag{13.91}$$

where $\delta_J$ now denotes the real part of the phase shift.

Figure 13.13 A complex partial-wave amplitude lies within a circle of radius 1/2
(unitarity). For an elastic channel, it is located at the origin when the kinetic energy is
zero. As the energy increases, it moves clockwise or counterclockwise depending on
whether the force is repulsive or attractive.



On the $\mathrm{Re}(A_J)$-$\mathrm{Im}(A_J)$ plane, $A_J$ is confined to within a circle of radius 1/2 with
its center located at $x = 0$, $y = 1/2$. This is called the Argand diagram. This
is another expression for the unitary limit. Starting at the origin when the kinetic
energy $T = 0$, $A_J$ proceeds on the circle clockwise when the force is repulsive
and counterclockwise when attractive. As the rotation angle is given by $2\delta$ and its
distance from the center by $\eta_J/2$, the track departs from the circle when inelastic
channels such as $2\pi + N$, $3\pi + N$, $\ldots$ are open (see Fig. 13.13). When it crosses
the imaginary axis counterclockwise, $\delta_J = \pi/2$, it becomes a resonance amplitude.
When the speed at crossing is fast, the resonance width is narrow [see Eq. (13.80)].
In the vicinity of the resonance, the amplitude $A_J$ takes the form of Eq. (13.81); we
can express it as a function of two parameters $x_\alpha$ and $\varepsilon$ defined by

$$x_\alpha = \frac{\Gamma_\alpha}{\Gamma}, \qquad \varepsilon = (E - E_R)\,\frac{2}{\Gamma}$$

When many channels mix, the behavior of the resonance (in the Argand diagram)
is very complex (see Problem 13.8). Generally, if the track is circling counterclockwise,
a place where it runs parallel to the x-axis can be considered as a resonance.
For $\pi N$ and $KN$ scattering, there are rich stocks of data and detailed analyses have
been carried out [204]. Figure 13.14 shows part of two independent analyses using
different formalisms. In general, they show good agreement. We see that many
resonances that are not apparent in Fig. 13.8 were revealed by doing detailed partial
wave analysis.
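The geometry of the Argand diagram can be checked numerically. The following is a minimal sketch, assuming the standard parametrization $A_J = (\eta_J e^{2i\delta_J} - 1)/(2i)$ with phase shift $\delta_J$ and elasticity $\eta_J$; the function names are illustrative and not taken from the text. It confirms that the amplitude starts at the origin, reaches the top of the unitarity circle at $\delta_J = \pi/2$, and stays at a distance $\eta_J/2$ from the center $(0, 1/2)$.

```python
import cmath
import math


def partial_wave_amplitude(delta, eta=1.0):
    """Return A = (eta * exp(2i*delta) - 1) / (2i) for a phase shift delta in radians."""
    return (eta * cmath.exp(2j * delta) - 1.0) / 2j


def distance_from_center(amplitude):
    """Distance of the amplitude from the circle center at (0, 1/2)."""
    return abs(amplitude - 0.5j)


if __name__ == "__main__":
    # Elastic case (eta = 1): the point moves counterclockwise for delta > 0.
    for degrees in (0, 45, 90, 135):
        a = partial_wave_amplitude(math.radians(degrees))
        print(degrees, round(a.real, 3), round(a.imag, 3),
              round(distance_from_center(a), 3))
```

With $\eta_J < 1$ the same function traces a track inside the circle, which is how the opening of inelastic channels shows up in the diagram.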

Problem 13.8 

Show that resonances in the Argand diagram in Fig. 13.15 occur for the following 

cases: 

(a) Resonance amplitude only. 

(b, c) Resonance when it coexists with a background wave that has attractive force 
and the same spin-parity. 

(d) Resonance when it coexists with a background wave that has repulsive force 
and the same spin-parity.

Figure 13.14 Examples of $\pi N$ scattering partial-wave analysis. It is common to use
$N$ and $\Delta$ to denote $I = 1/2$ and $I = 3/2$ members with $S$ (strangeness) $= 0$.
(a) $P_{33}$ ($L = 1$, $I = J = 3/2$) contour includes $\Delta(1232)$, etc.
(b) $D_{13}$ ($L = 2$, $I = 1/2$, $J = 3/2$) contour includes $N(1520)$, etc.
(c) $S_{11}$ ($L = 0$, $I = 1/2$, $J = 1/2$) contour includes $N(1535)$, etc.
(d) $S_{31}$ ($L = 0$, $I = 3/2$, $J = 1/2$) contour includes $\Delta(1620)$, etc. [204].




13.4
ρ(770)

Mass and Width $\rho$ was the first resonance discovered in the $\pi\pi$ channel. Three
charged states are known

$$\rho^+ \to \pi^+ + \pi^0, \qquad \rho^0 \to \pi^+ + \pi^-,^{10)} \qquad \rho^- \to \pi^- + \pi^0 \qquad (13.93)$$

which means the isospin of $\rho$ is $I = 1$. If real $\pi\pi$ scattering is possible, its spin and
parity could be determined in the same way as $\Delta(1232)$. $\pi$ is an unstable particle
and cannot be used as a real target, but there is a way to make one effectively. Due
to the Heisenberg uncertainty principle, a proton can emit and reabsorb pions as
follows:

$$p \leftrightarrow p + \pi \qquad (13.94)$$



The transition is an effect of the strong interaction and occurs frequently. Consequently,
the proton can be considered as surrounded by a cloud of pions, which could
serve as the target. If we can shoot pions at the cloud, we can make $\pi\pi$ scattering
happen. A necessary condition is that the time for which a $\pi$ is virtually free is long
compared to the collision time. Referring to Fig. 13.16, this means the contribution
of diagram (b) must dominate. Using $(q, \mu)$, $(p_p, M)$, $(p_n, M)$ to denote the
four-momenta and masses of the target $\pi$, proton and neutron, respectively, the
condition for the pion to be free is

$$q^2 = q_\mu q^\mu = \mu^2 \qquad (13.95)$$

Figure 13.16 Diagrams contributing ... contributions (c, d).

In a frame where $|\mathbf{p}_p| = |\mathbf{p}_n| \equiv p$, the energies of the proton and the
neutron are the same, $E_p = E_n$, and

$$q^2 = (p_p - p_n)^2 = (E_p - E_n)^2 - (\mathbf{p}_p - \mathbf{p}_n)^2 = -2p^2(1 - \cos\theta) \le 0 \qquad (13.96)$$



$q^2$ is negative, meaning the pion is not free but a virtual particle. The difference between
the mass of the free and the virtual particle, $\delta\mu = \sqrt{\mu^2 - q^2}$, indicates the degree
of virtuality. From the uncertainty principle, the time duration for which the target
pion is free may be measured by $\tau_{\rm free} \sim 1/\delta\mu$. $q^2$ cannot be made positive. Then, to
make $\tau_{\rm free}$ large, i.e. $\delta\mu$ as small as possible, $|q^2|$ must be small, or from Eq. (13.96)
we have to make $\cos\theta \to 1$. Considering the requirement that the target $\pi$ should
be well away from the nucleon, in other words, that the space extension of the virtual $\pi$
should be much larger than the size of the proton $R \sim 1/\mu$,

$$|\mathbf{p}_p - \mathbf{p}_n| \simeq \sqrt{2p^2(1 - \cos\theta)} \lesssim \mu \qquad (13.97)$$

we see $\tau_{\rm free} \sim 1/\delta\mu \sim 1/\mu$. The collision time $\tau_{\rm coll}$ is given by the time necessary
for a particle to go through the target. Since the target looks thin because of the
Lorentz contraction, it is given by $\tau_{\rm coll} \sim R/c\gamma \sim (1/\mu)(E_{\rm in}/M)^{-1}$, where $E_{\rm in}$ is
the energy of the incoming $\pi$. Therefore conditions for $\tau_{\rm coll} \ll \tau_{\rm free}$ are given by
Eq. (13.97) plus

$$\frac{E_{\rm in}}{M} \gg 1 \qquad (13.98)$$

If they are satisfied we can consider the concept of the virtual target to be valid.11)



11) The objection may be raised that because of the Regge exchange dominance in
the high-energy forward scattering (see Section 13.7) the exchanged particle may
not be limited to a single $\pi$. The reason justifying the dominance of diagram (b)
in Fig. 13.16 comes from the experimental observation ... Since there is a factor
$\sqrt{2}$ difference between $\pi^+$ and $\pi^0$ in the Clebsch-Gordan coefficients of $\pi N$
coupling, the observed ...
Therefore, when performing the experiment

$$\pi^+ + p \to \pi^+ + \pi^0 + p \qquad (13.101)$$

which includes two $\pi$'s in the final state, if we set the incident $\pi$ energy sufficiently
high, and choose events with small recoil angle of the target proton, we are doing
the pseudoexperiment

$$\pi^+ + \pi_{\rm tgt} \to \pi^+ + \pi^0 \qquad (13.102)$$

where $\pi_{\rm tgt}$ is the $\pi$ emitted from the proton. Once the experimental conditions for $\pi\pi$
scattering are settled, the determination of the mass and width of a $\pi\pi$ resonance
is straightforward. All we have to do is to plot the cross section as a function of
incident energy and measure the position and width of a peak. The observed values
are

$$m(\rho) = 775\ {\rm MeV}, \qquad \Gamma = 146\ {\rm MeV}$$

Figure 13.17 Angular distributions in the di-pion CM frame of the reaction
$\pi^+ p \to \rho^+ + p \to \pi^+ + \pi^0 + p$. The abscissa is $\cos\theta_{\pi^+\pi^0}$.
(a) $640 < \sqrt{s} < 680$, (b) $700 < \sqrt{s} < 780$, (c) $790 < \sqrt{s} < 820$.
At $\sqrt{s} = m_\rho \sim 740$ MeV, the angular distribution fits with $\cos^2\theta$. The cross
section is not zero at $\theta = 90°$ because of nonresonant contributions. Note the
absence of the ... term below and above the resonance [8...].


Spin In a similar manner to the case of $\Delta(1232)$, by choosing the z-axis along the
incident $\pi^+$ direction, we can set $J_z = L_z + S_z = 0$, where we have used the
fact that the spin of $\pi$ is 0. If a resonance is made with angular momentum $J$, the
amplitude which the two $\pi$'s make is proportional to $d^J_{00} = P_J$ in the center of
mass (CM) frame of the two $\pi$'s. $P_J$ is a Legendre polynomial, and the angular
distribution is given by

$$I(\theta) \sim |P_0|^2 \sim {\rm constant} \qquad J = 0$$
$$I(\theta) \sim |P_1|^2 \sim \cos^2\theta \qquad J = 1$$
$$I(\theta) \sim |P_2|^2 \sim (3\cos^2\theta - 1)^2 \qquad J = 2$$

When we plot the cross section as a function of the invariant mass $m^2_{\pi\pi}$, the peak
position gives $m^2_\rho$. The angular distribution at the resonance is given in Fig. 13.17,
which fits well with $\cos^2\theta$ and proves $J^P = 1^-$.
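The three candidate angular distributions are easy to tabulate. The sketch below is illustrative only: it evaluates $|P_J(\cos\theta)|^2$ with the Bonnet recurrence for Legendre polynomials, and the function names are assumptions made for this example. It shows the features used in the spin assignment, namely that $J = 1$ gives a distribution vanishing at $90°$, while $J = 2$ vanishes at $\cos^2\theta = 1/3$.

```python
def legendre(j, x):
    """Legendre polynomial P_j(x) from the Bonnet recurrence
    (n + 1) P_{n+1} = (2n + 1) x P_n - n P_{n-1}."""
    if j == 0:
        return 1.0
    p_prev, p = 1.0, x
    for n in range(1, j):
        p_prev, p = p, ((2 * n + 1) * x * p - n * p_prev) / (n + 1)
    return p


def angular_distribution(j, cos_theta):
    """Unnormalized intensity |P_j(cos theta)|^2 for a spin-j resonance with J_z = 0."""
    return legendre(j, cos_theta) ** 2


if __name__ == "__main__":
    for c in (-1.0, -0.5, 0.0, 0.5, 1.0):
        print(c, [round(angular_distribution(j, c), 3) for j in (0, 1, 2)])
```

Nonresonant background, which the text notes is present in the data, is not modeled here.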



13.5
Final State Interaction

13.5.1
Dalitz Plot

In the previous section we described a method to extract $\pi\pi$ scattering from the
reaction

$$\pi + N \to \pi + \pi + N \qquad (13.105)$$



There, the target nucleon played an auxiliary role in providing a virtual $\pi$ target.
A more general method to establish the existence of resonances is to make an
invariant mass distribution of an arbitrary number of particles extracted from
multiparticle final states. Using the invariant mass, we can investigate resonances not
only in the $\pi\pi$ channel but also in many-$\pi$, $\pi N$ channels and other combinations.
In order to investigate the behavior of the matrix element, we need to know
event distributions when it is a constant, namely the distribution of phase space.
In general, when there are $n$ particles, the scattering cross section is given by (see
Section 6.5.4)

$$d\sigma,\ d\Gamma \propto |\langle f|T|i\rangle|^2\, dLIPS \qquad (13.106a)$$

$$dLIPS = \frac{1}{m!}(2\pi)^4\delta^4\Big(P_i - \sum_f p_f\Big)\prod_f \frac{d^3p_f}{(2\pi)^3 2E_f} \qquad (13.106b)$$

where $P_i$ is the total four-momentum in the initial state and $m!$ is a correction
factor for $m$ identical particles.

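Let us consider, as an example, a case with $n = 3$ like reaction Eq. (13.105).
Integrating over three-momentum $\mathbf{p}_3$, the rest of the integral becomes proportional
to

$$\frac{d^3p_1\, d^3p_2}{E_1 E_2 E_3}\,\delta(E - E_1 - E_2 - E_3) = \frac{p_1^2 dp_1 d\Omega_1\, p_2^2 dp_2 d\Omega_2}{E_1 E_2 E_3}\,\delta(E - E_1 - E_2 - E_3) \qquad (13.107)$$

where $p_i = |\mathbf{p}_i|$. In the CM frame of the three particles, using $\mathbf{p}_3 = -(\mathbf{p}_1 + \mathbf{p}_2)$,

$$E_3^2 = p_1^2 + p_2^2 + 2p_1p_2\cos\xi + m_3^2 \qquad (13.108)$$

where $\xi$ is the angle between $\mathbf{p}_1$ and $\mathbf{p}_2$. If the initial state is unpolarized, the
coordinate axis can be chosen in any direction, giving $\int d\Omega_1 = 4\pi$ and $d\Omega_2 =
2\pi\, d\cos\xi$. Using Eq. (13.108), variables can be changed from $(p_1, p_2, \cos\xi)$ to
$(E_1, E_2, E_3)$. Calculating the Jacobian of the transformation, we obtain

$$dp_1\, dp_2\, d\cos\xi = \frac{E_1E_2E_3}{p_1^2p_2^2}\, dE_1\, dE_2\, dE_3 \qquad (13.109)$$

Because of the $\delta$-function, the integration over $E_3$ is trivial, giving

$$dLIPS \propto \int \frac{p_1^2p_2^2}{E_1E_2E_3}\,\frac{E_1E_2E_3}{p_1^2p_2^2}\,\delta(E - E_1 - E_2 - E_3)\, dE_1 dE_2 dE_3 \propto dE_1\, dE_2 \qquad (13.110)$$

Namely, the phase space of the three-body final states is flat on the $E_1$-$E_2$ plane.
Therefore, the event rate $dN$ is given by

$$dN \propto |\langle f|T|i\rangle|^2\, dE_1\, dE_2 \qquad (13.111)$$

and the event number density on the $E_1$-$E_2$ plane is directly proportional to the
square of the matrix element. Such a plot is known as the Dalitz plot. The boundary of
the distribution is given by $|\cos\xi| \le 1$, and using Eq. (13.108) it leads to

$$\left(E_3^2 - m_3^2 - p_1^2 - p_2^2\right)^2 \le 4p_1^2p_2^2, \qquad E_3 = E - E_1 - E_2$$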




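Invariant masses can be used as variables instead of the particle energies. In the
CM frame with total energy $E$ and total four-momentum $P = (E, \mathbf{0})$, the invariant
mass of particles 2 and 3 is given by

$$m_{23}^2 = (p_2 + p_3)^2 = (P - p_1)^2 = E^2 + m_1^2 - 2EE_1, \qquad dm_{23}^2 = -2E\,dE_1$$

Similarly

$$dm_{31}^2 = -2E\,dE_2$$

We see the distribution on the $E_1$-$E_2$ plane is directly proportional to that on the
$m_{23}^2$-$m_{31}^2$ plane.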
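The kinematic boundary of the Dalitz plot follows directly from requiring $|\cos\xi| \le 1$ for the angle between $\mathbf{p}_1$ and $\mathbf{p}_2$ in the three-body CM frame. The following is a small sketch under that assumption; the function names and the numerical masses (approximate values for $K^+ \to 3\pi$ in MeV) are chosen for illustration only.

```python
import math


def cos_xi(e1, e2, e_total, m1, m2, m3):
    """Cosine of the angle between p1 and p2 in the three-body CM frame.

    Uses E3^2 = p1^2 + p2^2 + 2 p1 p2 cos(xi) + m3^2 with E3 = E - E1 - E2.
    Returns None if the energies are below threshold for any particle.
    """
    e3 = e_total - e1 - e2
    p1_sq = e1 * e1 - m1 * m1
    p2_sq = e2 * e2 - m2 * m2
    if p1_sq <= 0.0 or p2_sq <= 0.0 or e3 < m3:
        return None
    return (e3 * e3 - m3 * m3 - p1_sq - p2_sq) / (2.0 * math.sqrt(p1_sq * p2_sq))


def inside_dalitz(e1, e2, e_total, m1, m2, m3):
    """True if (E1, E2) lies inside the kinematically allowed region."""
    c = cos_xi(e1, e2, e_total, m1, m2, m3)
    return c is not None and abs(c) <= 1.0


if __name__ == "__main__":
    m_k, m_pi = 493.7, 139.57  # approximate masses in MeV (illustrative)
    e_sym = m_k / 3.0          # symmetric point: all three energies equal
    print(cos_xi(e_sym, e_sym, m_k, m_pi, m_pi, m_pi))
    print(inside_dalitz(140.0, 140.0, m_k, m_pi, m_pi, m_pi))
```

At the symmetric point the three momenta are at $120°$ to each other, so the function returns $\cos\xi = -1/2$; a point where two pions are nearly at rest cannot balance momentum and falls outside the boundary.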
Figure 13.18 [38] shows an event distribution on the $m^2(\bar{K}^0\pi^-)$-$m^2(n\pi^-)$ plane
obtained by shooting a 3 GeV $K^-$ beam on a neutron target (actually a deuteron target)
and extracting $\bar{K}^0\pi^- n$ from the final state. One can clearly see concentrations
of events corresponding to $K^*(892)$ and $\Delta(1232)$. Projection of events on each
coordinate axis shows a sharp peak on top of a smooth, slowly varying hill. The hill
corresponds to the phase space volume and the sharp peak reflects the shape of the
matrix element. It indicates that the matrix element $M$ is proportional to

$$\frac{1}{(E - E_R) + i\Gamma/2} \qquad (13.114)$$

and that the resonance appears as dominant intermediate states. Similarly, the
invariant mass distributions of $\pi^+\pi^0$, $\pi^+\pi^-$ and $\pi^+\pi^-\pi^0$ obtained from the
reaction $\pi^+ + p \to 2\pi$ (or $4\pi$) $+\ p$ are plotted in Fig. 13.19 [12]. One immediately
sees $\rho^+(770)$ in Fig. 13.19a, $\rho^0(770)$ in (b) and $\eta(548)$ and $\omega(782)$ in (c).

Figure 13.18 Dalitz plot of the reaction $K^- n \to \bar{K}^0\pi^- n$ ... in each coordinate
axis. Two resonances, $K^*(892)$ in $K\pi$ and $\Delta(1232)$ in the $n\pi$ system, are clearly seen [38].

Figure 13.19 Distribution of invariant masses in the final state of the reactions
(a) $\pi^+ + p \to$ ..., (b) $\pi^+ + p \to \pi^+ +$ ..., (c) $\pi^+ + p \to \pi^+ + \pi^+$ ... The
abscissa is the invariant mass (a) $m(\pi^+\pi^0)$, (b) $m(\pi^+\pi^-)$, (c) $m(\pi^+\pi^-\pi^0)$ in units of
MeV. One can see $\rho^+$ in (a), $\rho^0$ in (b) and $\eta(550)$ and $\omega(783)$ in (c) [12].



Thus, by observing invariant mass distributions of multiparticle final states in
$\pi N$ and $KN$ reactions, many resonances were discovered in the 1960s.



13.5.2
K Meson

In the decay

$$K^+ \to \pi^+ + \pi^+ + \pi^- \qquad (13.115)$$

the final-state particles have a good symmetry in that all the particles are $\pi$'s of
more or less the same mass. In such a case it is better to treat the three variables
$E_1$, $E_2$, $E_3$ equally. Instead of orthogonal coordinates, one may plot them as a point
on the inner area of an equilateral triangle as in Fig. 13.20. If we use the kinetic energies
$T_i = E_i - m_\pi$ as variables,

$$T_1 + T_2 + T_3 = m_K - 3m_\pi \equiv Q \qquad (13.116)$$

Since the sum of three perpendicular lines from a point P to the bases is equal to
the height CC' and is a constant, the point P can represent an event in $K \to 3\pi$.
We identify the lengths of three lines as $T_1$, $T_2$, $T_3$ divided by $Q$ so that the sum
is normalized to 1. The boundary is constrained by $|\cos\xi| \le 1$. For $T_i \ll m_\pi$,
it becomes a circle; this is almost the situation for K decay since the Q-value is
75 MeV, which has to be shared among the three $\pi$'s.

Figure 13.20 Symmetrical Dalitz plot for 3-particle decays. Point P corresponds to an
event with $T_1 + T_2 + T_3 = Q$, as the sum of the three perpendicular lines $T_1$, $T_2$, $T_3$ to
the bases is equal to the height CC', which is a constant. The abscissa measured from C is
equal to $(T_1 - T_2)/\sqrt{3}$. For nonrelativistic particles, the boundary is a circle (dashed line),
but is deformed to a shape close to a triangle as they become relativistic. For the shaded
regions I, II see the text.
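The Q-value and the triangular coordinates can be checked with a few lines of code. This is a sketch under stated assumptions: the masses are approximate charged kaon and pion values in MeV, the coordinates follow the convention of ordinate $T_3/Q$ and abscissa $(T_1 - T_2)/(\sqrt{3}Q)$, and the function names are illustrative. In the nonrelativistic limit with equal masses, a collinear configuration (a boundary point) should lie on the circle of radius 1/3 centered at $(0, 1/3)$.

```python
import math

M_K = 493.677   # approximate charged kaon mass in MeV (assumed value)
M_PI = 139.570  # approximate charged pion mass in MeV (assumed value)


def q_value(parent=M_K, daughter=M_PI, n_daughters=3):
    """Kinetic energy released in the decay, Q = M - n * m."""
    return parent - n_daughters * daughter


def dalitz_point(t1, t2, t3):
    """Map kinetic energies to normalized triangular Dalitz coordinates (x, y)."""
    q = t1 + t2 + t3
    return (t1 - t2) / (math.sqrt(3.0) * q), t3 / q


def radius_from_center(t1, t2, t3):
    """Distance of the Dalitz point from the center (0, 1/3)."""
    x, y = dalitz_point(t1, t2, t3)
    return math.hypot(x, y - 1.0 / 3.0)


if __name__ == "__main__":
    print("Q =", round(q_value(), 2), "MeV")
    # Nonrelativistic collinear momenta (3, -1, -2): T_i proportional to p_i^2.
    print(radius_from_center(9.0, 1.0, 4.0))
```

The collinear example uses $T_i \propto p_i^2$ with momenta summing to zero, which is valid only in the nonrelativistic limit discussed in the text.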



Problem 13.9

Show that the boundary in the triangular Dalitz plot of Fig. 13.20 is a circle or a
triangle with apexes at A, B, C for $T_i \ll m_\pi$ or $T_i \gg m_\pi$.

At the boundary, two of the momenta are parallel or antiparallel. On the lines
AA', BB', CC', the energy of the two particles is the same. Setting $\pi_1 = \pi_2 = \pi^+$,
the distribution should be symmetric for $1 \leftrightarrow 2$ exchange, and hence we use only
the right half of the plane. If we set the orbital angular momentum in the CM
frame of $\pi^+\pi^+$ as $L_+$ and the relative orbital angular momentum of $\pi^-$ to the
two-$\pi^+$ CM as $L_-$, the spin of $K^+$ ($J$) is in the range

$$|L_+ - L_-| \le J \le L_+ + L_- \qquad (13.117)$$

Since $L_+$ is the orbital angular momentum of two identical spinless particles, it
must be even. We list possible values of $J$ in Table 13.2. If $J \ne 0$, one of $L_+$, $L_-$ is
not zero. If $L_+ \ge 2$, and $\pi^+\pi^+$ are relatively at rest, the angular momentum barrier
(see the following section) would diminish the value of the matrix element. This
happens when the energy of $\pi^-$ is maximum, namely in the region I in Fig. 13.20.
Similarly if $L_- > 0$, the event rate in region II, where $\pi^-$ is at rest, would also
decrease. The data shown in Fig. 13.21 [305] seem very uniform over the entire
region and we conclude the spin of $K^+$ is zero.

Table 13.2 Possible values of $J^P$ for given $(L_+, L_-)$.

L_-    L_+ = 0    L_+ = 2
0      0^-        2^-
1      1^+        1^+, 2^+, 3^+
2      2^-        0^-, 1^-, 2^-, 3^-, 4^-

Figure 13.21 Dalitz plot of $K^+ \to \pi^+ + \pi^+ + \pi^-$. The ordinate is $T_3$ and the abscissa
is $(T_1 - T_2)/\sqrt{3}$ normalized to $Q$. The solid line describes a circle and the dashed line
indicates the actual boundary. From the uniformity of the distribution, the spin of the K
meson was determined to be 0 [305].

The three-$\pi$ final state has parity $P_\pi^3(-1)^{L_+ + L_-} = -1$ and was called the $\tau$ decay
mode. There is another decay channel, $K^+ \to \pi^+ + \pi^0$, which was called the
$\theta$ mode. The parity of $\theta$ is positive, hence $\tau$ and $\theta$ must be different particles if
parity is conserved. However, $\tau$ and $\theta$ had the same mass and lifetime and were
most likely the same particle. The two contradictory facts were called the $\tau$-$\theta$ puzzle.
It led Lee and Yang to question the validity of parity invariance in the weak
interaction. We now know that parity is maximally violated in the weak interaction and
hence the parity of the K meson cannot be determined with reactions in which the
weak force is active. The parity of K was eventually determined by kaon capture by
helium, forming a hypernucleus in the reaction


$$K^- + {}^4{\rm He} \to {}^4_\Lambda{\rm H} + \pi^0 \qquad (13.118)$$

In the hypernucleus one neutron is replaced with a $\Lambda$ particle having $m =
1116$ MeV and strangeness $S = -1$. The measurements showed that both ${}^4{\rm He}$ and
${}^4_\Lambda{\rm H}$ have $J = 0$. The K meson is expected to be captured in the S ($l = 0$) state and
the parity of K is determined to be negative, assuming the parity of $\Lambda$ relative to the
proton is positive. The indefiniteness originates from the fact that the strangeness
is conserved in the strong interaction (see argument in Section 9.2.1).



13.5.3

Angular Momentum Barrier

When the range $R$ of a force is finite, classically there should be no contributions
from partial waves with $l > pR$, where $p$ is the particle momentum or wave
number. In quantum mechanics, however, all the partial waves have components
at $r < R$ and they contribute. This can be seen as follows. A plane wave can be
expanded as

$$e^{ipz} = \sum_l i^l(2l + 1)P_l(\cos\theta)\,j_l(pr) \qquad (13.119)$$

This shows that a component of a partial wave in the plane wave is proportional
to the regular solution, the spherical Bessel function $j_l(pr)$. Since $\lim_{pr \to 0} j_l(pr) \sim
(pr)^l$, at low energies where $p \ll R^{-1}$, the $l$th component of the partial wave in
the potential is proportional to $(pR)^l$. It can be rephrased as follows. If a particle
has angular momentum $l$, a centrifugal potential $\propto l(l + 1)/r^2$ is created, which
acts as a repulsive force. When it is superimposed on the short-range attractive
force, the potential looks like that in Fig. 13.22. An excited bound state level can be
formed far below the peak of the barrier but with $E > 0$. It is an unstable state,
because the wave can penetrate the barrier to leak outside of it (tunneling effect)
and thus has finite probability of decaying. With the above reasoning, the decay
matrix element becomes proportional to $p^l$, making the decay rate $\Gamma \propto p^{2l+1}$.
The mechanism is effective for incoming waves (scattering or absorption) as well
as outgoing waves, and makes the scattering amplitude proportional to $p^{2l}$. Then
Eq. (13.62c) tells us that $\delta_l \sim p f_l \sim p^{2l+1}$. Thus, in the low-energy region ($p \ll
R^{-1}$), components with high angular momentum are highly damped, which is
referred to as the angular momentum barrier.
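The small-argument behavior of the spherical Bessel functions that underlies the barrier factor can be verified numerically. The sketch below uses the closed forms of $j_0$, $j_1$, $j_2$ and compares them with the leading term $x^l/(2l+1)!!$; the function names are illustrative, and only $l \le 2$ is covered.

```python
import math


def spherical_bessel_j(l, x):
    """Spherical Bessel function j_l(x) from closed forms, for l = 0, 1, 2."""
    if l == 0:
        return math.sin(x) / x
    if l == 1:
        return math.sin(x) / x ** 2 - math.cos(x) / x
    if l == 2:
        return (3.0 / x ** 3 - 1.0 / x) * math.sin(x) - 3.0 * math.cos(x) / x ** 2
    raise ValueError("only l = 0, 1, 2 are implemented in this sketch")


def small_argument_limit(l, x):
    """Leading behavior x^l / (2l + 1)!! valid for x << 1."""
    double_factorial = 1
    for k in range(1, 2 * l + 2, 2):
        double_factorial *= k
    return x ** l / double_factorial


if __name__ == "__main__":
    x = 0.1  # plays the role of pR << 1
    for l in (0, 1, 2):
        print(l, spherical_bessel_j(l, x), small_argument_limit(l, x))
```

Each additional unit of $l$ suppresses the wave inside the range of the force by a further factor of order $pR$, which is the damping referred to in the text.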




Figure 13.22 The quantum tunneling effect allows a particle to leak in or out through the
angular momentum barrier with a damping factor $\sim (pR)^l$.

13.5.4
ω Meson



The $\omega$ meson was discovered in the reaction [268]

$$\bar{p} + p \to \pi^+ + \pi^+ + \pi^- + \pi^- + \pi^0$$

as a relatively narrow resonance in the $\pi^+\pi^-\pi^0$ system. We also have seen it in
Fig. 13.19c. It was seen only in neutral combinations and not in charged ones. It fixes
the isospin $I = 0$. The $\omega$ meson has mass and width



$$m_\omega = 782\ {\rm MeV}, \qquad \Gamma_\omega = 8.5\ {\rm MeV} \qquad (13.121)$$

This implies the decay is strong and that the G parity is $-1$. $I = 0$ implies $C = -1$.
Since the isospin is a good quantum number in the strong interaction, it constrains
the form of the wave function. As $\pi$ is an isovector, the only way to construct an $I = 0$
combination from three pions is $\mathbf{I}_1 \cdot \mathbf{I}_2 \times \mathbf{I}_3$, which is totally antisymmetric. Then
Bose statistics fixes the spatial part of the wave function to be totally antisymmetric,
too. This property can be used to determine the spin and parity of the $\omega$ meson
in the Dalitz plot of the $3\pi$ final state. We consider constructing matrix elements
using the tensor method as we did for $\pi^0$ in Section 13.2. Materials we can use are
$E_1$, $E_2$, $E_3$ and $\mathbf{p}_1$, $\mathbf{p}_2$, $\mathbf{p}_3$. We consider three cases.

$J^P = 0^-$: A totally antisymmetric scalar that can be made of three momenta is
$\mathbf{p}_1 \cdot \mathbf{p}_2 \times \mathbf{p}_3$, which vanishes identically because $\mathbf{p}_1 = -(\mathbf{p}_2 + \mathbf{p}_3)$. Therefore, the
distribution must be proportional to

$$f \propto (E_1 - E_2)(E_2 - E_3)(E_3 - E_1) \qquad (13.122)$$

The event density on the Dalitz plot must vanish when two particles have the same
energy, which happens on the lines AA', BB' and CC' in Fig. 13.20, and is depicted as
a three-dimensional plot in Fig. 13.23a. Note the difference from K and $\eta$ decay
(Problem 13.2). For them there are no constraints from the isospin. For $\omega$ to have
a flat distribution, it has to be an isovector [398].

$J^P = 1^-$: Because $3\pi$ has negative parity, we need an axial vector here. Since
$\mathbf{p}_1 + \mathbf{p}_2 + \mathbf{p}_3 = 0$, we have only one axial vector

$$f \propto \mathbf{q} = \mathbf{p}_1 \times \mathbf{p}_2 = \mathbf{p}_2 \times \mathbf{p}_3 = \mathbf{p}_3 \times \mathbf{p}_1 \qquad (13.123)$$

which is already totally antisymmetric. At the boundary of the Dalitz plot plane,
two momenta are parallel or antiparallel, hence the density vanishes as depicted in
Fig. 13.23b.

Figure 13.23 Event distributions ... $J^P = 0^-, 1^-, 1^+$ [354, 398].
$J^P = 1^+$: A completely antisymmetric vector can be constructed as

$$f \propto \mathbf{p}_1(E_2 - E_3) + \mathbf{p}_2(E_3 - E_1) + \mathbf{p}_3(E_1 - E_2) \qquad (13.124)$$

At the center $E_1 = E_2 = E_3$, hence $f = 0$. For $E_2 = E_3$, the first term vanishes
and the second and third can vanish if $|\mathbf{p}_1|$ is maximum, because then $\mathbf{p}_2 = \mathbf{p}_3 =
-\mathbf{p}_1/2$. The depletion region is shown in Fig. 13.23c. We can go on to consider
higher spins but we stop here. The interested reader can refer to [369, 398].
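The zeros of the first two candidate distributions can be located numerically. The sketch below is a hedged illustration: it treats the three pions as having one common mass, evaluates the squared $0^-$ factor $(E_1 - E_2)(E_2 - E_3)(E_3 - E_1)$ and the squared $1^-$ factor $|\mathbf{p}_1 \times \mathbf{p}_2|^2$, and uses function names invented for this example. Momentum conservation, $\mathbf{p}_3 = -(\mathbf{p}_1 + \mathbf{p}_2)$, is used to express $\mathbf{p}_1 \cdot \mathbf{p}_2$ through the energies.

```python
import math


def density_0minus(e1, e2, e3):
    """Squared matrix element for J^P = 0^-: vanishes when two energies are equal."""
    return ((e1 - e2) * (e2 - e3) * (e3 - e1)) ** 2


def density_1minus(e1, e2, e3, mass):
    """Squared matrix element for J^P = 1^-: |p1 x p2|^2.

    Uses |p1 x p2|^2 = p1^2 p2^2 - (p1.p2)^2 with
    p1.p2 = (p3^2 - p1^2 - p2^2) / 2 from momentum conservation.
    """
    p1_sq = e1 * e1 - mass * mass
    p2_sq = e2 * e2 - mass * mass
    p3_sq = e3 * e3 - mass * mass
    dot = 0.5 * (p3_sq - p1_sq - p2_sq)
    return p1_sq * p2_sq - dot * dot


if __name__ == "__main__":
    m_pi, m_omega = 139.57, 782.0  # approximate masses in MeV (illustrative)
    e_center = m_omega / 3.0
    print(density_0minus(e_center, e_center, e_center),
          density_1minus(e_center, e_center, e_center, m_pi))
    # Collinear boundary configuration: momenta (p, p, -2p) with p = 100 MeV.
    e_a = math.sqrt(m_pi ** 2 + 100.0 ** 2)
    e_b = math.sqrt(m_pi ** 2 + 200.0 ** 2)
    print(density_1minus(e_a, e_a, e_b, m_pi))
```

The $0^-$ density vanishes at the center while the $1^-$ density is largest there and vanishes on the boundary, which is the qualitative difference exploited in the comparison with data.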

The results can be compared with experimental data (Fig. 13.24). Since all the
distributions considered here have sixfold symmetry, it is convenient to plot all
the points in one sector. Figure 13.24b shows such a plot and is compared with
theoretical predictions. It is clear that $\omega$ has $J^P = 1^-$.

13.6

Low-Energy Nuclear Force



13.6.1

Spin-Isospin Exchange Force



The scattering amplitude for NN interaction with exchange of a spin 0 isoscalar
meson was given in Eq. (6.157):

$$T_{fi} = (2\pi)^4\delta^4(p_1 + p_2 - p_3 - p_4)\,\bar{u}(p_3)\Gamma u(p_1)\,\frac{g^2}{q^2 - \mu^2}\,\bar{u}(p_4)\Gamma u(p_2) \qquad (13.125)$$

where $\mu$ is the mass of the exchanged meson. Here we investigate the exchange of a
$\pi$ meson, which is a pseudoscalar ($\Gamma = i\gamma^5$) in space-time and a vector in isospin
space. We have seen in Section 13.3.1 that the modification caused by inclusion
of the isospin is only multiplication by an additional factor $\tau_1 \cdot \tau_2$, but the calculation
of the pseudoscalar exchange effect needs some algebra. Using Eq. (4.63),
$\bar{u}(p')i\gamma^5 u(p)$ becomes in the nonrelativistic limit

$$\bar{u}_s(p')i\gamma^5\tau u_r(p) = i(\tau)_{fi}\,\xi_s^\dagger\left[\sqrt{(p'\cdot\sigma)(p\cdot\bar\sigma)} - \sqrt{(p'\cdot\bar\sigma)(p\cdot\sigma)}\right]\xi_r$$
$$\simeq i(\tau)_{fi}\,\xi_s^\dagger\,\boldsymbol\sigma\cdot(\mathbf{p} - \mathbf{p}')\,\xi_r = i(\tau)_{fi}\,\xi_s^\dagger\,\boldsymbol\sigma\cdot\mathbf{q}\,\xi_r \qquad (13.126)$$

where $(\tau)_{fi}$ is a matrix element in isospin space with $f$, $i$ denoting the proton or
neutron state. The equation shows that the $\gamma^5$ interaction is effectively a derivative
coupling, or that the pion couples with the nucleon predominantly in the P wave
as compared to the S wave, which is generally dominant in low-energy reactions. This
is due to the negative parity of the pion, because in the reaction $N \to N + \pi$
parity and angular momentum conservation forbids the pion to be produced in
the S state. NN interaction is another matter. The scattering amplitude contains a
propagator due to exchange of the meson. It has the form

$$T_{fi} \sim \frac{1}{q^2 - \mu^2} = -\frac{1}{2p^2(1 - \cos\theta) + \mu^2} \qquad (13.127)$$

which contains the S wave as well as other waves.12)

From now on we drop the spin and isospin wave function for simplicity and
leave the scattering amplitude in matrix form in the spin and isospin space. The
scattering matrix for the pseudoscalar exchange is expressed as

$$T_{fi} = (2\pi)^4\delta^4(p_1 + p_2 - p_3 - p_4)\,g^2\,\frac{(\boldsymbol\sigma_1\cdot\mathbf{q})(\boldsymbol\sigma_2\cdot\mathbf{q})}{q^2 - \mu^2}\,(\tau_1\cdot\tau_2) \qquad (13.129)$$

12) From Eq. (9.26), the $J$th partial wave amplitude can be extracted using

$$f_J = \frac{1}{2}\int_{-1}^{1} d\cos\theta\, f(\theta)P_J(\cos\theta) \qquad (13.128)$$

where the subscripts 1, 2 were attached to distinguish the particles 1 and 2. To gain
insight into the properties of the potential, we rewrite the matrix element as

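$$\langle p_3|\,ig\int d^4x\,\bar\psi(x)\gamma^5\boldsymbol\tau\psi(x)\cdot\boldsymbol\pi(x)\,|p_1, q\rangle \qquad (13.130a)$$

$$\simeq ig\int d^4x\,e^{-i(p_1 - p_3)\cdot x}\,\boldsymbol\sigma\cdot(\mathbf{p}_1 - \mathbf{p}_3)\,\langle 0|\boldsymbol\tau\cdot\boldsymbol\pi(x)|q\rangle$$

$$= -g\int d^4x\,e^{-i(p_1 - p_3)\cdot x}\,(\boldsymbol\sigma\cdot\nabla)\langle 0|\boldsymbol\tau\cdot\boldsymbol\pi(x)|q\rangle \qquad (13.130b)$$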

where we used a charge-independent interaction and $\simeq$ means a nonrelativistic
approximation. We made a partial integration to obtain the last equality. The
approximation is equivalent to replacing the Hamiltonian by

$$g\int d^3x\,\bar\psi(x)\gamma^5\boldsymbol\tau\psi(x)\cdot\boldsymbol\pi(x) \to -g\int d^3x\sum_k\psi^\dagger(x)\sigma_k\boldsymbol\tau\psi(x)\cdot\partial_k\boldsymbol\pi(x) \qquad (13.131)$$

where $\psi^\dagger(x)$, $\psi(x)$ in the last equation are two-component fields. This is called the
static approximation. Then the scattering matrix is given by

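$$S_{fi} - \delta_{fi} = -iT_{fi} = \langle p_3p_4|(-i)^2\int d^4x\,H_{\rm int}(x)\int d^4y\,H_{\rm int}(y)|p_1p_2\rangle$$
$$= -g^2\int d^4x\,d^4y\,e^{-i(p_1 - p_3)\cdot x - i(p_2 - p_4)\cdot y}\,(\boldsymbol\sigma_1\cdot\nabla_x)(\boldsymbol\sigma_2\cdot\nabla_y)\,\tau_{1i}\langle 0|T[\pi_i(x)\pi_j(y)]|0\rangle\tau_{2j} \qquad (13.132)$$

Separating the CM system $R = (x + y)/2$ from the relative motion $r = x - y$ and
integrating over $R$ gives a $\delta$-function for momentum conservation:

$$T_{fi} = (2\pi)^4\delta^4(p_1 + p_2 - p_3 - p_4)\int d^3r\,e^{-i(\mathbf{p}_3 - \mathbf{p}_1)\cdot\mathbf{r}}$$
$$\times\left[(\tau_1\cdot\tau_2)(\boldsymbol\sigma_1\cdot\nabla_x)(\boldsymbol\sigma_2\cdot\nabla_y)\left\{-\frac{g^2}{(2\pi)^3}\int d^3q\,\frac{e^{i\mathbf{q}\cdot\mathbf{r}}}{\mathbf{q}^2 + \mu^2}\right\}\right] \qquad (13.133)$$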

We have used $E_1 \simeq E_3 \simeq m$ in the denominator of the pion propagator. Then the
content of the curly brackets is the static Yukawa potential [$= -g^2e^{-\mu r}/(4\pi r)$]. The
nonrelativistic potential in space coordinates is given as the content of the square
brackets divided by $(2m)^2$ to correct for the relativistic normalization:

$$V(r) = (\tau_1\cdot\tau_2)(\boldsymbol\sigma_1\cdot\nabla_x)(\boldsymbol\sigma_2\cdot\nabla_y)\left(-\frac{f^2}{\mu^2}\frac{e^{-\mu r}}{4\pi r}\right) \qquad (13.134a)$$

$$f = \frac{\mu}{2m}\,g \qquad (13.134b)$$

Performing the differentiation, we obtain

$$V(r) = \mu\,\frac{f^2}{4\pi}(\tau_1\cdot\tau_2)\left[\frac{1}{3}(\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2) + \left(\frac{1}{s^2} + \frac{1}{s} + \frac{1}{3}\right)S_{12}\right]Y(s) \qquad (13.135a)$$

$$S_{12} = \frac{3(\boldsymbol\sigma_1\cdot\mathbf{s})(\boldsymbol\sigma_2\cdot\mathbf{s})}{s^2} - (\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2), \qquad Y(s) = \frac{e^{-s}}{s} \qquad (13.135b)$$

$$\mathbf{s} = \mu\mathbf{r}, \qquad s = |\mathbf{s}|, \qquad \mathbf{r} = \mathbf{x} - \mathbf{y} \qquad (13.135c)$$

$\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2$ is called a spin exchange force13) and $S_{12}$ a tensor force. $S_{12}$ gives a repulsive
force along the spin direction and attractive force perpendicular to it (or vice versa
depending on the overall sign), and averages to zero if integrated over all angles. It
vanishes for the S wave, therefore we will concentrate on the spin exchange force
(the first term).

Problem 13.10 

Derive Eq. (13.135) from Eq. (13.134). 

We have seen in Section 13.3.1 that $\tau_1\cdot\tau_2$ gives the isospin exchange force

$$\tau_1\cdot\tau_2 = 2\left[I(I + 1) - \frac{3}{2}\right] = \begin{cases} +1 & (I = 1) \\ -3 & (I = 0) \end{cases} \qquad (13.136)$$

The same formula holds for the spin functions and gives a positive sign for $S = 1$.
The deuteron, a bound state of (pn), has $I = 0$, $S = 1$ and only the even orbital
angular momentum $L = 0, 2, \ldots$ states are allowed, which is indeed the case. It
has a dominant S ($L = 0$) state with a small mixture of D ($L = 2$) state. For $I = 0$,
$\tau_1\cdot\tau_2 < 0$ and the sign of the potential is reversed. Namely, the scalar meson
exchange gives a repulsive force and cannot make a bound state if the isospin degree
of freedom is taken into account. But if the exchanged meson is a pseudoscalar,
Eq. (13.135) gives an attractive potential for the $I = 0$, $S = 1$ state and can explain
the deuteron bound state.
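The exchange factors follow from the identity $\tau_1\cdot\tau_2 = 2[I(I+1) - 3/2]$ for two isospin-1/2 particles, and the same algebra applies to $\sigma_1\cdot\sigma_2$ with total spin $S$. The short sketch below evaluates both with exact rational arithmetic; the function name is illustrative.

```python
from fractions import Fraction


def exchange_factor(total):
    """Expectation value of tau1.tau2 (or sigma1.sigma2) for two spin-1/2
    objects coupled to total isospin (or spin) `total`.

    Follows from (t1 + t2)^2 = t1^2 + t2^2 + 2 t1.t2 with t = tau / 2.
    """
    total = Fraction(total)
    half = Fraction(1, 2)
    return 2 * (total * (total + 1) - 2 * half * (half + 1))


if __name__ == "__main__":
    for name, isospin, spin in (("deuteron-like (I=0, S=1)", 0, 1),
                                ("I=1, S=0", 1, 0)):
        product = exchange_factor(isospin) * exchange_factor(spin)
        print(name, exchange_factor(isospin), exchange_factor(spin), product)
```

For both the $I = 0$, $S = 1$ and the $I = 1$, $S = 0$ states the product $(\tau_1\cdot\tau_2)(\sigma_1\cdot\sigma_2)$ equals $-3$, which is the sign reversal the text invokes for the deuteron channel.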

The appearance of the spin exchange force ($\sim \boldsymbol\sigma_1\cdot\boldsymbol\sigma_2$) is due to the exchange
of the angular momentum carried by a P-wave meson. Similarly, the appearance
of the isospin exchange force ($\sim \tau_1\cdot\tau_2$) is due to the exchange of isospin carried
by the pion. Whenever the force carrier has a quantum number that is described
by $SU(N)$, $N \ge 2$, a similar exchange force term appears. The color degree of
freedom carried by the gluon induces a color exchange force in $SU(3)$, which will
be discussed in detail in the next chapter.

13) For the electromagnetic force, it is a magnetic dipole-dipole interaction.



13.6.2 
Effective Range 

A highlight of the Yukawa theory was the prediction of the $\pi$ meson with $m_\pi \sim
200\,m_e$. It was based on the knowledge that the nuclear force is short range,
$\sim 10^{-13}$ cm. How do we know the range of a force? In the low-energy limit
$\delta_l \propto k^{2l+1}$ (see Section 13.5.3), where $k$ is the CM momentum. The S-wave phase
shift $\delta_0$ being $\sim k$, we have in the low-energy limit

fccot<5~ + -r fe 2 +O(fe 3 ) (13.137) 

which is referred to as the effective range approximation (see Problem 13.12).
Then the scattering amplitude, hence the cross section, can be calculated using
Eqs. (13.78) and (13.53). The formula (13.137) is general in the sense that it is
insensitive to the potential shape as long as it falls faster than $1/r$. $a$ is referred to
as the scattering length and is a measure of the target size (see Problem 13.11). It
is positive (negative) for an attractive (repulsive) force. $r_0$ is referred to as the
effective range and is a measure of the force range. Equation (13.137) means that by
measuring low-energy scattering data, $a$ and $r_0$ can be obtained as well as the sign
of $a$.
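The low-energy limit of the S-wave cross section can be illustrated numerically. The sketch below assumes the effective range expansion in the form $k\cot\delta = 1/a + r_0k^2/2$; the sign convention of the $1/a$ term is an assumption of this example (the limit $\sigma \to 4\pi a^2$ does not depend on it), and the numerical values of $a$ and $r_0$ are illustrative only.

```python
import math


def k_cot_delta(k, a, r0):
    """Effective range expansion, k cot(delta) = 1/a + r0 k^2 / 2 (assumed convention)."""
    return 1.0 / a + 0.5 * r0 * k * k


def s_wave_cross_section(k, a, r0):
    """S-wave cross section sigma = (4 pi / k^2) sin^2(delta)
    = 4 pi / (k^2 + (k cot delta)^2)."""
    return 4.0 * math.pi / (k * k + k_cot_delta(k, a, r0) ** 2)


if __name__ == "__main__":
    a, r0 = 5.4, 1.7  # illustrative values in fm
    for k in (1e-3, 0.05, 0.2, 0.5):  # momenta in fm^-1
        ratio = s_wave_cross_section(k, a, r0) / (4.0 * math.pi * a * a)
        print(k, round(ratio, 4))
```

The ratio to $4\pi a^2$ tends to 1 as $k \to 0$ and falls off at larger $k$, where the $r_0$ term begins to matter.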

Problem 13.11 

Show that in the limit $k \to 0$, $\sigma_{\rm TOT} = 4\pi a^2$.



Problem 13.12 

Consider S-wave scattering due to a spherical potential with finite range:

$$V(r) = 0 \quad (r > R), \qquad V(r) \ne 0 \quad (r < R) \qquad (13.138)$$

Setting $E = k^2/2m$ and the $l = 0$ radial wave function as $S(r)$, we define $u(r) =
rS(r)|_{r<R}$, $v(r) = rS(r)|_{r>R}$ and denote the solution at $k = 0$ as $u_0(r)$, $v_0(r)$. Prove
the following equations:

Mr) = 1 - \ (13-139) 

fccot<5 = + -r fe 2 +O(fe 3 ) (13.140) 

$$r_0 = 2\int_0^\infty dr\,\left[v_0^2 - u_0^2\right] \qquad (13.141)$$

Using the potential Eq. (13.135), the properties of the deuteron and low-energy
NN scattering behavior were reproduced well by setting

which was the original success of the Yukawa theory. However, the coupling constant
is ~ 1000 times bigger than that of the electromagnetic interaction and is too big for
perturbative treatment. This was the primary difficulty in constructing the theory
of strong interactions. It was only after the quark model was established that a
mathematical framework of the strong interaction, i.e. QCD, was developed.



13.7 

High-Energy Scattering 



13.7.1 

Black Sphere Model 



As we have seen in Fig. 13.8, $\pi N$ cross sections in the low-energy region ($T <
1$ GeV) show a rich structure with peaks and dips. At high energies, however, many
resonances overlap and the cross sections begin to show a smooth structure. Furthermore,
contributions from inelastic channels dominate and the fraction of elastic scattering cross
section diminishes (Fig. 13.25). Above 10 GeV, the $\pi p$ total cross section is almost
flat, rising slowly ($\sim \log^2 E$) above 100 GeV (Fig. 13.26). The cross section in this
region ($10 < E < 100$ GeV) is ~ 25 mb. If it is replaced with the geometrical area of
a circle $\pi R^2$, we obtain $R = 0.9 \times 10^{-13}$ cm, which reproduces the strong (nuclear) force range.
Besides, as can be seen from Fig. 13.26, cross sections of the particle and antiparticle
($\bar{p}p$ and $pp$, $\pi^-p$ and $\pi^+p$, $K^-p$ and $K^+p$) approach each other. There is an old
prediction that their cross sections should agree at high energies, which is known
as Pomeranchuk's theorem [302, 325]. Combination with isospin symmetry gives

$$\sigma(\pi^-n) = \sigma(\pi^+p) = \sigma(\pi^-p) \qquad (13.143)$$

Figure 13.25 $\pi^+p$ total and elastic cross sections and their behavior at high energy [311].

Figure 13.26 High-energy behavior of cross sections ... dependence [313].

Qualitatively we may expect that the black sphere approximation reasonably
reproduces the high-energy behavior because inelasticity grows as the energy goes
higher, becoming totally absorptive in the high-energy limit. The argument goes as
follows. Ignoring spin effects, the elastic scattering amplitude can be expanded in
terms of partial waves [Eqs. (13.53c), (13.91)]:

$$f(s, \theta) = \sum_l(2l + 1)f_l(s)P_l(\cos\theta) \qquad (13.144a)$$

$$f_l = \frac{\eta_l e^{2i\delta_l} - 1}{2ip} \qquad (13.144b)$$

If there is no absorption $\eta_l = 1$, and for total absorption $\eta_l = 0$. In the black sphere
model, total absorption in all partial waves is assumed ($\eta_l = 0$) and only sums up
to $l \le l_{\max}$ are taken. $l_{\max} = pR$, where $R$ is the force range, is introduced from a
semiclassical argument. Then the forward scattering amplitude $f(\theta = 0)$ can be
obtained by inserting $P_l(\cos\theta = 1) = 1$:

$$f(s, 0) = \frac{i}{2p}\sum_{l=0}^{l_{\max}}(2l + 1) = \frac{i}{2p}(l_{\max} + 1)^2 \simeq \frac{i}{2}pR^2$$

Application of the optical theorem [Eq. (9.56)] gives

$$\sigma_{\rm TOT} = \frac{4\pi}{p}\,{\rm Im}\,f(s, 0) = 2\pi R^2 \qquad (13.146)$$

Equation (13.146) reproduces the approximate value and constancy of the observed
cross section. However, quantum mechanically, the force range is not cut off
abruptly at a certain value $R$ but extends beyond its limit. From a purely analytical
argument, $l_{\max} \sim \sqrt{s}\ln s$ can be deduced, which sets an upper limit to the total
cross section

$$\sigma_{\rm TOT} \le {\rm constant} \times (\ln s)^2 \qquad (13.147)$$

This is referred to as the Froissart bound [150]. Observed data above $\sqrt{s} > 100$ GeV
seem to saturate and satisfy this limit (Fig. 13.26).
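Both numerical statements of the black sphere discussion can be checked directly. The following sketch sums the partial waves with total absorption ($\eta_l = 0$, so each wave contributes $i/2p$ to the forward amplitude) up to $l_{\max} = pR$ and applies the optical theorem; it also converts a cross section in millibarns into the radius of a disc of equal area. Natural units are assumed in the first function, and all names are illustrative.

```python
import math


def black_sphere_sigma_tot(p, radius):
    """Total cross section from the optical theorem for a black sphere.

    Im f(0) = (1 / 2p) * sum_{l=0}^{l_max} (2l + 1) with l_max = int(p * R),
    and sigma_tot = (4 pi / p) * Im f(0). Natural units (hbar = c = 1).
    """
    l_max = int(p * radius)
    im_f0 = sum(2 * l + 1 for l in range(l_max + 1)) / (2.0 * p)
    return 4.0 * math.pi * im_f0 / p


def radius_from_cross_section_mb(sigma_mb):
    """Radius in cm of a disc whose area equals sigma (1 mb = 1e-27 cm^2)."""
    return math.sqrt(sigma_mb * 1e-27 / math.pi)


if __name__ == "__main__":
    p, radius = 1000.0, 1.0
    print(black_sphere_sigma_tot(p, radius) / (2.0 * math.pi * radius ** 2))
    print(radius_from_cross_section_mb(25.0), "cm")
```

For $pR \gg 1$ the ratio to $2\pi R^2$ approaches 1, and 25 mb corresponds to a radius close to $0.9 \times 10^{-13}$ cm.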

So qualitatively the black sphere model seems to work. However, it cannot reproduce
the angular distributions in high-energy elastic scattering. Using approximations
valid at small angles $\theta \simeq 0$

$$t = -4p^2\sin^2(\theta/2) \simeq -(p\theta)^2, \qquad P_l(\cos\theta) \simeq J_0((l + 1/2)\theta) \qquad (13.148)$$

we can calculate

$$f(s, \theta) = \frac{i}{2p}\sum_{l=0}^{pR}(2l + 1)P_l(\cos\theta) \simeq \frac{i}{2p}\int_0^{pR}2x\,dx\,J_0(x\theta)
= \frac{iR}{\theta}J_1(pR\theta) = \frac{ipR}{\sqrt{-t}}J_1(R\sqrt{-t}) \qquad (13.149)$$

This is a typical diffraction pattern with dips and peaks.14) Experimental data give, irrespective of particle species,

$$\frac{d\sigma}{dt} \simeq a\,e^{bt} \tag{13.150}$$

where $a$ and $b$ are constants dependent on particle species. This is different from the angular distribution given by the black sphere model.
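The diffraction pattern of Eq. (13.149) can be illustrated by comparing the exact partial-wave sum with its Bessel-function approximation. This is a sketch under stated assumptions: natural units, illustrative values of $p$ and $R$, and helper names invented here. The Legendre polynomials come from the standard three-term recurrence and $J_1$ from its integral representation, so no external library is needed.

```python
import math

def legendre_values(l_max, x):
    """Return [P_0(x), ..., P_{l_max}(x)] from the three-term recurrence."""
    values = [1.0, x]
    for l in range(1, l_max):
        values.append(((2 * l + 1) * x * values[l] - l * values[l - 1]) / (l + 1))
    return values[: l_max + 1]

def im_f_partial_waves(p, R, theta):
    """Im f(s,theta) = (1/2p) * sum_{l<=pR} (2l+1) P_l(cos theta) for a black sphere."""
    l_max = int(p * R)
    P = legendre_values(l_max, math.cos(theta))
    return sum((2 * l + 1) * P[l] for l in range(l_max + 1)) / (2.0 * p)

def bessel_j1(x, n=2000):
    """J_1(x) = (1/pi) * integral_0^pi cos(tau - x sin tau) dtau (trapezoidal rule)."""
    h = math.pi / n
    g = lambda tau: math.cos(tau - x * math.sin(tau))
    total = 0.5 * (g(0.0) + g(math.pi)) + sum(g(k * h) for k in range(1, n))
    return total * h / math.pi

def im_f_bessel(p, R, theta):
    """Small-angle approximation Im f = R * J_1(p R theta) / theta."""
    return R * bessel_j1(p * R * theta) / theta

if __name__ == "__main__":
    p, R = 50.0, 1.0  # illustrative values (assumption)
    for theta in (0.02, 0.04, 0.06, 0.08, 0.10, 0.12):
        print(f"theta = {theta:.2f}  sum = {im_f_partial_waves(p, R, theta):9.3f}"
              f"  Bessel = {im_f_bessel(p, R, theta):9.3f}")
```

Both expressions change sign near $pR\theta \approx 3.8$, the first zero of $J_1$, which is the first dip of the diffraction pattern in $|f|^2$.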

14) This is a striped pattern similar to the Fraunhofer diffraction that light waves make through a pinhole.

13.7.2

Regge Trajectory* 



To understand the observed angular distributions, let us examine the quantum field theoretical possibilities. Interactions (scatterings) are, in general, mediated by exchange of particles. Let the spin of the exchanged $t$-channel particle be $J$. Then the high-energy scattering amplitude mediated by a particle exchange has the form

$$A(s,t) \sim g_J^2\,\frac{s^J}{t - m_J^2} \tag{13.151}$$

It is smooth as a function of $s$ and has the property of becoming small as $|t|$ grows. $g_J^2$ can be considered as the square of the coupling constant. Indeed, the one-pion exchange model is known to reproduce high-energy small-angle scattering behavior qualitatively. We expand the notion and consider that the scattering amplitude can be expressed as a sum of particle exchange amplitudes of various kinds. As these particles give poles in the complex $t$-plane, a dominant contribution to the scattering amplitude with given $t$ naturally comes from the nearest pole. For $s$-channel scattering, $t$ is negative and as far as small-angle scattering ($t \approx 0$) is concerned, the contribution of the particle with the smallest mass, namely that of $\pi$, should dominate. By taking into account other heavier particles, we are led to the idea of writing the scattering amplitude as a sum of poles generated by $t$-channel particles and resonances. Such formalism is naturally realized in the crossed $t$-channel amplitude introduced in Section 6.4.2. As an example, let us consider the $s$-channel $\pi p$ amplitude starting from the $t$-channel $\pi\pi$ scattering. We expand it in partial waves in the $t$ channel. Note, however, that the crossing symmetry can be applied only for Lorentz-invariant amplitudes, so we have to use $A(s,t) = -8\pi\sqrt{s}\,f(s,t)$ instead of $f(s,t)$ [see Eq. (13.53b)]. The partial wave expansion formula in the $t$ channel is given by

$$A(s,t) = \sum_l (2l+1)\,A_l(t)\,P_l(\cos\theta_t) \tag{13.152a}$$

$$\cos\theta_t = -1 - \frac{2s}{t - 4\mu^2} \tag{13.152b}$$

where $\mu$ is the $\pi$ mass. In order to convert this amplitude to the $s$-channel $\pi p$ scattering amplitude, all we need to do is to analytically continue from the $t > 4\mu^2$, $s < 0$ region to $s > 0$, $t < 0$. However, when $\cos\theta_t$ is considered as a complex variable, the convergence area of the Legendre function $P_l(\cos\theta_t)$ is limited to inside an ellipse tangential to the nearest pole and with foci at $\cos\theta_t = \pm 1$. In order to circumvent the difficulty, Regge regarded the amplitude as a function of the complex variable $l$. In this case, it can be rewritten in a form known as the Watson-Sommerfeld transformation



$$A(s,t) = -\frac{1}{2i}\oint_{C_1} dl\,(2l+1)\,A(l,t)\,\frac{P_l(-\cos\theta_t)}{\sin\pi l} \tag{13.153}$$

Figure 13.27 (a) Integration contour $C_1$. (b) The deformed integration contour $C_2$, which includes the dashed line Re$\{l\} = -1/2$.

The contour $C_1$ is described in Fig. 13.27a. The formula is equivalent to Eqs. (13.152), because $1/\sin\pi l$ has poles at $l = 0, \pm 1, \pm 2, \cdots$ with residue $(-1)^l/\pi$. Then we deform the contour as $C_2$ in Fig. 13.27b. The properties of the Legendre function $P_l(\cos\theta_t)$ are well known. It is convergent as long as the contour $C_2$ stays to the right of Re$\{l\} = -1/2$. The problem is the behavior of $A(l,t)$, but Regge showed that for nonrelativistic potentials (superposition of Yukawa potentials) it has at most a few poles, which are referred to as Regge poles. Writing the Regge pole



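$$A(l,t) \sim \frac{\beta_n(t)}{l - \alpha_n(t)} \tag{13.154}$$

Eq. (13.153) can be rewritten as

$$A(s,t) = -\frac{1}{2i}\int_{-1/2 - i\infty}^{-1/2 + i\infty} dl\,(2l+1)\,A(l,t)\,\frac{P_l(-\cos\theta_t)}{\sin\pi l} - \pi\sum_n \frac{(2\alpha_n(t)+1)\,\beta_n(t)}{\sin\pi\alpha_n(t)}\,P_{\alpha_n}(-\cos\theta_t) \tag{13.155}$$

As $l$ and $\alpha_n$ are complex numbers, convergence is not lost if we go to the region $t < 0$, $s > 0$, and thus analytic continuation is possible.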

Properties of Regge poles for potential scattering are well studied. As the energy increases, the pole position of $\alpha_n(t)$ on the Re$\{l\}$-Im$\{l\}$ plane moves with $t$ as depicted in Fig. 13.28a; this is called a Regge trajectory. Starting from Re$\{l\} = -1/2$, the Regge pole moves to the right on the real axis, departs from it at some value of $t$ (at the threshold $t = 4\mu^2$ in the $\pi\pi$ channel where the $\pi\pi$ scattering becomes physical), goes to the upper right and eventually changes direction to go back to Re$\{l\} = -1/2$. When $\alpha_n(t)$ is an integer on the real axis, it represents a bound state. After passing the threshold, when it has an integer real part and a small imaginary part, it represents a resonance. To see this we calculate the contribution of the Regge pole to the partial-wave amplitude. Using the relation



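$$\frac{1}{2}\int_{-1}^{1} dz\,P_l(z)\,P_\alpha(-z) = \frac{1}{\pi}\,\frac{\sin\pi\alpha}{(\alpha - l)(\alpha + l + 1)} \tag{13.156}$$

we obtain

$$A_l(t) = \frac{1}{2}\int_{-1}^{1} d\cos\theta\;A(t,\cos\theta)\,P_l(\cos\theta) = -\frac{(2\alpha_n(t)+1)\,\beta_n(t)}{(\alpha_n(t) - l)(\alpha_n(t) + l + 1)} \tag{13.157}$$

If the Regge pole is located near the real axis, we can expand the above equation in the neighborhood of Re$\{\alpha_n(t_r)\} = l$ and the partial amplitude becomes

$$A_l(t) \simeq -\frac{\beta_n(t_r)}{\alpha_n'(t_r)\,(t - t_r) + i\,{\rm Im}\{\alpha_n(t_r)\}} \tag{13.158}$$

where $\alpha_n' = d\,{\rm Re}\{\alpha_n\}/dt$. This is a well-known resonance formula.

Figure 13.28 A Regge pole moves in the complex $l$ plane as the energy changes; its track is called a Regge trajectory. A pole on the real axis corresponds to a bound state while a pole with Im$\{l\} > 0$ corresponds to a resonance. (a) The behavior of a Regge trajectory in a nonrelativistic potential. (b) A nonrelativistic Chew-Frautschi plot.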

A diagram in which Re$\{l\}$ is displayed as a function of energy is called a Chew-Frautschi plot. For potential scattering (see Fig. 13.28b), Re$\{l\}$ diminishes at high energy. The field theoretically extended version or phenomenologically obtained plot seems to extend linearly without limit (Fig. 13.29 [98]).

In applications to field theory, we need to take into account exchange forces. For instance, when the $t$ channel is $\pi\pi$, the bosonic nature of $\pi$ together with isospin independence restricts $l$ to either odd or even numbers. In other words, we have to replace the complex Legendre polynomial by

$$P_{\alpha_n}(\cos\theta) \to \frac{1}{2}\left[P_{\alpha_n}(\cos\theta) \pm P_{\alpha_n}(-\cos\theta)\right] \tag{13.159}$$

For the Regge trajectory including vectors such as $\rho, \omega, \phi$, the $(-)$ sign has to be adopted and for $0^{++}, 2^{++}$ including $f, a_2$, the $(+)$ sign has to be chosen. Therefore for one Regge trajectory, resonances appear at every other integer with $\Delta l = 2$. For a parity-degenerate trajectory, $\Delta l = 1$.

Figure 13.29 A Chew-Frautschi plot (horizontal axis: $t = ({\rm Mass})^2$ in GeV$^2$). Meson resonances are shown [98]. The data in the negative $t$ region can be obtained by fitting the data to the $t$-channel particle exchange amplitudes. Circles, which were obtained by fitting to $\pi^- p \to \pi^0 n$ data, should lie on the extended $\rho$ trajectory.

Now let us see how the $s$ channel amplitude behaves if analytically continued from the $t$ channel. We are interested in $\pi p$ scattering for values $s \to \infty$, $t < 0$. As $P_l(-\cos\theta_t) \sim s^l$ in the limit $s \to \infty$, a Regge pole positioned at the extreme right (the highest trajectory in the Chew-Frautschi plot) gives the dominant contribution. Therefore, instead of particles with discrete integer spin, we are talking of a scattering amplitude due to exchange of Regge poles with continuous $l$ value:

$$A(s,t) \simeq -\pi\sum_n \big(\alpha_n(t) + 1/2\big)\,\beta_n(t)\left[\frac{1 + \tau e^{-i\pi\alpha_n(t)}}{\sin\pi\alpha_n(t)}\right]\left(\frac{s}{s_0}\right)^{\alpha_n(t)} \tag{13.160}$$

where $s_0$ is a parameter having the same dimension as $s$ and $\alpha_n(t)$ denotes a Regge pole representing bound states or resonances in the $t$ channel. The numerator in the square brackets is a generalization of Eq. (13.159) and, depending on $\tau = \pm$, it is called an even (odd) trajectory. Using $d\sigma/dt \sim |A(s,t)|^2/s^2$, we obtain

$$\frac{d\sigma}{dt} = F(t)\,e^{2[\alpha(t) - 1]\ln(s/s_0)} \tag{13.161a}$$

$$\left.\frac{d\sigma}{dt}\right|_{t=0} = (1 + \rho^2)\,\frac{\sigma_{\rm TOT}^2}{16\pi}, \qquad \rho \equiv \frac{{\rm Re}\{A\}}{{\rm Im}\{A\}} \tag{13.161b}$$

where $F(t)$ collects the $s$-independent factors. Since we can write $\alpha(t) \approx \alpha(0) + \alpha' t$ in the vicinity of $t \approx 0$, we have reproduced the observed $t$ dependence Eq. (13.150).

In order to have an approximately constant total cross section ($\sigma_{\rm TOT} \sim -{\rm Im}A(t = 0)/s$), $A(s,t) \sim s$ is necessary, which, in turn, means $\alpha_P(0) = 1 + \epsilon$, $\epsilon \approx 0$. If this Regge trajectory has the same quantum numbers as the vacuum, the particle and antiparticle share a common dominant term, leading to Pomeranchuk's theorem. Therefore, the trajectory $\alpha_P$ is called the Pomeranchuk trajectory or Pomeron.

A $t$-channel Regge trajectory exchanged in a charge exchange scattering such as $\pi^- p \to \pi^0 n$ has quantum number $I = 1$. It becomes the $\rho$ resonance at $t = m_\rho^2$ and $l = 1$ [i.e. $\alpha_\rho(m_\rho^2) = 1$], and is called the $\rho$ trajectory. It crosses the point where $\alpha_\rho(0) = 1/2$ (see Fig. 13.29), and makes the cross section behave as $s^{-1}$.
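The two conditions quoted for the $\rho$ trajectory, $\alpha_\rho(0) = 1/2$ and $\alpha_\rho(m_\rho^2) = 1$, fix a straight-line trajectory. The sketch below assumes a linear form and a rounded $\rho$ mass of 0.775 GeV (an input not given in the text); the function names are illustrative. It also evaluates where the line reaches spin 3, the next member expected on a trajectory with $\Delta l = 2$.

```python
import math

M_RHO = 0.775      # GeV; rounded rho(770) mass (assumed input)
ALPHA_0 = 0.5      # intercept alpha_rho(0) quoted in the text

# Slope fixed by requiring alpha_rho(m_rho^2) = 1 on a straight line
SLOPE = (1.0 - ALPHA_0) / M_RHO ** 2   # GeV^-2

def alpha_rho(t):
    """Linear rho trajectory alpha(t) = alpha(0) + alpha' * t, with t in GeV^2."""
    return ALPHA_0 + SLOPE * t

def mass_for_spin(J):
    """Mass (GeV) at which the linear trajectory reaches Re alpha = J."""
    return math.sqrt((J - ALPHA_0) / SLOPE)

def cross_section_power(t=0.0):
    """Exponent n in dsigma/dt ~ s^n, using dsigma/dt ~ |A|^2 / s^2 with A ~ s^alpha."""
    return 2.0 * alpha_rho(t) - 2.0

if __name__ == "__main__":
    print("slope alpha' (GeV^-2):", SLOPE)
    print("mass at spin 3 (GeV):", mass_for_spin(3))
    print("power of s at t = 0:", cross_section_power())
```

With these inputs the slope comes out a little below 1 GeV$^{-2}$ and the spin-3 point lies between 1.6 and 1.8 GeV, in the mass region of the observed spin-3 meson on the $\rho$ trajectory.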

Other trajectories corresponding to mesons having different quantum numbers, for example, $\omega$ ($I = 0$, $J^P = 1^-$) and $K^*$ ($I = 1/2$, $J^P = 1^-$), follow more or less the same path.

Crossing between $s \leftrightarrow u$ converts one member to an antiparticle, but its effect on the Regge trajectory is simply an exchange $\cos\theta_t \leftrightarrow -\cos\theta_t$ and it changes it only by a multiplicative factor $\tau$, as can be seen from Eq. (13.160). Consequently, by taking the difference of the cross sections of the particle and antiparticle [for instance, $\sigma(\pi^- p) - \sigma(\pi^+ p)$], we can extract the Regge trajectory having $\tau = -$ (here the $\rho$ trajectory) and by taking the sum of the two we can separate that with $\tau = +$ (Pomeron, $f$, etc.). If we assume the scattering amplitude takes the form given by Eq. (13.151), the residue of a Regge pole exchanged in the reaction $A + B \to A + B$ can be written as a product $\gamma_n \to \gamma_{RAA}\,\gamma_{RBB}$ (factorization). Here, $\gamma_{RAA}$, $\gamma_{RBB}$ are quantities proportional to the coupling constant of the Regge trajectory with the particle $A$ or $B$, and can be related by $SU(3)$ symmetry, as explained in the next chapter. Thus all hadronic total cross sections may be expressed by a small number of parametrized Regge poles. Furthermore, in the $t$ region below threshold, the phase of the Regge trajectories depends only on the factor

$$\xi = \frac{1 + \tau e^{-i\pi\alpha(t)}}{\sin\pi\alpha(t)} \tag{13.162}$$

which gives

$${\rm Re}[f(s,0)] = \tan\left[\frac{\pi}{2}\alpha(0)\right] \times {\rm Im}\{f(s,0)\} \qquad \tau = - \tag{13.163a}$$

$${\rm Re}[f(s,0)] = -\cot\left[\frac{\pi}{2}\alpha(0)\right] \times {\rm Im}\{f(s,0)\} \qquad \tau = + \tag{13.163b}$$

Using parameters obtained by fitting the formula to the data, we can predict the real part of the forward scattering amplitude. As trajectories other than the Pomeron contribute to the real part, and dominant trajectories ($\rho$ and $f$, etc.) among them have $\alpha(0) = 1/2$, we expect the real part to decrease as $\sim s^{-0.5}$ at high energies. Experimentally, the real part can be determined by measuring the interference effect of the strong scattering amplitude with the Coulomb scattering; the result confirms our prediction [45, 46].

Figure 13.30 $pp$ elastic scattering data [183]. At high energy, a diffraction peak appears. Horizontal axis: four-momentum transfer squared, $|t|$ (GeV$^2$).

Phenomenologically, the behavior of the high-energy total cross sections can be well reproduced by using two effective Regge poles, the Pomeron and Reggeon [312]. The latter is an effective Regge trajectory with $\rho, \omega, f$, etc. combined:15)

$$\sigma_{\rm TOT}^{ab} = X_{ab}\,s^{\epsilon} + Y_{ab}\,s^{-\eta}, \qquad \epsilon = \alpha_P(0) - 1 = 0.093 \pm 0.002, \qquad \eta = 1 - \alpha_R(0) = 0.560 \pm 0.017 \tag{13.164}$$

where $s$ is in GeV$^2$. $X_{ab}$ and $Y_{ab}$ are parameters that depend on particle species. The first term represents the Pomeron and the second the Reggeon.

15) [312] gives three terms, separating the Reggeon contribution into $C = +1$ and $-1$ parts, thus reproducing the small differences between the particle ($\sigma_{ab}$) and antiparticle ($\sigma_{\bar{a}b}$) cross sections, but at the cost of a somewhat poorer fit the difference can be neglected. Recently, owing to accumulation of more precise data, the Pomeron contribution has been modified to $X s^{\epsilon} \to Z + B\log^2(s/s_0)$ [311].

According to Eq. (13.161), the coefficient of $t$ in the exponent grows with $s$. This means that the forward scattering peak shrinks as $s$ becomes larger. This is observed in $pp$ scattering (Fig. 13.30), but in many other channels it is not the case. This means either $s$ is not large enough to be in the asymptotic region or a simple approximation with a few Regge poles is not sufficient to reproduce the data.
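The two-term Pomeron plus Reggeon form of Eq. (13.164) is easy to explore numerically. The sketch below uses the exponents quoted in the text; the coefficients `X = 20` and `Y = 60` (in mb) are hypothetical placeholders chosen only for illustration, not fitted values, and the function names are introduced here.

```python
def sigma_tot(s, X, Y, eps=0.093, eta=0.560):
    """Two-term parametrization sigma = X * s^eps + Y * s^(-eta), with s in GeV^2."""
    return X * s ** eps + Y * s ** (-eta)

def s_at_minimum(X, Y, eps=0.093, eta=0.560):
    """Value of s where d(sigma)/ds = 0, i.e. eps*X*s^(eps-1) = eta*Y*s^(-eta-1)."""
    return (eta * Y / (eps * X)) ** (1.0 / (eps + eta))

if __name__ == "__main__":
    X, Y = 20.0, 60.0  # hypothetical coefficients in mb (illustration only)
    s_min = s_at_minimum(X, Y)
    print("minimum at sqrt(s) [GeV]:", s_min ** 0.5)
    for s in (10.0, 1e2, 1e4, 1e6, 1e8):
        print(f"s = {s:10.1e} GeV^2  sigma = {sigma_tot(s, X, Y):7.2f} mb")
```

The Reggeon term dominates the fall at low energy and the Pomeron term produces the slow rise at high energy, so the sum has a shallow minimum in between.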

In summary, the Regge model is very attractive as a way to explain the high- 
energy behavior and is frequently used in representing the asymptotic behavior of 
the scattering cross sections. In the language of the Standard Model, the Pomeron 
is not due to a specific resonance but a dynamical composite effect of multiple 
gluons. In the Regge representations, a scattering amplitude in terms of t-channel 
low-energy resonances describes the high-energy part of the s channel, and the 
converse is also true. Thus, there is a duality in the scattering amplitude. Extending 
the idea, each scattering amplitude in the crossed channel may be totally expressed 
as a sum over resonances in s, t, u channels and indeed such a model was created 
(Veneziano model [378]). The duality in which one scattering amplitude can be 
considered as both an s and a t channel amplitude induced a (hadronic) string 
model [288, 329, 358], which, in turn, triggered the modern string models. In this 
sense, the analysis with Regge trajectories may be said to be an important turning 
point in the history of particle physics. 



14 

The Quark Model 



Nucleons and pions have been thought of as elementary particles for a long time. It 
took over 30 years of trial and error to realize they are composites made of quarks. 
During the process, there were two turning points. One was the discovery of the 
strange particles, which began in the late 1940s, and the other that of charmed 
particles in 1974. By the time the charmed particles were discovered, particle tax- 
onomy was fairly advanced and the significance of the discovery was immediately 
recognized, creating a turbulent but transient transformation of the particle view 
referred to as "the November revolution". On the other hand, the characteristics of 
the strange particles have been a central riddle, something researchers had to pur- 
sue for a long time. In retrospect, when the characteristics of the strange particles 
were understood, the essence of the quark model was already in hand. The process 
was slow and did not create a fever like that of the charmed particles, but was far 
more important in its contribution to the progress of particle physics. 



14.1

SU(3) Symmetry

Historically, the discovery of strange particles added to isospin another conserved internal quantum number, the strangeness $S$. Just as isospin ($SU(2)$ symmetry) was introduced when the equality of both the mass and the properties of the proton and neutron was noticed, the near equality ($m_p \approx m_n \approx m_\Lambda$) of three basic elements of the Sakata model [339] led to $SU(3)$ symmetry, known as IOO symmetry [212] at the time of introduction. Based on the Sakata model and $SU(3)$, the meson octet was successfully explained. The discovery of the $\Omega$ particle firmly established the validity of $SU(3)$. However, the misplacement of $p, n, \Lambda$ as the three basic elements hindered the Sakata model and the "eightfold way" took over, which, in turn, led to the quark model. With the discovery of the charmed particles, $SU(3)$ was extended to $SU(4)$, but all the essential features of so-called flavor symmetry are contained in $SU(3)$ and the quark model. In this chapter we explain that static properties of the hadrons, including the mass and magnetic moment, are well reproduced by the quark model. We defer discussions of the dynamic properties such as the inner structure of the hadron, which can be investigated by scattering experiments, until Chap. 17.



14.1.1 

The Discovery of Strange Particles 



Around the late 1940s, Rochester and Butler [332], while investigating reactions 
induced by high-energy cosmic rays, observed strange particles that left V-shaped 
tracks in cloud chambers (Fig. 14.1). 

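Some of them were neutral, decaying into $\pi^- p$ or $\pi^-\pi^+$ (Fig. 14.1a), and others were charged and decayed into $\pi^- n$ or $\pi^+\pi^0$ (Fig. 14.1b):

$$V_1 \to \pi^- + p, \qquad V_2 \to \pi^+ + \pi^-, \qquad V_3 \to \pi^- + n, \qquad V_4 \to \pi^+ + \pi^0 \tag{14.1}$$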

The pion was a new particle when it was discovered, but it was theoretically anticipated as a necessary ingredient of the nuclear force carrier. In contrast, theorists anticipated neither the existence nor the necessity of the "V" particles. Namely, they could not understand their properties. The most outstanding puzzle was that they were produced frequently in the cloud chamber, meaning the production was a fast process, but they left visible tracks, meaning the decay was a slow process. The fact that the production was fast meant it was induced by the strong force. Then if the "V" particle were a strongly interacting particle, its decay time should be $10^{-24}$ to $10^{-22}$ s (sometimes referred to as the strong time scale) and it could not leave visible tracks. Visible tracks as long as a few centimeters mean a decay time of $10^{-10}$ to $10^{-8}$ s, which is a typical weak interaction time scale. The riddle was that a particle interacted strongly at one time and weakly at another. Then it was realized that all the "V" particles were produced in association with another "V" particle [310]. Nakano and Nishijima [286] and Gell-Mann [159] independently proposed a new quantum number, "strangeness $S$", to explain the puzzle, and postulated that it is conserved in the strong interaction but not in the weak interaction.
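The link between lifetime and visible track length used in the argument above is simply $L = \beta\gamma c\tau$. As a rough sketch (the function name and the choice $\beta\gamma = 1$ are assumptions made here for illustration), the two time scales quoted in the text translate into very different distances:

```python
C_LIGHT = 2.998e8  # speed of light in m/s

def mean_track_length(tau, beta_gamma=1.0):
    """Mean distance travelled before decay, beta*gamma*c*tau, in metres."""
    return beta_gamma * C_LIGHT * tau

if __name__ == "__main__":
    for label, tau in (("strong", 1e-23), ("weak", 1e-10)):
        print(f"{label:6s} tau = {tau:.0e} s  ->  L = {mean_track_length(tau):.3e} m")
```

A lifetime of $10^{-10}$ s corresponds to a few centimeters, whereas $10^{-23}$ s corresponds to a few femtometers, far too short to leave a track in a cloud chamber.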




Figure 14.1 Production of new particles in the Wilson cloud chamber. Among the mar 
ers penetrating through a thick lead absorber, a V-shaped track (a,b) and a kink (c,d) a 
served [332].

Let us rename $V_1$, $V_2$ as $\Lambda$, $K^0$ and assume they have strangeness $S = -1$ and $S = +1$, respectively. All the hitherto known particles, including $\pi$, $N$, are assumed to have $S = 0$. Then the strangeness number in the production reaction can be written as

$$\pi^- + p \to K^0 + \Lambda, \qquad S:\ 0 + 0 \to (+1) + (-1) \tag{14.2}$$

$S$ is conserved before and after the reaction. On the other hand, in the decays

$$\Lambda \to \pi^- + p \ \ (S: -1 \to 0), \qquad K^0 \to \pi^+ + \pi^- \ \ (S: +1 \to 0) \tag{14.3}$$

$S$ is not conserved and the strong interaction is forbidden. Similarly, if $V_3$, $V_4$ are $\Sigma^-$, $K^+$, the reaction becomes

$$\pi^- + p \to K^+ + \Sigma^- \tag{14.4}$$

The reaction is clearly visualized in a bubble chamber picture exposed to the Bevatron beam at LBL (Lawrence Berkeley Laboratory) (Fig. 14.2).

Interestingly, these strange particles have partners with different electric charge but with nearly the same mass. Namely they also form isospin multiplets, just like nucleons and pions. Nishijima and Gell-Mann proposed a relation between the strangeness $S$, the electric charge $Q$ (in units of the proton charge), and the third component of the isospin $I_3$:

$$Q = I_3 + \frac{Y}{2}, \qquad Y = B + S \tag{14.5}$$

Here, $B$ is the baryon number and $Y$ is called the hypercharge. This is a simple and yet very powerful rule that is used to predict missing partners of isospin multiplets. The law applies to all the hadrons discovered later. Table 14.1 lists a sample of hadrons. Particles with $B = 0$ are called mesons, $B = 1$ baryons and $B = 1$, $S \neq 0$ hyperons. All the hundreds of hadrons so far discovered satisfy the Nishijima-Gell-Mann law.

Table 14.1 Quantum numbers of representative hadrons.

         0^- mesons        1^- mesons         (1/2)^+ baryons
Y = 1    (K+, K0)          (K*+, K*0)         (p, n)
Y = 0    (π+, π0, π-)      (ρ+, ρ0, ρ-)       (Σ+, Σ0, Σ-)

Figure 14.2 A bubble chamber picture taken at the 6 GeV Bevatron/LBL. $\pi^- + p \to K^0 + \Lambda$, $K^0 \to \pi^- + \pi^+$, $\Lambda \to \pi^- + p$. Photo LBL [254]. A bubble chamber is a vessel filled with a superheated transparent liquid (hydrogen, propane). Ions made by a passing charged particle act as evaporation centers, around which bubbles form, creating a track. The mechanism is similar to that of the cloud chamber with gas and mist replaced by liquid and bubbles. The high density of the liquid makes the bubble chamber a very effective particle detector. It was invented by Glaser in 1952 [172].
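The Nishijima-Gell-Mann relation $Q = I_3 + (B + S)/2$ can be checked mechanically against the hadrons mentioned so far. The following sketch uses exact fractions; the dictionary of $(I_3, B, S, Q)$ values lists standard assignments and the names `charge` and `HADRONS` are introduced here for illustration.

```python
from fractions import Fraction as F

def charge(i3, baryon_number, strangeness):
    """Electric charge from Q = I_3 + Y/2 with hypercharge Y = B + S."""
    hypercharge = baryon_number + strangeness
    return i3 + F(hypercharge, 2)

# name: (I_3, B, S, observed charge Q)
HADRONS = {
    "pi+":    (F(1),     0,  0,  1),
    "pi0":    (F(0),     0,  0,  0),
    "pi-":    (F(-1),    0,  0, -1),
    "K+":     (F(1, 2),  0,  1,  1),
    "K0":     (F(-1, 2), 0,  1,  0),
    "p":      (F(1, 2),  1,  0,  1),
    "n":      (F(-1, 2), 1,  0,  0),
    "Lambda": (F(0),     1, -1,  0),
    "Sigma+": (F(1),     1, -1,  1),
    "Sigma0": (F(0),     1, -1,  0),
    "Sigma-": (F(-1),    1, -1, -1),
}

if __name__ == "__main__":
    for name, (i3, B, S, Q) in HADRONS.items():
        print(f"{name:7s} Q(predicted) = {charge(i3, B, S)}  Q(observed) = {Q}")
```

The same function can be used in the other direction: given a multiplet with a gap, the relation tells which charge state is missing.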

In the 1960s, after the Cosmotron (3 GeV) at BNL and the Bevatron (6 GeV) at LBL began operation, detailed measurements of $KN$ scattering and systematic searches for $KN$, $KKN$, $\pi Y$, $KY$ channels in multiparticle productions were carried out, clarifying the existence and properties of $S = 0, -1, -2$ particles and resonances. We have explained part of the story already in the previous chapter (Fig. 13.14-Fig. 13.19). Hundreds of hadrons were thus discovered. Then, one naturally raises the question of whether they are really elementary. If they are considered as composites of more fundamental particles, what properties should these have?

14.1.2

The Sakata Model 

Hadrons have a variety of spins and isospins ($s, I = 0, 1/2, 1, 3/2, \ldots$). As any spin (or isospin) can be constructed from $s = 1/2$ ($I = 1/2$) bases, the fundamental particles must comprise $s = I = 1/2$ particles, which requires at least a pair of $s = 1/2$ particles with different electric charge. Conversely, two kinds of species with this attribute can compose any value of spin $s$ and isospin $I$. But to have the strangeness $S$, a third fundamental particle with nonzero $S$ is necessary. The Sakata model [339] chose $(p, n, \Lambda)$ as the three fundamental particles. Noticing the nearly same value of the mass of these particles, Ikeda, Ohnuki and Ogawa [212] proposed that the strong interaction is invariant under the exchange $p \leftrightarrow n \leftrightarrow \Lambda \leftrightarrow p$ (IOO symmetry). This is the introduction of $SU(3)$.1) However, just as isospin conservation ($SU(2)$ symmetry) was an approximate symmetry, $SU(3)$ symmetry (sometimes called unitary spin) is even more so. The mass difference between $p$ and $n$ is $\sim 0.1\%$ but that between the nucleon and $\Lambda$ is $\sim 20\%$. $SU(3)$ symmetry is considered valid only within this accuracy.

Let us define three basis vectors of the $SU(3)$ group $\psi$ and write them in matrix form:

$$\psi = \begin{pmatrix}\xi^1\\ \xi^2\\ \xi^3\end{pmatrix} = \begin{pmatrix}p\\ n\\ \Lambda\end{pmatrix} \tag{14.6}$$

A triplet $(p, n, \Lambda)$ represented as a $3 \times 1$ column vector is a three-dimensional representation of $SU(3)$ having $(I_3, Y) = (1/2, 1), (-1/2, 1), (0, 0)$ and is denoted as $3$. The antiparticles can be represented as a $1 \times 3$ row vector $(\bar p, \bar n, \bar\Lambda) = (\xi_1, \xi_2, \xi_3)$. The antitriplet has quantum numbers opposite to $3$, i.e. $(I_3, Y) = (-1/2, -1), (1/2, -1), (0, 0)$ and belongs to $3^*$.3) Denoting $U$ as a unitary unimodular ($\det U = 1$) $3 \times 3$ matrix, the $SU(3)$ transformation can be expressed in terms of eight ($= 3^2 - 1$) hermitian traceless matrices $\lambda_i$ as

$$U = \exp\left[-\frac{i}{2}\sum_{i=1}^{8}\lambda_i\theta_i\right], \qquad {\rm Tr}(\lambda_i) = 0 \tag{14.7}$$

(where $\theta_i$ are eight independent real numbers and the factor $1/2$ in the exponent is a historical convention) just as the $SU(2)$ transformation matrix is expressed in terms of three ($= 2^2 - 1$) Pauli matrices. For $\lambda_i$, the convention is to adopt Gell-Mann matrices, which are defined as

$$\lambda_1 = \begin{pmatrix}0&1&0\\1&0&0\\0&0&0\end{pmatrix}, \quad \lambda_2 = \begin{pmatrix}0&-i&0\\i&0&0\\0&0&0\end{pmatrix}, \quad \lambda_3 = \begin{pmatrix}1&0&0\\0&-1&0\\0&0&0\end{pmatrix}, \quad \lambda_4 = \begin{pmatrix}0&0&1\\0&0&0\\1&0&0\end{pmatrix},$$

$$\lambda_5 = \begin{pmatrix}0&0&-i\\0&0&0\\i&0&0\end{pmatrix}, \quad \lambda_6 = \begin{pmatrix}0&0&0\\0&0&1\\0&1&0\end{pmatrix}, \quad \lambda_7 = \begin{pmatrix}0&0&0\\0&0&-i\\0&i&0\end{pmatrix}, \quad \lambda_8 = \frac{1}{\sqrt{3}}\begin{pmatrix}1&0&0\\0&1&0\\0&0&-2\end{pmatrix} \tag{14.8}$$

1) To be exact, this is flavor $SU(3)$, to be distinguished from color $SU(3)$ to be described later. The latter is the symmetry of the color charge that the quark has.

2) For the definition of groups and terminologies, refer to Appendices A and G.

3) Note $3^* \neq 3$ in $SU(3)$, but $2^* = 2$ in $SU(2)$, because they occupy the same point on the $I_3$ map. For $SU(2)$, the quantum numbers of the antidoublet are the same as those of the basis doublet. Hence by a suitable transformation $(\bar p, \bar n) \to (\bar n, -\bar p)$, they can be made to belong to the same $I = 1/2$ representation. [See Eq. (9.220)]




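The number of diagonal matrices is two, which is the rank of $SU(3)$. The eight $\lambda_i$'s satisfy the following commutation relations:

$$\left[\frac{\lambda_i}{2}, \frac{\lambda_j}{2}\right] = i f_{ijk}\,\frac{\lambda_k}{2} \tag{14.9a}$$

$$\left\{\frac{\lambda_i}{2}, \frac{\lambda_j}{2}\right\} = \frac{1}{3}\delta_{ij} + d_{ijk}\,\frac{\lambda_k}{2} \tag{14.9b}$$

$f_{ijk}$ is called the structure constant of $SU(3)$, where $\{a, b\} = ab + ba$ and $f_{ijk}$, $d_{ijk}$ are totally antisymmetric or symmetric with respect to $ijk$ permutations and have the values listed in Table 14.2.

Table 14.2 Structure constants of SU(3).

f_123 = 1
f_147 = f_246 = f_257 = f_345 = f_516 = f_637 = 1/2
f_458 = f_678 = √3/2
d_118 = d_228 = d_338 = -d_888 = 1/√3
d_146 = d_157 = d_256 = d_344 = d_355 = 1/2
d_247 = d_366 = d_377 = -1/2
d_448 = d_558 = d_668 = d_778 = -1/(2√3)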
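The structure constants can be extracted directly from the Gell-Mann matrices, which gives a quick consistency check of the table. The sketch below assumes the standard normalization ${\rm Tr}(\lambda_a\lambda_b) = 2\delta_{ab}$, so that ${\rm Tr}([\lambda_i,\lambda_j]\lambda_k) = 4i f_{ijk}$ and ${\rm Tr}(\{\lambda_i,\lambda_j\}\lambda_k) = 4 d_{ijk}$; the helper names are introduced here for illustration.

```python
import math

I = 1j
S3 = 1.0 / math.sqrt(3.0)

# Gell-Mann matrices lambda_1 ... lambda_8 as nested lists
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -I, 0], [I, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -I], [0, 0, 0], [I, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -I], [0, I, 0]],
    [[S3, 0, 0], [0, S3, 0], [0, 0, -2 * S3]],
]

def matmul(a, b):
    """Product of two 3x3 matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

def trace(a):
    return sum(a[i][i] for i in range(3))

def f(i, j, k):
    """Antisymmetric structure constant f_ijk (1-based indices)."""
    a, b, c = LAMBDA[i - 1], LAMBDA[j - 1], LAMBDA[k - 1]
    comm = trace(matmul(matmul(a, b), c)) - trace(matmul(matmul(b, a), c))
    return (comm / (4 * I)).real

def d(i, j, k):
    """Symmetric constant d_ijk (1-based indices)."""
    a, b, c = LAMBDA[i - 1], LAMBDA[j - 1], LAMBDA[k - 1]
    anti = trace(matmul(matmul(a, b), c)) + trace(matmul(matmul(b, a), c))
    return (anti / 4).real

if __name__ == "__main__":
    print("f_123 =", f(1, 2, 3), " f_147 =", f(1, 4, 7), " f_458 =", f(4, 5, 8))
    print("d_118 =", d(1, 1, 8), " d_146 =", d(1, 4, 6), " d_888 =", d(8, 8, 8))
```

Looping `f` and `d` over all index triples reproduces the full list of nonzero constants, with the remaining entries following from (anti)symmetry.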
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14.1.3 

Meson Nonets 

Composites of a baryon and an antibaryon have the baryon number $B = 0$, hence they can be identified as mesons. Since under $SU(3)$ transformation

$$\psi \to \psi' = U\psi, \qquad \bar\psi \to \bar\psi' = \bar\psi\,U^{-1} \tag{14.10}$$

the combination $\bar\psi\psi = \sum_i \xi_i\xi^i = \bar p p + \bar n n + \bar\Lambda\Lambda$ is invariant and hence

$$\eta' = \frac{1}{\sqrt{3}}\left(\bar p p + \bar n n + \bar\Lambda\Lambda\right) \tag{14.11}$$

makes an irreducible representation by itself of dimension 1 (singlet). The remaining eight elements made up of the combinations

$$M^i_{\ j} = \xi^i\xi_j - \frac{1}{3}\,\delta^i_j\sum_k \xi^k\xi_k \tag{14.12}$$

mix with each other by the transformation and constitute the bases of an irreducible representation of dimension 8 (octet). The above process of multiplication and decomposition can be expressed in the simple form

$$3 \otimes 3^* = 1 \oplus 8 \tag{14.13}$$

The process is an extension of $SU(2)$ isospin addition to $SU(3)$. In $SU(2)$, when $I = 1/2$ and $1/2$ are added together, they can be decomposed to $I = 0$ and $I = 1$, which, in the above notation, can be expressed as $2 \otimes 2 = 1 \oplus 3$. By comparing the quantum numbers of the octet members, the observed mesons can be identified as

$$\pi^+ = \xi^1\xi_2, \qquad \pi^0 = -(\xi^1\xi_1 - \xi^2\xi_2)/\sqrt{2}, \qquad \pi^- = -\xi^2\xi_1$$

$$K^+ = \xi^1\xi_3, \qquad K^0 = \xi^2\xi_3, \qquad \bar K^0 = -\xi^3\xi_2, \qquad K^- = \xi^3\xi_1 \tag{14.14}$$

$$\eta = (\xi^1\xi_1 + \xi^2\xi_2 - 2\xi^3\xi_3)/\sqrt{6}$$

The negative sign of $\pi^0$, $\pi^-$, $\bar K^0$ reflects the fact that $(\bar n, -\bar p)$ makes an $I = 1/2$ doublet [see Eq. (9.220)]. The construction of neutral particles is not unique, but considering the isospin is a good quantum number, $\pi^0$ is made by applying $I_- = I_1 - iI_2$ to $\pi^+$ and $\eta$ is made to be orthogonal to both $\pi^0$ and $\eta'$. Using Eq. (14.14), Eq. (14.12) can be rewritten as



$$M = \begin{pmatrix} -\dfrac{\pi^0}{\sqrt{2}} + \dfrac{\eta}{\sqrt{6}} & \pi^+ & K^+ \\ -\pi^- & \dfrac{\pi^0}{\sqrt{2}} + \dfrac{\eta}{\sqrt{6}} & K^0 \\ K^- & -\bar K^0 & -\dfrac{2\eta}{\sqrt{6}} \end{pmatrix} \tag{14.15}$$

where rows are labeled by the upper index $i$ and columns by the lower index $j$.

Mapping octet members on the $I_3$-$Y$ plane gives Fig. 14.3a. Note, in Fig. 14.3 the quark model $(u, d, s)$ is used instead of $(p, n, \Lambda)$ for the sake of later discussions. Meson octets have the same $(I_3, Y)$ assignments in both the Sakata and quark models. Such a diagram is called a weight diagram. They reproduce a spectrum of observed spin 0 mesons, although when it was first made, $\eta$ and $\eta'$ had not yet been discovered. For the spin 1 members, replacements $\pi \to \rho$, $K \to K^*$, $\eta, \eta' \to \omega, \phi$ make the weight diagram as shown in Fig. 14.3b. Thus, the Sakata model prepared a mathematical tool $SU(3)$ and obtained the correct meson assignment, but because it put aside $(p, n, \Lambda)$ as a special trio, it failed to reproduce the baryon octet correctly (see Fig. 14.4b). The correct assignments were made by the eightfold way of Gell-Mann [160] and Ne'eman [294]. From then on the Sakata model was superseded by the quark model. The arguments for the meson we have developed so far are equally valid for the quark model, too.

Figure 14.3 Nonet = singlet + octet made of $(u, d, s)$ and $(\bar u, \bar d, \bar s)$. Numbers in parentheses are mass values in MeV averaged within the isospin multiplet. (a) Correspondence between observed $0^-$ mesons and quark-antiquark pair combinations. (b) Same as (a) for $1^-$ mesons.

Figure 14.4 (a) An octet made of three quarks. (b) Observed $(1/2)^+$ baryons.
14.1.4 

The Quark Model 

The eightfold way confers a more fundamental role on the octet since, experimentally, baryons with $J^P = (1/2)^+$ as well as mesons were observed as octets. If hypercharge $Y = S + B$ is adopted instead of the strangeness $S$ as in the Sakata model, both baryons and mesons can be treated in a unified way. Though the eightfold way could reproduce the observed multiplets correctly, it lacked logical justification of why $SU(3)$ works. If one asks for the existence of the fundamental particles as bases of the octet, one needs three bases. They are similar to $p, n, \Lambda$ of the Sakata model but have slightly different quantum numbers and are referred to as $u, d, s$ quarks. If they have baryon numbers, $3^* \otimes 3 = 1 \oplus 8$ can reproduce meson octets. But to realize an octet with baryon number 1, a different combination has to be introduced. As it happens, a product of three 3's can make an octet that can be identified with the baryon octet. If that is so, the baryon number of the quarks must be 1/3. Now one uses the same assignment of the Sakata model except for the baryon number and requires that Nishijima-Gell-Mann's law Eq. (14.5) applies to quarks, too. Then their quantum number can be assigned as in Table 14.3.

According to Table 14.3, the electric charge of the quarks would become $2/3$ or $-1/3$, not integers. This was an extraordinary concept at that time and even the proposer Gell-Mann [161] claimed for a while that the quarks were not real but should be considered as a convenient mathematical tool. The other proposer, Zweig [400], is said to have thought them real. According to the modern interpretation, all the strong interactions originate from color charges that all the quarks carry and their strength does not depend on the quark species. As $m_u \approx m_d \approx m_p/3 \approx 330$ MeV, $m_s \approx m_u + (m_\Lambda - m_p) \approx 500$ MeV can be inferred from phenomenological considerations [see Eq. (14.57)], it is clear that the isospin [$SU(2)$] and unitary spin [$SU(3)$] are approximate symmetries whose validity is limited to phenomena where the mass difference is negligible on the energy scale in question. Namely, $SU(3)$ symmetry under exchange of $u, d, s$ is rather accidental. As the quark species $u, d, s, \ldots$ are said to differ in "flavor", this symmetry is now referred to as flavor $SU(3)$ to distinguish it from color $SU(3)$, which plays a fundamental role in the Standard Model. Historically, however, flavor $SU(3)$ was instrumental in reaching the quark structure of hadrons.

Table 14.3 Quantum numbers of the quarks.

Quark         Spin   I     I_3    S    B     Y      Q
u (up)        1/2    1/2   1/2    0    1/3   1/3    2/3
d (down)      1/2    1/2   -1/2   0    1/3   1/3    -1/3
s (strange)   1/2    0     0      -1   1/3   -2/3   -1/3
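Applying the Nishijima-Gell-Mann law to the quarks themselves gives the fractional charges, and summing over constituents recovers the integer charges of hadrons. The sketch below is illustrative: the dictionary and function names are introduced here, and a hadron is specified by a string of quark flavours plus a string of antiquark flavours.

```python
from fractions import Fraction as F

# (I_3, B, S) for each quark flavour
QUARKS = {
    "u": (F(1, 2),  F(1, 3),  0),
    "d": (F(-1, 2), F(1, 3),  0),
    "s": (F(0),     F(1, 3), -1),
}

def quark_charge(flavour):
    """Q = I_3 + (B + S)/2 applied to a single quark."""
    i3, b, s = QUARKS[flavour]
    return i3 + (b + s) / 2

def hadron_charge(quarks="", antiquarks=""):
    """Sum of constituent charges; antiquarks contribute with opposite sign."""
    return (sum(quark_charge(q) for q in quarks)
            - sum(quark_charge(q) for q in antiquarks))

if __name__ == "__main__":
    for q in "uds":
        print(q, quark_charge(q))
    print("p (uud):", hadron_charge("uud"))
    print("n (udd):", hadron_charge("udd"))
    print("pi+ (u dbar):", hadron_charge("u", "d"))
    print("K- (s ubar):", hadron_charge("s", "u"))
```

Three quarks always give an integer total because each carries $B = 1/3$, and a quark-antiquark pair gives an integer because the fractional parts cancel.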

14.1.5 

Baryon Multiplets 

To construct baryons, let us first look at the transformation property of nine products of two quarks $T^{ij} = \xi^i\xi^j$. They are not irreducible, but if they are divided into symmetric and antisymmetric parts

$$T^{ij} = \frac{1}{2}(\xi^i\xi^j - \xi^j\xi^i) + \frac{1}{2}(\xi^i\xi^j + \xi^j\xi^i) \equiv A^{ij} + S^{ij} \tag{14.16}$$

each part transforms to itself because the symmetry property is conserved in the transformation (Problem 14.1). In other words, nine product tensors $T^{ij}$ are separated into three antisymmetric and six symmetric tensors:

$$3 \otimes 3 = 3^* \oplus 6 \tag{14.17}$$

Why the three $A^{ij}$'s belong to $3^*$ and not to $3$ is clearly seen by looking at the quantum numbers they possess. Alternatively, the transformation property of $\eta_k = \epsilon_{ijk}A^{ij} = \epsilon_{ijk}\xi^i\xi^j$ can be shown to be the same as that of $(\xi^k)^*$ (Problem 14.2). Multiplying Eq. (14.17) by one more $3$, the first term on the rhs of Eq. (14.17) gives

$$3^* \otimes 3 = 1 \oplus 8 \tag{14.18}$$

just like the meson nonets. However, unlike the meson singlet, $1$ here is a totally antisymmetric combination of $A^{ij}$ and $\xi^k$, i.e. $\psi_1 = \epsilon_{ijk}\xi^i\xi^j\xi^k$ (Problem 14.3), and the remaining eight tensors become an octet. From the second term $6$, 18 tensors can be made. Among them, ten are totally symmetric and constitute a decuplet whereas the remaining eight tensors make another octet. Namely we have obtained

$$6 \otimes 3 = 8 \oplus 10 \tag{14.19}$$

In summary, we have derived

$$3 \otimes 3 \otimes 3 = 1_A \oplus 8_{MA} \oplus 8_{MS} \oplus 10_S \tag{14.20}$$

Among them, $1$ is totally antisymmetric and $10$ is totally symmetric with respect to three indices. The two octets, $8_{MA}$, $8_{MS}$ occupy the same position on the $I_3$-$Y$ plane, but as can be seen from the way they are constructed, they have mixed symmetry, one antisymmetric and the other symmetric in the first two indices. Experimentally, $(1/2)^+$ baryons constitute $8$ (Fig. 14.4) and $(3/2)^+$ baryons constitute $10$ (Fig. 14.5). This means the real baryon $(1/2)^+$ octet is a combination of the two octets with mixed symmetry.

Figure 14.5 (a) $10$ can be constructed as totally symmetric combinations of three quarks. (b) Observed spectrum of $(3/2)^+$ resonances. The numbers on the right denote their mass in MeV.

Problem 14.1

Show that the symmetry property of Eq. (14.16) does not change under the transformation. Namely, what was (anti)symmetric before is also (anti)symmetric after the transformation.

Problem 14.2

Show that $A^{ij}$ in Eq. (14.16) belongs to $3^*$ in two ways: 1) by looking at the quantum numbers and 2) by looking at its transformation property.

Problem 14.3

Show that a totally antisymmetric wave function $\psi_1 = \epsilon_{ijk}\xi^i\xi^j\xi^k$ is invariant under SU(3) transformation and hence constitutes a singlet.



14.1.6 

General Rules for Composing Multiplets 

In order to understand the basic rules for constructing multiplets from basis vectors, we have shown some simple examples. When the dimension of the multiplets increases, constructing them by hand as we have done rapidly becomes complex. There are general formulae in SU(2) for constructing any value of angular momentum J out of spin 1/2 bases using the Clebsch-Gordan coefficients. There is a similar formula in SU(3), and tables of the corresponding Clebsch-Gordan coefficients are available (see for instance [311]). We refer to Appendix G for more detailed arguments. We restrict arguments here merely to a simple extension of SU(2) to SU(3) and how to use tables derived from the general addition rules of SU(3).

In SU(2), a multiplet is specified by the angular momentum j. 5) Elements in a multiplet are specified by the eigenvalue of $J_z$, $m = j, j-1, \cdots, -j$. A set of $(2j_1+1)(2j_2+1)$ elements made by multiplying the $(2j_1+1)$ elements of a multiplet specified by $j_1$ by the $(2j_2+1)$ elements of another multiplet $j_2$ generally forms a reducible representation. But such a representation can be decomposed into a sum of irreducible representations (multiplets) as

$j_1 \otimes j_2 = (j_1 + j_2) \oplus (j_1 + j_2 - 1) \oplus \cdots \oplus |j_1 - j_2|$ (14.21)

Elements of the irreducible representation belonging to (J, M) can be expressed in terms of Clebsch-Gordan coefficients as (see Appendix E)

$|JM\rangle = \sum_{m_1, m_2} C^{JM}_{j_1 m_1 j_2 m_2}\, |j_1 m_1\rangle |j_2 m_2\rangle$ (14.22)



In SU(3), specifying the dimension alone is not enough, as we saw that $3^* \neq 3$. An additional indicator is necessary. This is because there are two Casimir operators in SU(3). It is convenient to use a pair of numbers (p, q) to specify a multiplet, where p and q are the effective numbers of quarks and antiquarks. 6) The dimension is given by

$D(p, q) = \frac{1}{2}(p+1)(q+1)(p+q+2)$ (14.23)

We list some of the correspondences between the notation 1, 3, etc. and D(p, q):

D(0,0) = 1, D(1,0) = 3, D(0,1) = 3*, D(1,1) = 8 = 8*,
D(2,0) = 6, D(0,2) = 6*, D(3,0) = 10, D(0,3) = 10*, ... (14.24)
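The dimension formula of Eq. (14.23) is easy to check numerically. The following is a minimal sketch; the function name `su3_dimension` and the list of test multiplets are illustrative choices, not notation from the text.

```python
def su3_dimension(p: int, q: int) -> int:
    """Dimension of the SU(3) multiplet D(p, q), Eq. (14.23)."""
    # (p+1)(q+1)(p+q+2) is always even, so integer division is exact.
    return (p + 1) * (q + 1) * (p + q + 2) // 2


# (p, q) labels of the multiplets quoted in the text, plus (2, 2) for the 27.
LABELS = [(0, 0), (1, 0), (0, 1), (1, 1), (2, 0), (0, 2), (3, 0), (0, 3), (2, 2)]

for p, q in LABELS:
    print(f"D({p},{q}) has dimension {su3_dimension(p, q)}")

# Dimension count for 8 x 8 = 1 + 8 + 8 + 10 + 10* + 27
pieces = [(0, 0), (1, 1), (1, 1), (3, 0), (0, 3), (2, 2)]
total = sum(su3_dimension(p, q) for p, q in pieces)
print("8 x 8 ->", total)
```

Note that a multiplet and its conjugate, D(p, q) and D(q, p), always have the same dimension, which is why the dimension alone cannot label a representation.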

The decomposition corresponding to Eq. (14.21) in SU(2) can be made using Young diagrams and expressed as

$D(p_1, q_1) \otimes D(p_2, q_2) = \sum_{p,q} n(p, q)\, D(p, q)$ (14.25)

An example: $8 \otimes 8 = 1 \oplus 8 \oplus 8 \oplus 10 \oplus 10^* \oplus 27$

The difference from SU(2) is that in Eq. (14.21), specification of J is enough to uniquely determine the representation, while in SU(3) there can be more than one representation with the same dimension, and n(p, q) can be an integer larger than 1. In the above example, two octets appear with differing symmetry. The composition formula corresponding to Eq. (14.22) is

Here, $\mu$ specifies the multiplet expressed in (p, q) and the subscript $\gamma$ attached to $\mu$ differentiates multiplets $\mu$ with the same dimension; $\nu$ denotes different members within a multiplet, usually specified as $(Y, I, I_3)$ with the conditions $Y = Y_1 + Y_2$, $I_3 = (I_1)_3 + (I_2)_3$.

5) In mathematical language, by the eigenvalues of the Casimir operator, the operator (other than the trivial identity operator) that commutes with all the transformations in the group. In SU(2) it is $J^2$ with eigenvalue j(j + 1).

6) An antisymmetric pair of quarks (or antiquarks) is equivalent to an antiquark (or a quark). See arguments following Eq. (14.17) and Appendix G for details.

Wigner-Eckart Theorem Corresponding to the Wigner-Eckart theorem in SU(2), which is expressed as

$\langle j_2 m_2 | T(JM) | j_1 m_1 \rangle = \delta_{m_2, M+m_1}\, C^{j_2 m_2}_{JM\, j_1 m_1}\, \langle j_2 \| T^J \| j_1 \rangle$ (14.27)

its analogue in S U(3) is given by 



(^ 2 v 2 |r(^)| AlVl > = W^ £ ^ (^WT'-Wm) 



where the reduced matrix element $\langle \mu_2 \| T^{\mu} \| \mu_1 \rangle$ depends only on the representations involved. The Gell-Mann-Okubo mass formula described in the following section is an application of the Wigner-Eckart theorem.



14.2 

Predictions of SU(3) 

14.2.1 

Gell-Mann-Okubo Mass Formula 

If SU(3) symmetry were exact, all the quarks would have equal masses ($m_u = m_d = m_s$). Although $m_u \simeq m_d$ is approximately true [$m_p$(938 MeV) $\simeq m_n$(939 MeV)], $\Delta m = m_s - m_u \simeq m_\Lambda$(1116 MeV) $- m_p$(938 MeV) is fairly large. If we consider this mass difference to be the origin of SU(3) symmetry breaking, the Hamiltonian mass term would be

$H_m = m_u(\bar u u + \bar d d) + m_s \bar s s = \frac{2m_u + m_s}{3}(\bar u u + \bar d d + \bar s s) - \frac{m_s - m_u}{3}(\bar u u + \bar d d - 2\bar s s)$ (14.29)

The first term transforms as a singlet of SU(3) and the second as the eighth member (I = Y = 0) of an octet and hence is the symmetry-breaking term. Using Eq. (14.29) Gell-Mann [160] derived a sum rule that holds for particles in an octet (Problem 14.4).

$\frac{1}{2}(m_N + m_\Xi) = \frac{1}{4}(3m_\Lambda + m_\Sigma)$ (14.30a)

$m_K^2 = \frac{1}{4}(3m_\eta^2 + m_\pi^2)$ (14.30b)

The agreement with experimental data is very good. Then Okubo, using Eq. (14.29), derived a mass formula applicable to any multiplet [300]:

$M = a + bY + c\left[I(I+1) - \frac{1}{4}Y^2\right]$ (14.31)

a, b, c are constants that differ for each multiplet. Equation (14.31) is referred to as the Gell-Mann-Okubo mass formula. The use of mass squared rather than mass itself in Eq. (14.30b) is motivated by the Lagrangian mass term of the Klein-Gordon equation. The agreement with experiments is certainly better with the mass squared.
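As a rough numerical illustration of the octet sum rule, Eq. (14.30a), the sketch below compares its two sides. The baryon masses are approximate isospin-averaged values in MeV that we assume here for illustration; they are not quoted in the text.

```python
# Approximate isospin-averaged baryon octet masses in MeV (assumed values).
M_N = 938.9
M_LAMBDA = 1115.7
M_SIGMA = 1193.2
M_XI = 1318.3


def gmo_octet_sides(m_n, m_lambda, m_sigma, m_xi):
    """Return (lhs, rhs) of (m_N + m_Xi)/2 = (3 m_Lambda + m_Sigma)/4."""
    lhs = 0.5 * (m_n + m_xi)
    rhs = 0.25 * (3.0 * m_lambda + m_sigma)
    return lhs, rhs


lhs, rhs = gmo_octet_sides(M_N, M_LAMBDA, M_SIGMA, M_XI)
relative_gap = abs(lhs - rhs) / rhs
print(f"lhs = {lhs:.1f} MeV, rhs = {rhs:.1f} MeV, relative gap = {relative_gap:.2%}")
```

With these inputs the two sides differ by well under one percent, which is the sense in which the agreement is "very good".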

Problem 14.4 

Derive Eqs. (14.30) from Eq. (14.31). 



Problem 14.5* 

Derive Eqs. (14.30) from Eq. (14.29). 



14.2.2 
Prediction of Ω

When flavor SU(3) symmetry was proposed, three members of the $(3/2)^+$ decuplet, Δ(1232), Σ(1385) and Ξ(1530), were known. 7) They fit as members of the decuplet. However, the last member Ω was not known at the time. If SU(3) symmetry is correct, Ω with $J^P = (3/2)^+$ should exist and have I = 0, Q = -1, S = -3. Furthermore, applying Eq. (14.31) the following relations are obtained:

$M(\Omega) - M(\Xi^*) = M(\Xi^*) - M(\Sigma^*) = M(\Sigma^*) - M(\Delta)$ (14.32)

We have attached an asterisk to denote that they are members of the decuplet. Inserting observed values $M(\Delta)$ = 1232 MeV, $M(\Sigma^*)$ = 1385 MeV, $M(\Xi^*)$ =

7) The notation for baryonic resonances with different $J^P$ is as follows. If they have the same (I, Y) as [N(1/2, 1), Δ(3/2, 1), Λ(0, 0), Σ(1, 0), Ξ(1/2, -1), ...], they are expressed as [N(1440), Δ(1232), Λ(1520), Σ(1385), Ξ(1530), ...] with the mass value in parentheses.



1530 MeV, we obtain $M(\Omega) \simeq 1680$ MeV. The Ω mass value is significant. Ω cannot decay by the strong interaction: a state with S = -3 can be constructed from $(\Xi\bar K)$, $(\Lambda\bar K\bar K)$, $(N\bar K\bar K\bar K)$, but they all have mass values larger than that of Ω: $M(\Xi\bar K)$ > 1809 MeV, $M(\Lambda\bar K\bar K)$ > 2107 MeV, $M(N\bar K\bar K\bar K)$ > 2423 MeV. Consequently, Ω cannot decay to them because of energy conservation. Decays to $(\Lambda\bar K)$, $(\Xi\pi)$ are allowed, but only weakly because they have S = -2, violating strangeness conservation. As a result of this prediction, a bubble chamber team at BNL launched a search for Ω, and out of $10^5$ pictures, they found one with evidence of Ω production (Fig. 14.6). The picture clearly shows the following sequence of reactions:

$K^- + p \to \Omega^- + K^+ + K^0$
$\Omega^- \to \Xi^0 + \pi^-$
$\Xi^0 \to \Lambda + \pi^0$, with $\pi^0 \to \gamma + \gamma$ and each $\gamma \to e^+ + e^-$
$\Lambda \to \pi^- + p$

$K^0$ is not seen in the picture, but from energy-momentum conservation the missing mass $M_{\rm miss}$ can be calculated; $M_{\rm miss} = m_K$ indicates that it is a $K^0$. The $\Omega^-$ leaves a visible track and its lifetime and mass were calculated to be ~$10^{-10}$ s and 1686 ± 12 MeV, confirming the SU(3) prediction. This was a result of SU(3) symmetry and not of the quark model, but historically the latter was inspired by the former.

Figure 14.6 A picture of Ω production obtained in a bubble chamber at BNL [49].
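The equal-spacing argument of Eq. (14.32) and the threshold argument against strong decay can be reproduced in a few lines. This is only a sketch: it uses the decuplet masses and the threshold values quoted in the text, and averaging the two known spacings is our own simple choice of how to extrapolate.

```python
# Decuplet masses in MeV as quoted in the text.
M_DELTA, M_SIGMA_STAR, M_XI_STAR = 1232.0, 1385.0, 1530.0

# Lower bounds (MeV) on the S = -3 strong-decay channels, as quoted in the text.
STRONG_THRESHOLDS = {"Xi Kbar": 1809.0, "Lambda Kbar Kbar": 2107.0, "N Kbar Kbar Kbar": 2423.0}


def predict_omega_mass(m_delta, m_sigma_star, m_xi_star):
    """Extrapolate one more equal-spacing step beyond Xi*, Eq. (14.32)."""
    spacings = [m_sigma_star - m_delta, m_xi_star - m_sigma_star]
    mean_spacing = sum(spacings) / len(spacings)
    return m_xi_star + mean_spacing


omega_mass = predict_omega_mass(M_DELTA, M_SIGMA_STAR, M_XI_STAR)
print(f"predicted M(Omega) ~ {omega_mass:.0f} MeV")

for channel, threshold in STRONG_THRESHOLDS.items():
    allowed = omega_mass > threshold
    print(f"{channel}: threshold {threshold:.0f} MeV, strong decay allowed: {allowed}")
```

Every strong channel comes out kinematically closed, so the predicted state can only decay weakly and should live long enough to leave a visible track.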

Note that so far resonances that appear in the strong interaction with lifetimes ~$10^{-23}$ s were thought of as dynamical phenomena, not in the same category as quasistable particles with long lifetimes $\gtrsim 10^{-10}$ s. But from the fact that the $\Omega^-$ belongs to the same decuplet as other resonances, it is clear that the lifetime does not play any essential role in the classification of particle species. This is analogous to treating all the excited atomic or nuclear energy levels, stable or unstable, on an equal footing. As quark composites, it is legitimate to treat hadronic resonances equally with other hadrons such as p and n.

14.2.3 
Meson Mixing 

As stated before, flavor SU(3) is not a rigorous symmetry and the Hamiltonian contains a symmetry-breaking interaction as in Eq. (14.29). It has the same transformation property as the neutral member of an octet and has nonzero matrix elements between particles belonging to different multiplets but having the same Y and $I_3$. For example, it induces mixing between a singlet meson $\phi_0 = (u\bar u + d\bar d + s\bar s)/\sqrt{3}$ and the $Y = I_3 = 0$ member of an octet $\phi_8 = (u\bar u + d\bar d - 2s\bar s)/\sqrt{6}$. For the $1^-$ mesons,

$\phi = \cos\theta\,\phi_8 - \sin\theta\,\phi_0$
$\omega = \sin\theta\,\phi_8 + \cos\theta\,\phi_0$ (14.33)

$\phi$, $\omega$ are observed mesons and are mass eigenstates of a 2 × 2 matrix $M^2$:

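On the other hand, the mass matrix $M^{2\prime}$ with bases $(\phi_8, \phi_0)$ can be expressed as

$M^{2\prime} = \begin{pmatrix} M_8^2 & M_{80}^2 \\ M_{08}^2 & M_0^2 \end{pmatrix}$ (14.35)

Using $\mathrm{Tr}\,M^2 = \mathrm{Tr}\,M^{2\prime}$, $\det M^2 = \det M^{2\prime}$, we obtain

$m_\phi^2 + m_\omega^2 = M_0^2 + M_8^2$

$m_\phi^2 m_\omega^2 = M_0^2 M_8^2 - M_{08}^2 M_{80}^2$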

Gell-Mann and Okubo's formula Eq. (14.30), which is valid for no mixing, means 



Using Eqs. (14.33) and (14.36), the following relations are obtained: 



(14.37a) 
(14.37b) 



Problem 14.6 
Derive Eqs. (14.37). 
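When $\sin\theta = 1/\sqrt{3}$ ($\simeq 0.577$, $\theta \simeq 35.3^\circ$),

$\phi = \sqrt{\frac{2}{3}}\,\frac{u\bar u + d\bar d - 2s\bar s}{\sqrt 6} - \sqrt{\frac{1}{3}}\,\frac{u\bar u + d\bar d + s\bar s}{\sqrt 3} = -s\bar s$ (14.38a)

$\omega = \sqrt{\frac{1}{3}}\,\frac{u\bar u + d\bar d - 2s\bar s}{\sqrt 6} + \sqrt{\frac{2}{3}}\,\frac{u\bar u + d\bar d + s\bar s}{\sqrt 3} = \frac{u\bar u + d\bar d}{\sqrt 2}$ (14.38b)

which is called ideal mixing. Table 14.4 shows that for vector $1^-$ and tensor $2^+$ mesons the mixing is almost ideal. For the pseudoscalars, the mixing angle is small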



;. This is because m, ; / > M os and is attributed to the chiral anomaly 
contribution [97, 208, 361], which is an interesting topic theoretically but we do not discuss it here. If ideal mixing is realized, $\phi, f_2' \simeq s\bar s$ and $\omega, f_2 \simeq u\bar u + d\bar d$; the former are expected to decay predominantly to $K\bar K$ and the latter to $2\pi$, $3\pi$ (see OZI rule below). The observed branching ratios are [311]



$\mathrm{BR}(\phi \to K\bar K) \simeq 84\%$, $\mathrm{BR}(\phi \to 3\pi) \simeq 15\%$
$\mathrm{BR}(\omega \to 3\pi) \simeq 90\%$
$\mathrm{BR}(f_2' \to K\bar K) \simeq 100\%$
$\mathrm{BR}(f_2 \to 2\pi + 4\pi) \simeq 95\%$, $\mathrm{BR}(f_2 \to K\bar K) \simeq 4\%$

which confirms the expectation.

Figure 14.7 (a) $\phi \to 3\pi$, (b) $\phi \to K^+K^-$, $K^0\bar K^0$.
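The statement that $\sin\theta = 1/\sqrt 3$ decouples the strange and nonstrange components can be verified numerically. The sketch below represents each state by its coefficients in the basis $(u\bar u, d\bar d, s\bar s)$; the vector representation and the function name `mix` are our own illustrative choices.

```python
import math

# Coefficients in the basis (u ubar, d dbar, s sbar).
PHI_8 = [1 / math.sqrt(6), 1 / math.sqrt(6), -2 / math.sqrt(6)]
PHI_0 = [1 / math.sqrt(3)] * 3


def mix(theta):
    """Return the (phi, omega) coefficient vectors for mixing angle theta, Eq. (14.33)."""
    c, s = math.cos(theta), math.sin(theta)
    phi = [c * a - s * b for a, b in zip(PHI_8, PHI_0)]
    omega = [s * a + c * b for a, b in zip(PHI_8, PHI_0)]
    return phi, omega


ideal_angle = math.asin(1 / math.sqrt(3))
phi, omega = mix(ideal_angle)
print(f"ideal mixing angle = {math.degrees(ideal_angle):.2f} degrees")
print("phi   coefficients:", [round(x, 6) for x in phi])
print("omega coefficients:", [round(x, 6) for x in omega])
```

At this angle the first vector has only an $s\bar s$ component and the second has none, which is what makes the OZI argument for the observed branching ratios possible.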



OZI Rule The suppression of $\phi \to 3\pi$ and $\omega, f_2 \to K\bar K$ can be explained by the so-called OZI (Okubo-Zweig-Iizuka) rule [211, 301, 400], which claims that decays with disconnected quark diagrams are suppressed. It is represented pictorially in Fig. 14.7.

Since $\phi$ is predominantly composed of $s\bar s$, it can go to $K\bar K$ with connected quark lines, whereas the quark lines have to be cut and regenerated to go to $3\pi$ [note: $\phi \to 2\pi$ is forbidden by G-parity ($G = (-1)^I C = -1$), see paragraph in Sect. 13.2]. This is an empirical rule. The reason for it was clarified with the advent of QCD (see Sect. 14.5.4).



Problem 14.7

Experimentally, $\sigma(K^- p \to \phi\Lambda) \simeq 100\,\sigma(\pi^- p \to \phi n)$. By drawing the quark diagram, show that the OZI rule applies here.



Problem 14.8

The decay process of a vector meson $1^-$ to a lepton pair $l\bar l$ goes through a photon intermediate state and its decay width is proportional to the square of the electric charge, $Q^2 = (\sum_i Q_i)^2$,

where α is the fine structure constant, ψ is the wave function of the vector meson and $m_V$ is its mass. Assuming $m_\rho = m_\omega = m_\phi$ and ideal mixing, show that the relative ratios of the decays are

$\Gamma_\rho : \Gamma_\omega : \Gamma_\phi = 9 : 1 : 2$ (14.41a)

The experimental data are

$\Gamma_\rho = 7.04 \pm 0.06$ keV, $\Gamma_\omega = 0.60 \pm 0.02$ keV, $\Gamma_\phi = 1.27 \pm 0.04$ keV (14.41b)

and agree fairly well with the expectation.
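The charge factors behind the ratio 9 : 1 : 2 can be worked out with exact fractions. In this sketch each meson is described by integer coefficients of its $q\bar q$ components and the squared norm of that combination; this bookkeeping, and the use of $\rho^0 = (u\bar u - d\bar d)/\sqrt 2$, are assumptions made here for illustration (only the overall sign convention of the states is at stake, which does not affect $Q^2$).

```python
from fractions import Fraction

QUARK_CHARGE = {"u": Fraction(2, 3), "d": Fraction(-1, 3), "s": Fraction(-1, 3)}

# meson -> (integer coefficients of q qbar components, squared norm of the combination)
MESONS = {
    "rho": ({"u": 1, "d": -1}, 2),   # (u ubar - d dbar)/sqrt(2)
    "omega": ({"u": 1, "d": 1}, 2),  # (u ubar + d dbar)/sqrt(2), ideal mixing
    "phi": ({"s": 1}, 1),            # s sbar, ideal mixing
}


def effective_charge_squared(meson):
    """Q^2 = (sum_i a_i Q_i)^2 for a normalized q qbar combination."""
    coefficients, norm_squared = MESONS[meson]
    amplitude = sum(c * QUARK_CHARGE[q] for q, c in coefficients.items())
    return amplitude * amplitude / norm_squared


charges = {name: effective_charge_squared(name) for name in MESONS}
ratios = {name: value / charges["omega"] for name, value in charges.items()}
print(charges)
print(ratios)
```

The measured widths in Eq. (14.41b) give roughly 11.7 : 1 : 2.1 in the same normalization, so the equal-mass estimate is good to about 30 percent.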



Problem 14.9*

Derive Eq. (14.40).

Hint: $V \to l\bar l$ can be considered as $q\bar q \to l\bar l$ scattering in which the quark flux is prepared by the vector meson (see Fig. 14.8). Use:

1. $\sigma(e^-e^+ \to \mu^-\mu^+)$ in Eq. (C.17) with appropriate replacement of the quark mass and charge,

2. The quark flux prepared by the meson is $F(q\bar q) = 2\beta_q|\psi(0)|^2$, where $\beta_q$ is the quark velocity in the total center of mass frame with energy $\sqrt s = m_V$,

3. The result would still be a factor 3 off, which can be corrected by considering the color factor described below.

Figure 14.8 A diagram that contributes to $V \to l\bar l$ decay.



14.3 

Color Degrees of Freedom 

The success of SU(3) immediately raised a new problem. $\Omega^-$ and $\Delta^{++}$ are composed of sss and uuu, which are symmetric. Their spin is 3/2, which means the spin configuration is ↑↑↑ and is also symmetric. Since ground states of the quark composites are expected to have orbital angular momentum l = 0 (states with l ≥ 1 are generally at higher energy levels), the particles' wave functions in the decuplet are totally symmetric. This contradicts quarks having spin 1/2, which should obey Fermi statistics. Therefore, we need to introduce another degree of freedom to make the wave function totally antisymmetric, which we refer to as "color". In order to make three particles totally antisymmetric, the number of color degrees of freedom has to be three. We have already learned that a singlet made from $3 \otimes 3 \otimes 3$ in SU(3) is totally antisymmetric [see arguments following Eq. (14.18)]. We name the three color states R (red), B (blue) and G (green). The names are arbitrary. We could just as well call them $\eta_1$, $\eta_2$ and $\eta_3$ or use numbers. However, color is a very convenient name, since all the observed hadrons seem to be color singlets, and can figuratively be called white or color neutral.

Evidence for Color Degrees of Freedom There is a great deal of evidence for the 
existence of three colors (N c = 3). 

1) The spin statistics of $\Omega^-$ and $\Delta^{++}$ that we have just stated.

2) $\pi^0 \to \gamma + \gamma$: As the neutral pion $\pi^0$ is a bound state of $q\bar q$, i.e. $|\pi^0\rangle = (-|u\bar u\rangle + |d\bar d\rangle)/\sqrt 2$ [see Eq. (14.14)], the process goes through quark-loop intermediate states as depicted in Fig. 14.9. The decay matrix element is the sum of the two diagrams in Fig. 14.9. If the quark has $N_c$ colors, the number of diagrams is multiplied by $N_c$ and the decay rate has to be multiplied by $N_c^2$. The quark model calculation with $N_c = 3$ gives [7, 55]

r(jt ^2 Y ) = N^(Ql-Q 2 d ) 2 ^^ (14.42) 

where $f_\pi$ is the pion decay constant defined in Eq. (15.44). Calculation gives $\Gamma(\pi^0 \to 2\gamma)$ = 7.64 eV, in good agreement with the observed lifetime $\tau_{\rm exp} = (8.4 \pm 0.6) \times 10^{-17}$ s, which gives $\Gamma_{\rm exp} = \hbar/\tau = 7.84 \pm 0.6$ eV. Without color, the theoretical value would be a factor of 9 less than the experimental value.
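The conversion from the measured lifetime to a width, and the size of the color factor, can be checked with a short calculation. This is a sketch only: the value of ħ in eV·s is a standard constant that we supply, and the other numbers are those quoted in the text.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV*s (standard value, supplied here)

TAU_EXP_S = 8.4e-17          # observed pi0 lifetime quoted in the text
TAU_ERR_S = 0.6e-17
GAMMA_THEORY_EV = 7.64       # quark-model width with N_c = 3 quoted in the text
N_COLORS = 3


def width_from_lifetime(tau_s):
    """Decay width Gamma = hbar / tau in eV."""
    return HBAR_EV_S / tau_s


gamma_exp = width_from_lifetime(TAU_EXP_S)
gamma_err = gamma_exp * TAU_ERR_S / TAU_EXP_S   # first-order error propagation
gamma_no_color = GAMMA_THEORY_EV / N_COLORS**2  # rate scales as N_c^2

print(f"Gamma_exp = {gamma_exp:.2f} +- {gamma_err:.2f} eV")
print(f"theory with color: {GAMMA_THEORY_EV:.2f} eV, without color: {gamma_no_color:.2f} eV")
```

The colorless prediction falls far outside the experimental uncertainty, whereas the $N_c = 3$ value lies within it.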

3) Drell-Yan process ($p + p \to l\bar l + X$): In the p(n)-p reactions to make a lepton pair with large invariant mass, the production cross section can be shown to be proportional to $N_c$ (see Sect. 17.9.1) and a comparison with experiment shows $N_c = 3$.

4) R-value: The R-value is defined by

$R \equiv \frac{\sigma(e^-e^+ \to \text{all hadrons})}{\sigma(e^-e^+ \to \mu^-\mu^+)} = N_c \sum_i Q_i^2 = \begin{cases} 2 & \sqrt s < 2m_c \\ 10/3 & 2m_c < \sqrt s < 2m_b \\ 11/3 & 2m_b < \sqrt s \end{cases}$

Figure 14.9 Triangular diagrams: $\pi^0 \to \gamma + \gamma$ goes through quark loops.

Figure 14.10 One-photon exchange diagram for $\mu\bar\mu$ and hadron production. $R = \sigma(e^-e^+ \to \text{hadrons})/\sigma(e^-e^+ \to \mu^-\mu^+) = 3\sum_i Q_i^2$.

Feynman diagrams for $e^-e^+ \to \mu^-\mu^+$ and $e^-e^+ \to$ all hadrons are shown in Fig. 14.10.

Quarks are confined, which means that the quarks produced in the $e^-e^+$ reaction are converted 100% to hadrons. From Fig. 14.10 it is clear that production of a quark pair is a pure electromagnetic process and its cross section is $Q_q^2$ times that of $\mu\bar\mu$ production. Multiplying by $N_c$ and summing over all the kinds of quarks that can be produced gives the R-value.

Figure 14.11 gives a compilation of R-value data. It clearly shows that just above each threshold $\sqrt s > 2m_q$ a variety of resonances appear, but that the R-value agrees with the predictions at $\sqrt s \gg 2m_q$ (q = c, b) for each energy range, with mass values given in Table 2.2.
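The three plateau values of R follow directly from the quark charges. A minimal sketch with exact fractions is given below; the function name `r_value` and the flavor strings are illustrative.

```python
from fractions import Fraction

QUARK_CHARGE = {
    "u": Fraction(2, 3), "d": Fraction(-1, 3), "s": Fraction(-1, 3),
    "c": Fraction(2, 3), "b": Fraction(-1, 3),
}


def r_value(active_flavors, n_colors=3):
    """R = N_c * sum of squared charges over the quark flavors that can be pair-produced."""
    return n_colors * sum(QUARK_CHARGE[q] ** 2 for q in active_flavors)


for flavors in ("uds", "udsc", "udscb"):
    print(f"{'+'.join(flavors)}: R = {r_value(flavors)}")

print("u+d+s+c+b without color: R =", r_value("udscb", n_colors=1))
```

The colorless value for five flavors is what the dashed line in Fig. 14.11 represents; the data clearly lie a factor of three above it.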



Figure 14.11 Compilation of R-value data [311], plotted against $\sqrt s$ [GeV] for the regions u+d+s, u+d+s+c and u+d+s+c+b. Lines indicated by arrows represent the values 2, 10/3 and 11/3, respectively. The dashed line indicates the R-value for u + d + s + c + b with no color. Lines J/ψ, Υ indicate sharp resonances.
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5) The weak force carrier W ± boson decays into 

$W \to e\nu_e,\ \mu\nu_\mu,\ \tau\nu_\tau,\ u\bar d',\ c\bar s'$ (14.45)

and the quarks hadronize after the decay. $d'$, $s'$ are Cabibbo-rotated d, s (see Sect. 15.6.4). If the masses of the leptons and quarks are neglected compared to $m_W$ = 81 GeV, the partial decay rates are all the same because the coupling of the W boson is universal. Therefore, the branching ratio of $W \to e\nu_e$ (or any single leptonic decay) is 1/5 without color, but 1/9 if color is included. The experimental value is ~11% and supports the existence of the color degrees of freedom.
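The counting behind the two branching-ratio estimates is simple enough to write out explicitly. The sketch below assumes, as the text does, that all open channels have equal partial widths; the function name is our own.

```python
from fractions import Fraction

LEPTON_MODES = 3  # e nu_e, mu nu_mu, tau nu_tau
QUARK_MODES = 2   # u dbar', c sbar' (per color)


def leptonic_branching_ratio(n_colors):
    """Branching ratio of W -> e nu_e when every channel has the same partial width."""
    total_channels = LEPTON_MODES + QUARK_MODES * n_colors
    return Fraction(1, total_channels)


for n_colors in (1, 3):
    br = leptonic_branching_ratio(n_colors)
    print(f"N_c = {n_colors}: BR(W -> e nu) = {br} = {float(br):.1%}")
```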



14.4 

SU(6) Symmetry

14.4.1 

Spin and Flavor Combined 

SU(6) Multiplets As the quark has color degrees of freedom, a wave function of quark composites can be expressed as a product:

Total wave function = color × flavor × spin × space

Assuming all hadrons are color singlets and observed low-lying hadrons are in the S states of orbital angular momentum, the remaining degrees of freedom are 3 × 2 = 6. If the six degrees of freedom in flavor and spin are equivalent and if the strength of the interaction does not change under exchange among them, the symmetry to be considered is SU(6). Base vectors are

$(\xi^1, \xi^2, \cdots, \xi^6) = (u\uparrow, u\downarrow, d\uparrow, d\downarrow, s\uparrow, s\downarrow)$ (14.46)

We designate this as 6 or, writing flavor and spin separately, as (3, 1/2). The way to make product representations from the above bases is quite similar to that of SU(3). Meson multiplets are (see Appendix G)

$6 \otimes 6^* = 1 \oplus 35 = (1, 0) \oplus (1, 1) \oplus (8, 0) \oplus (8, 1)$ (14.47)

which means that mesons with both $J^P = 0^-$ and $1^-$ appear in a singlet and an octet. It is customary to combine them and call them a nonet. Baryons appear in

$6 \otimes 6 \otimes 6 = 20_A \oplus 70_{MA} \oplus 70_{MS} \oplus 56_S$ (14.48)

However, the Fermi statistics of the quark allows only the totally symmetric $56_S$ representation. Separating the flavor and the spin, we can re-express it as

$56 = (8, 1/2) \oplus (10, 3/2)$ (14.49)

Therefore the baryon $(1/2)^+$ octet and the $(3/2)^+$ decuplet exhaust the available spectra. Namely, the baryons, unlike the mesons, do not constitute a nonet. Note that in Sect. 14.1.5 we stated that there are two baryon octets with different symmetry; now we know that combined with spin wave functions they should make a totally symmetric wave function (see Problem 14.11).

Wave Functions in SU(6) Wave functions of the baryons in the flavor octet and those of spin are of mixed symmetry. The combined wave functions have to be totally symmetric in the S states. As an example, let us consider $|p\rangle \sim |uud\rangle$. With respect to the first two quarks it is symmetric, and hence the spin wave function of the first two must also be symmetric. A symmetric spin wave function is expressed as

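$|1, 1\rangle = \uparrow\uparrow$, $|1, 0\rangle = \frac{1}{\sqrt 2}(\uparrow\downarrow + \downarrow\uparrow)$, $|1, -1\rangle = \downarrow\downarrow$ (14.50)

In order to make a spin 1/2 wave function from the product of Eq. (14.50) and the third quark, we use the Clebsch-Gordan coefficients to give

$|1/2, 1/2\rangle = \sqrt{\frac{2}{3}}\,|1, 1\rangle\downarrow - \sqrt{\frac{1}{3}}\,|1, 0\rangle\uparrow = \frac{1}{\sqrt 6}\left[2\uparrow\uparrow\downarrow - (\uparrow\downarrow + \downarrow\uparrow)\uparrow\right] \equiv \chi_{MS}$ (14.51)

As the wave function must be totally symmetric, we make a cyclic replacement of the particles 1 → 2 → 3 and add them all together. Therefore

$|p\uparrow\rangle \sim uud\left[2\uparrow\uparrow\downarrow - (\uparrow\downarrow + \downarrow\uparrow)\uparrow\right] + \text{cyclic}$
$= \frac{1}{\sqrt{18}}\big(2u\uparrow u\uparrow d\downarrow - u\uparrow u\downarrow d\uparrow - u\downarrow u\uparrow d\uparrow$
$\quad + 2u\uparrow d\downarrow u\uparrow - u\downarrow d\uparrow u\uparrow - u\uparrow d\uparrow u\downarrow$
$\quad + 2d\downarrow u\uparrow u\uparrow - d\uparrow u\downarrow u\uparrow - d\uparrow u\uparrow u\downarrow\big)$ (14.52)

The normalization constant is determined to make $\langle p|p\rangle = 1$.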

Problem 14.10

Show that the same wave function is obtained if one starts from the antisymmetric wave function for the first two quarks

Problem 14.11

Using $\phi_{MS} = \frac{1}{\sqrt 6}\{2uud - (ud + du)u\}$ and $\chi_{MS}$, $\phi_{MA}$, $\chi_{MA}$ given in Eqs. (14.51) and (14.53), prove that the SU(6) wave function of Eq. (14.52) can be written as

$\phi_{MS}$, $\phi_{MA}$ are wave functions belonging to $8_{MS}$, $8_{MA}$, as is clear from the symmetry. $\phi_{MS}$, $\phi_{MA}$ of other baryons are given in Table G.2.
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Magnetic Moment of the Proton The S U(6) wave function is given by Eq. (14.52), 
but the relative spin direction of u and d is the same if cyclic permutation is performed. Therefore the first term is enough for considering the magnetic moment. The normalization, however, is important and has to be redefined each time. Consequently we consider

$|p\uparrow\rangle = \frac{1}{\sqrt 6}\left[2u\uparrow u\uparrow d\downarrow - u\uparrow u\downarrow d\uparrow - u\downarrow u\uparrow d\uparrow\right]$ (14.55)

The magnetic moment of the proton can be calculated as a vector sum of that of each quark. From the first term, the proton magnetic moment receives a contribution from the quarks $\mu_p = \mu_u + \mu_u - \mu_d$ with probability $(2/\sqrt 6)^2 = 2/3$. From the second and third terms, the proton receives $\mu_p = \mu_u - \mu_u + \mu_d$ with probability $2 \times (1/\sqrt 6)^2 = 1/3$.

$\therefore\ \mu_p = \frac{2}{3}(2\mu_u - \mu_d) + \frac{1}{3}\mu_d = \frac{1}{3}(4\mu_u - \mu_d)$ (14.56a)

$\mu_n = \frac{1}{3}(4\mu_d - \mu_u)$ (14.56b)

If SU(6) is an exact symmetry, $m_u = m_d$, hence $\mu_d = -\mu_u/2$, which makes $\mu_n/\mu_p = -2/3$. The agreement with the observed value (-0.685) is very good and was considered a great success of SU(6) symmetry. Here, we go one step further and calculate all the magnetic moments of the 1/2, 3/2 baryons, but regard $m_u$, $m_d$, $m_s$ as parameters to be adjusted to the observed values. Table 14.5 lists the magnetic moments of other baryons together with fitted and observed values.
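The bookkeeping that leads to $\mu_p = (4\mu_u - \mu_d)/3$ can be automated by building the full nine-term wave function of Eq. (14.52) and averaging the signed quark moments over it. The sketch below is one possible encoding (a quark is a pair of flavor and spin sign); the data layout and function names are our own and not from the text.

```python
from collections import defaultdict
from fractions import Fraction

# Terms of uud[2 up up down - (up down + down up) up]; spin +1 = up, -1 = down.
BASE_TERMS = [
    (2, (("u", +1), ("u", +1), ("d", -1))),
    (-1, (("u", +1), ("u", -1), ("d", +1))),
    (-1, (("u", -1), ("u", +1), ("d", +1))),
]


def proton_spin_up():
    """Unnormalized amplitudes of the proton wave function: base terms plus cyclic copies."""
    amplitudes = defaultdict(int)
    for coefficient, state in BASE_TERMS:
        for shift in range(3):
            rotated = state[shift:] + state[:shift]
            amplitudes[rotated] += coefficient
    return dict(amplitudes)


def magnetic_moment_coefficients(amplitudes):
    """Coefficients c_f such that mu = sum_f c_f * mu_f (expectation of sum_i sigma_z(i) mu_i)."""
    norm = sum(a * a for a in amplitudes.values())
    coefficients = defaultdict(Fraction)
    for state, a in amplitudes.items():
        for flavor, spin in state:
            coefficients[flavor] += Fraction(a * a * spin, norm)
    return dict(coefficients)


amplitudes = proton_spin_up()
print("number of terms:", len(amplitudes))
print("squared norm:", sum(a * a for a in amplitudes.values()))
print("mu_p coefficients:", magnetic_moment_coefficients(amplitudes))
```

Swapping the flavor labels u and d in `BASE_TERMS` gives the neutron in the same way.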



Problem 14.12 

Derive the expressions in the second column of Table 14.5. 

We see that all nine magnetic moments are well reproduced with only three adjustable variables. The quark masses determined from Table 14.5 using $\mu_q = Q_q e/(2m_q)$ are

$m_u$ = 338 MeV, $m_d$ = 322 MeV, $m_s$ = 510 MeV (14.57)

We see $m_u \simeq m_d \simeq m_p/3$, $m_s - m_u \simeq m_\Lambda - m_p$. They are called the constituent mass (effective static quark mass) in contrast to the current mass (see Table 14.5), which is determined by the scattering data. The constituent masses are known to reproduce static properties of the hadron, including the magnetic moment and the mass. The riddle of two different quark masses will be resolved later when we learn about asymptotic freedom (see Sect. 18.5.1). It states that quarks with the current mass are freely flying at almost the speed of light but confined inside a sphere with the effective size of hadrons ($< 10^{-15}$ m). In describing static properties, however, the contributions of the quark kinetic energy and the potential made by the color force field add up to make the constituent mass. The constituent quarks are almost at rest and nonrelativistic treatment reproduces the static properties of the hadron quite well.

Table 14.5 Magnetic moments of baryons (in units of the nuclear magneton) derived using SU(6) wave functions.

Baryon      Magnetic moment                      Theory    Observed value
p           (4 mu_u - mu_d)/3                    input     2.792847386 ± (63)
n           (4 mu_d - mu_u)/3                    input     -1.91304275 ± (45)
Λ           mu_s                                 input     -0.613 ± 0.004
Σ+          (4 mu_u - mu_s)/3                    2.67      2.458 ± 0.01
Σ0          (2 mu_u + 2 mu_d - mu_s)/3           0.79      -
Σ0 → Λ      (mu_d - mu_u)/sqrt(3) b              -1.63     -1.61 ± 0.08
Σ-          (4 mu_d - mu_s)/3                    -1.09     -1.160 ± 0.025
Ξ0          (4 mu_s - mu_u)/3                    -1.43     -1.250 ± 0.014
Ξ-          (4 mu_s - mu_d)/3                    -0.49     -0.651 ± 0.0025
Ω-          3 mu_s                               -1.84     -2.02 ± 0.05

b Transition magnetic moment Σ0 → Λ.
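The "Theory" column of Table 14.5 can be regenerated from the constituent masses of Eq. (14.57). The sketch below assumes that each quark carries a Dirac moment $Q_q e/(2m_q)$, which in units of the nuclear magneton is $Q_q m_p/m_q$; the proton mass value and all names in the code are supplied here for illustration.

```python
import math

M_PROTON = 938.27  # MeV, sets the nuclear magneton scale (assumed value)

# flavor -> (charge in units of e, constituent mass in MeV from Eq. (14.57))
QUARKS = {"u": (2 / 3, 338.0), "d": (-1 / 3, 322.0), "s": (-1 / 3, 510.0)}


def quark_moment(flavor):
    """Quark moment Q_q e / (2 m_q) expressed in nuclear magnetons."""
    charge, mass = QUARKS[flavor]
    return charge * M_PROTON / mass


def baryon_moments():
    """SU(6) quark-model expressions from Table 14.5, in nuclear magnetons."""
    u, d, s = (quark_moment(f) for f in "uds")
    return {
        "p": (4 * u - d) / 3,
        "n": (4 * d - u) / 3,
        "Lambda": s,
        "Sigma+": (4 * u - s) / 3,
        "Sigma0": (2 * u + 2 * d - s) / 3,
        "Sigma0->Lambda": (d - u) / math.sqrt(3),
        "Sigma-": (4 * d - s) / 3,
        "Xi0": (4 * s - u) / 3,
        "Xi-": (4 * s - d) / 3,
        "Omega-": 3 * s,
    }


moments = baryon_moments()
for name, value in moments.items():
    print(f"{name:>15s}: {value:+.2f}")
```

Because p, n and Λ were used as inputs to fix the three masses, only the remaining rows are genuine predictions.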

14.4.2 
SU(6) × O(3)

Excitations of orbital angular momentum with L = 0 (S), L = 1 (P), L = 2 (D), ... are possible. Using the spectroscopic terminology $^{2S+1}L_J(J^{PC})$ to denote quantum numbers of the excited states, the known $0^-$, $1^-$ are denoted as $^1S_0(0^{-+})$, $^3S_1(1^{--})$. For L ≥ 1 meson states, $^1P_1(1^{+-})$, $^3P_0(0^{++})$, $^3P_1(1^{++})$, $^3P_2(2^{++})$, ... have been observed (Fig. 14.12).
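The $J^{PC}$ assignments listed above follow from the standard rules for a quark-antiquark pair, $P = (-1)^{L+1}$ and $C = (-1)^{L+S}$. These rules are not written out in this section, so the sketch below states them as an assumption; the function name is illustrative.

```python
def meson_jpc(orbital_l, spin_s):
    """List the J^PC values of a q qbar state with orbital angular momentum L and total spin S.

    Assumes P = (-1)^(L+1) and C = (-1)^(L+S), the usual rules for a
    fermion-antifermion pair.
    """
    parity = "+" if (orbital_l + 1) % 2 == 0 else "-"
    c_parity = "+" if (orbital_l + spin_s) % 2 == 0 else "-"
    j_values = range(abs(orbital_l - spin_s), orbital_l + spin_s + 1)
    return [f"{j}{parity}{c_parity}" for j in j_values]


for l_value, label in ((0, "S"), (1, "P")):
    for s_value in (0, 1):
        print(f"{2 * s_value + 1}{label}: {meson_jpc(l_value, s_value)}")
```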



14.5.1 



In November 1974, an incident remembered as the "November Revolution" occurred. A precursor was the discovery that summer of an anomalously large peak by Ting and his group at $m_{ee} \approx 3$ GeV in the reaction $pp \to e^-e^+ + X$ carried out at BNL (Brookhaven National Laboratory). Its spectrum was so anomalous, the events showing a large, sharp peak with practically no background, that the group was not certain about the validity of their observation and repeated tests without making it public. Meanwhile, at SLAC (Stanford Linear Accelerator Center) on the west coast, Richter's group had been measuring the total cross sections of the $e^-e^+$ reaction since the summer at the electron-positron collider SPEAR and were frustrated by anomalous behavior of the observed reaction rate, which went up and down every time they re-measured, with no sign of stabilization. Eventually, they noticed that the lack of reproducibility might be due to the inaccuracy of the energy setting. They repeated the experiment with improved accuracy and they also observed a very narrow peak at 3 GeV. Ting named it the J-particle and Richter named it ψ, and today it is known as J/ψ. The reason why people made a fuss about J/ψ was that despite its large mass value of 3 GeV the width was far smaller than the experimental resolution of a few MeV. In conventional thinking, when the parent's mass is large compared to the offspring's mass (if they are a few pions, the total mass is at most 1/10 that of the parent), the parent is expected to have a short lifetime because of the large Q-value. In other words, the decay width should be at least as large as that of resonances with comparable mass (for instance ρ has mass 770 MeV and width $\Gamma_\rho \simeq 150$ MeV). The situation is very similar to the discovery of the strange particles. Hot debates ensued, but it was soon realized that J/ψ is composed of the fourth, "charm" quark; Ting and Richter received the Nobel prize in 1976. The short duration between the discovery and the award of the prize was second only to the discovery of parity violation by Lee and Yang, indicating the large impact that it made on the high-energy community.

Figure 14.12 A table of observed mesons classified by SU(6) × O(3) [251, 311]. Particles in the blank spaces have not yet been observed. At the right of each box the $J^{PC}$'s are indicated. S', S'', P' are radial excitations of S and P.
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14.5.2 

Mass and Quantum Number of J/ψ

J/ψ was produced in the following reactions [33, 34]:

BNL: $p + \mathrm{Be} \to J/\psi + \cdots$, $J/\psi \to e^-e^+$ (14.58a)

SLAC: $e^- + e^+ \to J/\psi \to e^-e^+,\ \mu^-\mu^+,\ \text{hadrons}$ (14.58b)

Figures 14.13 and 14.14 show the J and ψ production cross section results. Note the scale of the abscissa. In Figure 14.15 the cross section goes down just before rising to the peak of the resonance. This is due to an interference effect between the photon and J/ψ intermediate states in $e\bar e \to \mu\bar\mu$ and indicates that J/ψ shares the quantum numbers $J^{PC} = 1^{--}$ with the photon. The shape of the resonance before and after the peak is not symmetric. This is due to the radiative effect, in which the electron and positron with larger energy than the resonance mass emit photons and lose energy, and the remaining less energetic electron-positron pair forms the resonance.



Isospin and C-Parity of J/ψ If I = 1, the decay to $\rho^0\pi^0$ is forbidden. This is easily seen by looking at the Clebsch-Gordan coefficients for making I = 1 from two I = 1 objects. I = 0 was confirmed by the decay mode to $\Lambda\bar\Lambda$ and

y/y^ * } * (14 .59) 

Since C = (-), this gives $G = (-1)^I C = (-)$. This is also seen from the predominant decay into an odd number of pions rather than an even number of pions.

14.5.3 
Charmonium 

Potentials of the Quarks Following the discovery of J/ψ (3096.9 ± 0.1 MeV), a higher state ψ' = ψ(3686.0 ± 0.1 MeV) and its decay products $\chi_{c0}$(3415.1 ± 1.0), $\chi_{c1}$(3510.6 ± 0.5), $\chi_{c2}$(3556.3 ± 0.4), etc. from $\psi' \to \gamma\chi$ were discovered successively. When one makes a chart in terms of $J^{PC}$ (Fig. 14.16), it resembles the bound state levels of positronium, made of an electron and a positron, and thus they are named charmonium states. From the similarity of the levels, their structure is expected to be almost identical. Namely, it is inferred that a pair of point particles with spin 1/2 interacting in a Coulomb-like potential has formed a bound state. The level structure is better reproduced (Fig. 14.17) if one uses

$V(r) = -\frac{4}{3}\frac{\alpha_s}{r} + kr$ 7) (14.60)

7) $V = r^N$, $-1 \le N \le 2$ ($1/r$, $\ln r$, $r$, $r^2$) produce similar spectra. However, only V ~ r gives the Regge trajectories ($J \propto m^2$) shown in Fig. 14.18.

Figure 14.13 J-particle production in p-Be collisions: the $e^-e^+$ invariant mass distribution. The experiment was carried out at Brookhaven National Laboratory [33].

The coefficient 4/3 of the first term is introduced to be consistent with the QCD potential, which will be described later [see Eq. (14.86)]. The second term, if extrapolated to $r \to \infty$, gives the confining potential, because to separate the two quarks to $r = \infty$ requires an infinite amount of energy.

Evaluation of $\alpha_s$ For a short distance ($r < 1/m_\pi \sim 10^{-15}$ m), the first term is dominant and we can drop the second term for a qualitative argument about the bound state. 5) Energy levels of a bound particle due to the Coulomb potential (i.e. the levels of hydrogen atoms and positronium) are expressed as

$E_n = -\frac{\alpha^2\mu}{2n^2}$, n: principal quantum number (14.61)

where α is the fine structure constant and $\mu = m_1m_2/(m_1 + m_2)$ is the reduced mass. The energy difference between the two levels of $^3S_1$ with n = 1 and n = 2

>e discussed later. For the case of charmonium, 
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Figure 14.14 Producti 
ft-/i+ and (c) «"«+. 
Center [34, 73]. 

is given by

$\Delta E = \frac{3}{16}\alpha^2 m_e$ (14.62)

where the reduced mass $m_e/2$ is used for the positronium. For charmonium, we change $\Delta E \to 600$ MeV, $\alpha \to (4/3)\alpha_s$, $m_e \to m_c$. We set $m_c \simeq 3700/2$ MeV, because while the decay width of ψ'(3686) is narrow, that of ψ'' = ψ(3770) → $D\bar D$ (D, $\bar D$ are hadrons which contain a charm quark) is wide (~25 MeV), indicating a strong decay. In other words, the mass of ψ exceeds $2m_c$ somewhere between the two levels. Inserting numerical values ($m_e \simeq 0.5$ MeV, $\alpha \simeq 1/137$), we obtain $\alpha_s \simeq 1$. It is very surprising that levels at energy scales differing by a factor of 100 million exhibit the same structure. Thus, it was clarified that a Coulomb-like force, but 100 times stronger, is at work between the quarks.
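The estimate $\alpha_s \simeq 1$ is a two-line inversion of Eq. (14.62) after the replacements described above. The following sketch carries it out and also reproduces the "factor of 100 million" between the two energy scales. The precise values of α and $m_e$ are standard constants that we supply; the function names are illustrative.

```python
import math

ALPHA = 1 / 137.036       # fine structure constant (standard value)
M_ELECTRON_EV = 0.511e6   # electron mass in eV (standard value)
M_CHARM_MEV = 3700 / 2    # charm quark mass estimate used in the text
GAP_CHARMONIUM_MEV = 600  # level spacing used in the text


def coulomb_level_gap(coupling, constituent_mass):
    """E_2 - E_1 = (3/16) * coupling^2 * m for two equal masses m (reduced mass m/2)."""
    return 3 / 16 * coupling**2 * constituent_mass


def alpha_s_from_gap(gap, quark_mass):
    """Invert the gap formula with the replacement coupling = (4/3) * alpha_s."""
    effective_coupling = math.sqrt(16 * gap / (3 * quark_mass))
    return 0.75 * effective_coupling


positronium_gap_ev = coulomb_level_gap(ALPHA, M_ELECTRON_EV)
alpha_s = alpha_s_from_gap(GAP_CHARMONIUM_MEV, M_CHARM_MEV)
scale_ratio = GAP_CHARMONIUM_MEV * 1e6 / positronium_gap_ev

print(f"positronium gap = {positronium_gap_ev:.2f} eV")
print(f"alpha_s = {alpha_s:.2f}, alpha_s / alpha = {alpha_s / ALPHA:.0f}")
print(f"charmonium / positronium energy scale = {scale_ratio:.2e}")
```

The ratio $\alpha_s/\alpha$ comes out above one hundred, consistent with the statement that the force is about 100 times stronger than the Coulomb force.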



[Figure 14.15 Interference effect between $\gamma$ and $J/\psi$ [73]. If $J/\psi$ has the same quantum 
number as $\gamma$, the interference effect appears in the $e^-e^+ \to \mu^-\mu^+$ reaction as a change 
sets in (solid line). If the quantum number is different, no interference occurs (dashed line). 
The data show the existence of the interference. Horizontal axis: collision energy.] 

Confinement and the Regge Trajectory The Regge trajectory represents the phenomenon 
that a series of particles with the same quantum numbers other than 
spin are on straight tracks with the same slope on the $m^2$-$J$ plane (Figs. 14.18 
and 13.29). We shall show that $V(r) = kr$ can reproduce the Regge trajectories. 
Assume the attractive potential $V(r) = kr^n$, that two massless quarks are rotating 
on a circle and that the centrifugal force is in balance with the potential. The mass 
of the two-particle system, i.e. the total energy, is given by 

$$H = 2p + k r^n \tag{14.63}$$



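As they are rotating, they have angular momentum $J = 2pr$, so that $H = J/r + kr^n$. 
They are stationary at the distance where $H$ is a minimum: 

$$\frac{\partial H}{\partial r} = -\frac{J}{r^2} + n k r^{n-1} = 0 \quad\Rightarrow\quad r = \left(\frac{J}{nk}\right)^{1/(n+1)} \tag{14.64}$$

Inserting Eq. (14.64) in Eq. (14.63) we have 

$$H \propto J^{\,n/(n+1)} \quad\rightarrow\quad J \propto M^{(n+1)/n} \tag{14.65}$$

To reproduce the Regge trajectory, $n$ has to be 1. Inserting $n = 1$ and recalculating, 
we obtain 

$$J = \frac{m^2}{4k} = \frac{(mc^2)^2}{4k\hbar c} \tag{14.66}$$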

where we have recovered $\hbar$ and $c$ in the last equation. From Figure 14.18 the slope 
of the Regge trajectories is obtained to be ~ 1 (GeV)$^{-2}$ and 

$$k = \frac{1}{4 \times \text{slope}} \simeq 0.25\ \mathrm{GeV}^2 \simeq 1.3\ \mathrm{GeV}/10^{-15}\,\mathrm{m} \tag{14.67}$$

which is often referred to as the string tension. The work necessary to separate the 
quarks to a distance of 1 m is ~ $10^{15}$ GeV ~ 20 tons x 1 m. It is impossible to give 
that amount of energy to a quark. 
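The chain from the linear potential to the string tension can be checked numerically. The sketch below is illustrative only: it minimizes $H(r) = J/r + kr$ (the $n = 1$ case, in units $\hbar = c = 1$) by a simple ternary search, compares the result with $m^2 = 4kJ$, and converts a slope of 1 GeV$^{-2}$ into GeV per femtometre. The function name and the search interval are assumptions made for this example.

```python
HBARC = 0.19733  # hbar * c in GeV fm


def min_energy(J, k, lo=1e-3, hi=1e3, steps=300):
    """Minimum of H(r) = J/r + k*r over r (GeV^-1), found by ternary search."""
    def energy(r):
        return J / r + k * r

    for _ in range(steps):
        a = lo + (hi - lo) / 3.0
        b = hi - (hi - lo) / 3.0
        if energy(a) < energy(b):
            hi = b      # minimum lies left of b
        else:
            lo = a      # minimum lies right of a
    return energy(0.5 * (lo + hi))


slope = 1.0                          # Regge slope in GeV^-2, read off Fig. 14.18
k_gev2 = 1.0 / (4.0 * slope)         # string constant in GeV^2
k_gev_per_fm = k_gev2 / HBARC        # string constant in GeV per 1e-15 m
work_1m_gev = k_gev_per_fm * 1e15    # energy stored at 1 m separation, in GeV

for J in (1.0, 2.0, 3.0):
    m2 = min_energy(J, k_gev2) ** 2
    print(f"J = {J:.0f}: m^2 = {m2:.4f} GeV^2, 4kJ = {4 * k_gev2 * J:.4f} GeV^2")

print(f"k ~ {k_gev_per_fm:.2f} GeV/fm, work over 1 m ~ {work_1m_gev:.1e} GeV")
```

The numerical minimum reproduces the straight line $J = m^2/4k$, and the conversion gives $k \approx 1.3$ GeV per $10^{-15}$ m.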
[Figure caption, beginning lost: [...] by using various kinds of potentials [101]. The 
rightmost column shows the levels of a harmonic oscillator ($V \sim r^2$), the second from 
right is due to a linear potential and the third [...] potentials [Eq. (14.60)]. Introducing 
spin couplings reproduces the observed spectrum shown in the fourth column (at the left).] 

[Figure 14.18 Regge trajectories: A [...] numbers except their spin are on straight 
tracks ($J$ [...] -$J$ plane. This is called the Chew-Frautschi plot.] 

[Figure 14.19 Energy stored between quarks increases in proportion to the separation 
and behaves like a string with constant tension. If a pair of quarks is produced in 
vacuum and combines with the original pair to make a new pair of color singlets, the 
original color force is terminated there and the string is cut. In other words, for 
$r > 10^{-15}$ m it is more advantageous to cut the string and hadronize than to stretch 
the string further, and many hadrons are created.] 

$V(r) = kr$ means the quarks are attracted to each other by a string with constant 
tension. Hence $k$ is referred to as the string constant. As 1.3 GeV ~ $9m_\pi$, an energy 
equivalent of 9 $\pi$'s is stored if the two quarks are ~ $10^{-15}$ m apart. One notices 
that this is about the size and mass of a hadron. If the string between the quarks is 
stretched, a large amount of energy is stored in it. According to the Heisenberg 
uncertainty principle, pairs of particles and antiparticles are constantly produced and 
annihilated in the vacuum. If a pair is produced on the line, they make two color 
singlets with the original quarks. Then the force between the pair is weaker than 
that between the members of the singlet and the string is cut off. This means that 
making hadrons by extracting quark pairs from the vacuum is more advantageous 
than stretching the string further (Fig. 14.19). Thus, the string stretched between 
two energetic quarks is cut into pieces, producing many hadrons, which is referred 
to as quark hadronization. This is why the quark is believed to be unable to separate 
as a single entity (confinement). An energetic quark (and gluon also) is converted 
into many hadrons flying as a bundle in the same direction, which is observed as 
a jet phenomenon. Figure 14.20a shows a typical two-jet event ($e^- + e^+ \to q + \bar{q}$) 
observed by the VENUS detector of the TRISTAN $e^-e^+$ collider [285, 380]. One 
immediately observes that many tracks and associated energies are concentrated in two 
opposite directions. Figure 14.20b shows an azimuthal angle distribution of the 
jet axis produced by polarized electrons at SPEAR [195]. The theoretical angular 
distribution of a particle in the $e^-e^+$ reaction is expressed as 

$$\frac{d\sigma}{d\Omega} \propto 1 + a\cos^2\theta + P^2 a \sin^2\theta\cos 2\phi \tag{14.68}$$

where $P$ is beam polarization and $a = +1$ for spin 1/2 particles and $a = -1$ for 
spin 0 particles. The data show $a = 0.78 \pm 0.12$, confirming the quark's spin to be 
1/2. 

14.5.4 
Width of $J/\psi$ 

The resonance width of the $J/\psi$ was narrower than the experimental resolution. 
We discuss a method to determine the width. When there is a resonance, the total 
cross section to a final state $f$ is given by [see Eq. (13.85)] 

$$\sigma(e^-e^+ \to J/\psi \to f) = \frac{2J+1}{(2s_1+1)(2s_2+1)}\,\frac{4\pi}{p^2}\left(\frac{\Gamma_e}{\Gamma}\right)\left(\frac{\Gamma_f}{\Gamma}\right)\frac{\Gamma^2/4}{(E-E_R)^2+\Gamma^2/4} \tag{14.69}$$

Here, $\Gamma_e/\Gamma$ and $\Gamma_f/\Gamma$ are the branching ratios of the $J/\psi$ decaying to electrons 
or the final state $f$, $s_1 = s_2 = 1/2$ are electron spins and $J = 1$. The observed 
value is a cross section convoluted with the experimental resolution $\delta_{\rm exp}$. Under 
normal circumstances, $\Gamma \gg \delta_{\rm exp}$ and the functional shape of Eq. (14.69) can be 
reproduced faithfully. If, on the other hand, $\Gamma$ is much smaller than $\delta_{\rm exp}$ (here, 
$\delta_{\rm exp}/E \sim 0.1\%$, $\delta_{\rm exp} \sim 3$ MeV $\gg$ 93 keV), the measured cross section is nothing 
but the error function, i.e. the shape of the experimental resolution (see Fig. 14.21). 

[Figure 14.20 Quark jets produced in [...] of tracks in the central drift chamber. 
Numbers in rectangles at outer circle are energy deposits in the lead glass calorimeter. 
Lower right: Back-to-back emission of two sharply peaked jets is evident in the Lego 
plot, a 3-dimensional display of energy deposits in [...]-$z$ plane [285]. (b) Azimuthal 
angular distribution of the jet axis produced by a polarized beam. $a = 1$ for spin 1/2 
and $a = -1$ for spin 0 particles [195].] 

When $f = ee$, the integral of Eq. (14.69) over energy with $p = 1548$ MeV/c fixed 
at the resonance peak gives 

$$\int \sigma(E)\,dE = \frac{3\pi^2}{2p^2}\left(\frac{\Gamma_{ee}}{\Gamma}\right)^2\Gamma \tag{14.70}$$



which should be equal to the area under the resonance curve of $\sigma(ee \to e^-e^+)$ 
(Fig. 14.14c). As the branching ratio $\Gamma_{ee}/\Gamma = 0.0594$ can be measured independently, 
comparison with the data gives $\Gamma = 93.2 \pm 2.1$ keV, $\Gamma_{ee} = 5.52 \pm 0.14$ keV. 
The smallness of $\Gamma = 93$ keV may be recognized by comparing it with the hadronic 
decay widths of the vector mesons $\rho$, $\omega$ and $\phi$: 

$\rho$: $M = 768$ MeV, $\Gamma_\rho = 152$ MeV, $\Gamma_{\rho\to ee} = 6.8$ keV 
$\omega$: $M = 782$ MeV, $\Gamma_\omega = 8.4$ MeV, $\Gamma_{\omega\to ee} = 0.6$ keV 
$\phi$: $M = 1020$ MeV, $\Gamma_\phi = 4.4$ MeV, $\Gamma_{\phi\to ee} = 1.37$ keV 

We see the width of $J/\psi$ is 1000 times smaller than typical hadron widths and 
almost comparable with the electromagnetic decay widths. 

Figure 14.22 (a) $J/\psi \to 3\pi$ has disconnected charm lines. (b) In $\psi(3770) \to D\bar{D}$ the charm 
lines are connected. By the OZI rule, (a) is heavily suppressed. Compare with Fig. 14.7. 
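As a rough numerical illustration of the width determination, the sketch below integrates a Breit-Wigner cross section analytically over energy for $f = ee$ and compares $\Gamma(J/\psi)$ with the $\rho$, $\omega$, $\phi$ widths listed above. It assumes the standard Breit-Wigner normalization with branching ratios $\Gamma_e/\Gamma$ and $\Gamma_f/\Gamma$ and uses $(\hbar c)^2 \simeq 0.3894$ mb GeV$^2$ to convert to nb; the function and variable names are illustrative.

```python
import math

HBARC2_NB = 0.3894e6    # (hbar c)^2 in nb GeV^2, used for unit conversion
P_PEAK = 1.548          # beam momentum at the resonance peak, GeV/c
WIDTH = 93.2e-6         # total width of J/psi in GeV
BR_EE = 0.0594          # branching ratio Gamma_ee / Gamma


def integrated_cross_section(p, br_in, br_out, width, spin_j=1):
    """Energy integral of a Breit-Wigner for unpolarized spin-1/2 beams, in nb GeV."""
    stat = (2 * spin_j + 1) / 4.0                 # (2J+1) / ((2s1+1)(2s2+1))
    lorentz_area = math.pi * width / 2.0          # integral of (G^2/4)/((E-E_R)^2 + G^2/4)
    area = stat * 4.0 * math.pi / p**2 * br_in * br_out * lorentz_area
    return area * HBARC2_NB


area_nb_mev = integrated_cross_section(P_PEAK, BR_EE, BR_EE, WIDTH) * 1e3
gamma_ee_kev = BR_EE * WIDTH * 1e6

hadronic_widths_mev = {"rho": 152.0, "omega": 8.4, "phi": 4.4}
ratios = {name: w / (WIDTH * 1e3) for name, w in hadronic_widths_mev.items()}

print(f"area under the e-e+ resonance curve ~ {area_nb_mev:.0f} nb MeV")
print(f"Gamma_ee ~ {gamma_ee_kev:.2f} keV")
for name, ratio in ratios.items():
    print(f"Gamma({name}) / Gamma(J/psi) ~ {ratio:.0f}")
```

Because the integral depends only on $\Gamma_{ee}^2/\Gamma$, the measured area together with the independently measured branching ratio fixes $\Gamma$ even when the line shape itself is washed out by the beam resolution.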

This was why people were surprised by the discovery of $J/\psi$. As the mass of 
$J/\psi$ is far larger than $p$, $n$, $\Lambda$, $\Sigma$, etc., it should decay strongly to pions and other 
light particles, giving $\Gamma \sim O(100)$ MeV, unless some selection rule is at work. Indeed, 
$\psi(3770)$ ($m = 3772.92 \pm 0.35$ MeV) has a broad width ~ $27.3 \pm 1.0$ MeV and predominantly 
decays to $D^+D^-$ or $D^0\bar{D}^0$ (Fig. 14.22b). The $D$ particles were discovered in 
$e^- + e^+ \to D^+D^-,\ D^0\bar{D}^0$ [114, 177, 178, 321] and they are quasistable particles. Their 
masses and lifetimes are 

$$M(D^\pm) = 1869.6 \pm 0.2\ \mathrm{MeV}, \qquad \tau(D^\pm) = (10.40 \pm 0.07) \times 10^{-13}\ \mathrm{s} \tag{14.72a}$$

$$M(D^0) = 1864.8 \pm 0.17\ \mathrm{MeV}, \qquad \tau(D^0) = (4.10 \pm 0.015) \times 10^{-13}\ \mathrm{s} \tag{14.72b}$$

These phenomena cannot be explained if these particles contain only the known 
quarks $u$, $d$, $s$. They can be understood if $J/\psi$, $\psi(3770)$ are assumed to be made of 
new quarks $c\bar{c}$. The fact that $D$ particles are heavy ($m \sim 2$ GeV) and yet quasistable 
can be understood if they contain $c$ or $\bar{c}$. We now know that $D^+ = c\bar{d}$, $D^- = \bar{c}d$, 
$D^0 = c\bar{u}$, $\bar{D}^0 = \bar{c}u$. The $c$ quark has a new quantum number "charm" that is 
conserved in the strong interaction, hence a charmed particle cannot decay strongly to 
ordinary particles containing only $u$, $d$, $s$, but it can decay weakly. $J/\psi$ has total charm 
quantum number zero and the decay mode conserves the charm quantum number, but drawing 
quark lines of the decay, the charm line has to be closed (see Fig. 14.22a). This is 
because $m(J/\psi) < 2m_D$ and no process satisfying the OZI rule (see Sect. 14.2.3) 
exists. On the other hand the charm quark line in the decay $\psi(3770) \to D\bar{D}$ is 
connected and hence the OZI rule is satisfied. 

In the language of QCD, where the quark lines are disconnected, they are connected 
by gluon lines. Each time a gluon is exchanged, a factor $\alpha_s/2\pi \sim 0.1$, 
where $\alpha_s = g^2/4\pi$, has to be multiplied. For $J/\psi$ three gluons are exchanged 
(Problem 14.13). Therefore the decay rate is suppressed by $(\alpha_s/2\pi)^6$ compared to 
ordinary strong decays. 

Problem 14.13 

Explain why there have to be at least three gluons in the intermediate state in 

Fig. 14.22. 



14.5.5 

Lifetime of Charmed Particles 

The fourth quark was proposed as early as 1964 based on baryon-lepton symmetry 
[196, 270], 10) but it was not until 1970 that it was taken seriously, when Glashow, 
Iliopoulos and Maiani showed that the absence of strangeness-changing neutral 
weak current can be explained if the existence of the charm quark is assumed (GIM 
mechanism [174], see Sect. 15.7.1 for details). It constitutes a doublet in the weak 
interaction paired with the $s$ quark. If $D^0$ is made of the charm quark in a weak 
doublet, it can decay predominantly to the $s$ quark by emitting a $W^+$ weak boson, 
producing a variety of decay products (Fig. 14.23). Let us consider a semileptonic 
decay $D^0 \to K^- + l^+ + \nu_l$ ($l = e, \mu$). Here, the $\bar{u}$ quark does not participate 
in the decay, playing only a spectator role, and the process can be considered as 
$c \to s + l^+ + \nu_l$, which has identical kinematical structure to $\mu \to e + \bar{\nu}_e + \nu_\mu$, 
differing only in the mass values. Since the lifetime of the muon is $\propto (G_F^2 m_\mu^5)^{-1}$ 
[see Eq. (15.84)], the lifetime of $D$ is estimated to be roughly 

$$\tau_D \sim \mathrm{BR}(D \to K^- e^+ \nu_e)\left(\frac{m_\mu}{m_c}\right)^5 \tau_\mu \sim 0.036 \times (0.1/1.5)^5 \times 2.2 \times 10^{-6}\ \mathrm{s} \sim 10^{-13}\ \mathrm{s} \tag{14.73}$$

which reproduces the experimental value ($\tau_{D,\rm exp} = (4.10 \pm 0.02) \times 10^{-13}$ s) 
approximately. This shows that the $D$ meson decays weakly. 

10) The charm quark itself was discovered by Niu [297] prior to the November revolution. 
His group found a short-lived particle, $\tau \sim 10^{-14}$ s, decaying to $\pi^0$ and a charged 
particle in an emulsion stack placed in an airplane. The Nagoya group analyzed it 
regarding it as the fourth quark. However, as it was a cosmic ray event with poor statistics 
and not definitive, the discovery was largely ignored outside Japan. 

[Figure 14.23: caption largely lost; surviving fragments read "[...] only as a spectator [...]" 
and "[...] example of hadronic decay $D^+ \to$ [...]".] 
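The scaling argument of Eq. (14.73) is simple enough to evaluate directly. The snippet below is a minimal sketch using the rounded inputs from the text ($m_\mu \simeq 0.1$ GeV, $m_c \simeq 1.5$ GeV, $\tau_\mu \simeq 2.2 \times 10^{-6}$ s, branching ratio 0.036); the helper name `scaled_lifetime` is illustrative.

```python
def scaled_lifetime(branching_ratio, m_ref, m_heavy, tau_ref):
    """Scale a reference weak lifetime by (m_ref / m_heavy)^5 times a branching ratio."""
    return branching_ratio * (m_ref / m_heavy) ** 5 * tau_ref


tau_d = scaled_lifetime(
    branching_ratio=0.036,   # BR(D -> K- e+ nu_e), as quoted in the text
    m_ref=0.1,               # muon mass in GeV (rounded)
    m_heavy=1.5,             # charm quark mass in GeV (rounded)
    tau_ref=2.2e-6,          # muon lifetime in seconds
)
tau_d_exp = 4.10e-13         # measured D0 lifetime in seconds

print(f"estimated tau_D ~ {tau_d:.2e} s")
print(f"estimate / experiment ~ {tau_d / tau_d_exp:.2f}")
```

The estimate lands within a factor of a few of the measured value, which is all the spectator picture is expected to deliver given the steep fifth-power dependence on the assumed charm quark mass.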

14.5.6 

Charm Spectroscopy: SU(4) 

If there are four quarks, the flavor symmetry $SU(3)$ is extended to $SU(4)$. However, 
the mass of the charm quark ~ 1.3 GeV 11) is far greater than the others, and $SU(4)$ is 
not considered a good symmetry. Nevertheless, it is useful in classifying and 
predicting the existence of hadrons containing the $c$ quark. We briefly describe 
multiplets that are generated by $SU(4)$ (see Appendix G for $SU(4)$ algebra). The 
meson nonets in $SU(3)$ become 16-plets in $SU(4)$. 

Charmed Mesons 

$$4 \otimes 4^* = 1 \oplus 15 \tag{14.74}$$

Among the 15 members of 15, we already know eight members, which do not 
contain $c$. The remaining seven members contain a $c$ quark and are called charmed 
mesons. They are 

$$C = +1:\ D^0(c\bar{u}),\ D^+(c\bar{d}),\ D_s^+(c\bar{s}); \qquad C = -1:\ \bar{D}^0(\bar{c}u),\ D^-(\bar{c}d),\ D_s^-(\bar{c}s) \tag{14.75}$$

Their configuration in $I_3$-$Y$-$C$ space can be expressed as in Fig. 14.24, (a) for $0^-$ 
and (b) for $1^-$. All of them have already been discovered. 

11) This is the current mass, which is somewhat smaller than the [...] drastically small [...] 



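For baryons, the product of three quartets decomposes as 

$$4 \otimes 4 \otimes 4 = 4^*_A \oplus 20_M \oplus 20_M \oplus 20_S \tag{14.76}$$

The two $20_M$'s with mixed symmetry correspond to $J^P = (1/2)^+$ and the totally symmetric 
$20_S$ corresponds to $J^P = (3/2)^+$. Consequently, there are 12 charmed baryons with 
$(1/2)^+$ and 10 charmed baryons with $J^P = (3/2)^+$ (Fig. 14.24c,d). $\Lambda_c^+$, $\Sigma_c$, $\Xi_c$ and $\Omega_c$ 
have been observed. 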



[Figure 14.24 Charm spectrum: [...] meson 16 and baryon 20. (a) $0^-$, (b) $1^-$, (c) $(1/2)^+$, 
(d) $(3/2)^+$. The [...] planes of (a) and (b) represent $C =$ [...] except for [...] and $J/\psi$ 
are $SU(3)$ multiplets. The plane at the bottom represents $C = 0$ baryons, which already 
appeared in $SU(3)$.] 

14.5.7 

The Fifth Quark b (Bottom) 

A repeat of almost the same procedure was used to discover the upsilon $\Upsilon$ ($b\bar{b}$ bound 
state) in 1977 [213], shortly after the charm discovery: 

$$p + p \to \Upsilon + X, \qquad \Upsilon \to \mu^- + \mu^+ \tag{14.77}$$

and later, a series of upsilons were discovered [323] at DESY (Deutsches Elektronen-Synchrotron) 
and at CESR (Cornell Electron Storage Ring) [24, 71]: 

$$e^- + e^+ \to \Upsilon(9460),\ \Upsilon'(10023),\ \Upsilon''(10355),\ \Upsilon'''(10580) \tag{14.78}$$

Among them the first three had narrow widths and $\Upsilon'''$ had a broad width, indicating 
$m_b \sim 5$ GeV. Figure 14.25 shows the first three upsilons. 

The level structure of the $\Upsilon$'s is quite similar to that of the charmonium and 
collectively they are called bottomonium. However, corresponding to the larger mass 
$m_b \sim 5$ GeV, bound states are formed deep in the potential, and up to $\Upsilon(3S)$ are 
quasistable (Fig. 14.26). For the observed hadrons that include the $b$ quark, refer to 
the Particle Data Group table [311]. 



14.6 

Color Charge 



The discovery of the color degrees of freedom was the key to the true understanding 
of the strong force. It was also the discovery of the "color charge". In electromagnetism, 
the electric force is generated by an electric charge and the magnetic force 
by its current. Since current is nothing but a moving charge, the electric and magnetic 
forces are unified by the relativity principle. The current is a Lorentz four-vector 
and its 0th or scalar component generates the electric force and is spin independent 
in its rest frame. The space component, a vector, generates the magnetic 
force, which acts on (couples to) the current (the moving charge) or more generally 
angular momentum, including spin. This characteristic is closely related to the fact 
that the photon is a Lorentz vector. Just like the electromagnetic force, the strong 
force (or more rigorously color force to discriminate it from the nuclear force) is 
generated by the color charge and its current. This is why color force acts like the 
Coulomb force at short distances. Since the characteristics of the two forces are 
very much alike, the color force is expected to obey the same mathematical framework 
as that of the electromagnetic force, namely gauge theory. The difference lies 
in the number of charges. There are three color charges ($R$, $G$, $B$), which generate 
the strong force with equal strength. The action (or emission/absorption of the 
force carrier, which is referred to as the gluon) changes the color. So we can infer 
that the characteristics of the color force are basically similar to those of the 
electromagnetic force with added effects due to color exchange. We have already learned 
that the nuclear exchange force stems from the exchange of pions, which carry 
isospin. Unlike isospin, whose symmetry is $SU(2)$, color comes in three kinds and 
its symmetry is $SU(3)$. This is the origin of color $SU(3)$ and, unlike flavor $SU(3)$, 
it is considered to be an exact symmetry. 12) 

[Figure 14.25 $\Upsilon$, $\Upsilon'$, $\Upsilon''$ observed in $e$ [...]. Horizontal axis: mass (GeV).] 

[Figure 14.26 Level structure of bottomonium [311].] 

From the above arguments, it is natural that a gauge theory based on $SU(3)$ has 
emerged as a strong candidate for governing the strong interaction. Because of 
its similarity to QED (quantum electrodynamics), the mathematical framework of 
the color force is called QCD (quantum chromodynamics). A strict formalism for 
constructing the QCD Lagrangian will be given in Chap. 18, but here we show that 
a lot of insight into the QCD features can be obtained by simple color counting 
if we restrict our arguments to the lowest order corrections in the gluon emission 
or exchange, namely as long as we do not touch the nonlinear aspect of QCD. 
Much of the elementary dynamics can be obtained using the QED formulae we already 
know with additional color factors. To obtain a glimpse of the power of QCD, we 
consider a color exchange potential and how low-lying mass levels of the hadron 
are reproduced. 

The force carrier, the gluon, comes in eight kinds. Let us try to understand why. 
Analogy with the nuclear force helps because this is also an exchange force obeying 
$SU(2)$ symmetry that alters the identity of the force source, $p$ (proton) or $n$ (neutron), 
by the force action. The nuclear force carrier pion comes in three kinds. If 
we omit the space-time structure, the nuclear force interaction is expressed as 

$$H = g\sum_i \bar{\psi}\,\tau_i\,\psi\,\pi_i = g\left[\sqrt{2}\,(\bar{p}n\,\pi^+ + \bar{n}p\,\pi^-) + (\bar{p}p - \bar{n}n)\,\pi^0\right], \qquad \psi = \begin{pmatrix} p \\ n \end{pmatrix},\quad \pi^\pm = \frac{\pi_1 \mp i\pi_2}{\sqrt{2}},\quad \pi^0 = \pi_3 \tag{14.79}$$

where $\tau_i$ is the Pauli matrix. The pion field operators ($\pi^+$, $\pi^0$, $\pi^-$) have the same 
isospin function as $\{p\bar{n},\ (p\bar{p} - n\bar{n})/\sqrt{2},\ n\bar{p}\}$ and change the isospin component of 
the force source $\psi$. For instance, absorption of $\pi^+$ changes $n$ to $p$ (i.e. $\pi^+ \sim p\bar{n}$). 
Actually there are four combinations to change isospin components ($p$, $n$) to ($p$, $n$), 
but the combination $(p\bar{p} + n\bar{n})/\sqrt{2}$ is totally neutral (isospin singlet) and does not 
count. Therefore, the number of force carriers is $2^2 - 1 = 3$. In mathematical 
language, the pion belongs to the regular representation of the $SU(2)$ group. In 
correspondence to the isospin exchange Hamiltonian, one that generates the color 
exchange interaction can be expressed as 

$$H = g_s\sum_{i=1}^{8} \bar{\psi}\,\frac{\lambda_i}{2}\,\psi\,A_i, \qquad \psi = \begin{pmatrix} R \\ G \\ B \end{pmatrix} \tag{14.80}$$

$\lambda_i$ is the Gell-Mann matrix [Eq. (14.8)] and is the regular representation of $SU(3)$. 
$A_i$ is the force field, referred to as the gluon. There are $3^2 - 1 = 8$ gluons ($A_i$). The 
absorption or emission of the gluon changes the color charge or the color state of 
the quark ($R \leftrightarrow G \leftrightarrow B \leftrightarrow R$), just as the pion changes $p \leftrightarrow n$. The gluon is an 
eight-component vector in color space, just as the pion is an isovector in isospin 
space. 
Unlike the pion, the gluon is a Lorentz vector, like the photon, and its spatial 
part induces the magnetic interaction. This is the origin of color and spin exchange 
forces, which will be described shortly. 

12) At long range, the confinement force becomes effective, which is a nonlinear effect of the color 
charge. 

14.6.1 

Color Independence 

We show the color independence of the quark-gluon coupling strength just 
like the charge independence of the pion-nucleon coupling (recall arguments 
in Sect. 13.3.1). 

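We first reformulate the gluon fields to grasp an intuitive image of the gluon's 
role. Just as we used ($\pi^+$, $\pi^0$, $\pi^-$) instead of abstract ($\pi_1$, $\pi_2$, $\pi_3$), we define gluons, 
$g_i$'s ($i = 1, \ldots, 8$), as follows. Omitting the space-time part of the field subscripts 
for simplicity, we write down the Gell-Mann matrices explicitly: 

$$\mathcal{L}_{\rm QCD} = g_s\sum_i \bar{\psi}\,(\lambda_i/2)\,\psi\,A_i = \frac{g_s}{\sqrt{2}}\Big[\bar{R}G\,g_1 + \bar{G}R\,g_2 + \frac{1}{\sqrt{2}}(\bar{R}R - \bar{G}G)\,g_3 + \bar{R}B\,g_4 + \bar{B}R\,g_5 + \bar{G}B\,g_6 + \bar{B}G\,g_7 + \frac{1}{\sqrt{6}}(\bar{R}R + \bar{G}G - 2\bar{B}B)\,g_8\Big] \tag{14.81a}$$

$$g_1 = \tfrac{1}{\sqrt{2}}(A_1 - iA_2),\quad g_2 = \tfrac{1}{\sqrt{2}}(A_1 + iA_2),\quad g_3 = A_3,\quad g_4 = \tfrac{1}{\sqrt{2}}(A_4 - iA_5),\quad g_5 = \tfrac{1}{\sqrt{2}}(A_4 + iA_5),\quad g_6 = \tfrac{1}{\sqrt{2}}(A_6 - iA_7),\quad g_7 = \tfrac{1}{\sqrt{2}}(A_6 + iA_7),\quad g_8 = A_8 \tag{14.81b}$$

The $g_a$'s are redefined gluon fields. Equation (14.81a) means, for instance, that $g_1$ has 
the ability to change color $G \to R$ if absorbed by $G$; in other words, as far as the 
color degrees of freedom are concerned, it plays an equivalent role to $R\bar{G}$. This is 
analogous to $\pi^+$ being ~ $p\bar{n}$. 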

Let us calculate the strength of the force that acts between two quarks using the 
Lagrangian in Eq. (14.81a). First of all, between $RR$ we can draw two Feynman 
diagrams, shown in Fig. 14.27a,b. Here, at the vertices $R$ is annihilated and recreated 
(i.e. $R \to R \sim \bar{R}R$). Among the gluons in Eq. (14.81b), one sees that only $g_3$ and $g_8$ 
couple to $\bar{R}R$. The strength of the $g_3$, $g_8$ exchange is given by 

$$g'^2\left[\left(\frac{1}{\sqrt{2}}\right)^2 + \left(\frac{1}{\sqrt{6}}\right)^2\right] = \frac{2}{3}\,g'^2 \tag{14.82}$$

where $g' = g_s/\sqrt{2}$. Next we calculate the strength of the $BB$ force (Fig. 14.27c). 
Looking at Eq. (14.81a), we see only $g_8$ couples to $\bar{B}B$, consequently 

$$g'^2\left(\frac{-2}{\sqrt{6}}\right)^2 = \frac{2}{3}\,g'^2 \tag{14.83}$$

This has the same strength as the $RR$ force. Considering the universality of the 
forces acting between $R$, $B$, $G$, it is logical to obtain the same strength. Next we 
consider the $RB$ force. First, in addition to the contribution (Fig. 14.27d) due to $g_8$, 
which couples to both $R$ and $B$, we have to consider the $RB$ exchange force due to 
$g_4$, $g_5$ (Fig. 14.27e). Combining the two 

$$g'^2\left[\left(\frac{1}{\sqrt{6}}\right)\left(\frac{-2}{\sqrt{6}}\right) + (1)\times(1)\right] = \frac{2}{3}\,g'^2 \tag{14.84}$$

one sees that the universality is maintained. Next, let us consider the force in a color 
singlet meson $(R\bar{R} + G\bar{G} + B\bar{B})/\sqrt{3}$. 

[Figure 14.27 Color independence, i.e. universality of the quark-gluon [...]. Surviving panel 
labels: $g_8 = \frac{1}{\sqrt{6}}(R\bar{R} + G\bar{G} - 2B\bar{B})$, $g_4 = R\bar{B}$, $g_5 = B\bar{R}$; panels (d), (e).] 







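[Figure 14.28 Gluon exchanges between $B\bar{B}$ in a color singlet m[...]. Surviving panel labels: 
(a) $g_8 = \frac{1}{\sqrt{6}}(R\bar{R} + G\bar{G} - 2B\bar{B})$, (b) $g_4 = R\bar{B}$, $g_5 = B\bar{R}$, and $g_6 = G\bar{B}$, $g_7 = B\bar{G}$.] 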



Since each force that acts on the three pairs is equal, we only consider an initial 
state $B\bar{B}$ and multiply by 3. But as the weight of each pair in the singlet meson is 
$(1/\sqrt{3})^2 = 1/3$, all we need to do is to calculate the $B\bar{B}$ contribution. Considering 
$B\bar{B}$ can be converted to $B\bar{B}$, $R\bar{R}$, $G\bar{G}$, there are three diagrams to consider 
(Fig. 14.28). As the gluon is a vector and changes its sign for the antiparticle, we 
have three contributions, one from each diagram: 

$$g'^2\left[\left(\frac{-2}{\sqrt{6}}\right)\left(\frac{2}{\sqrt{6}}\right) + (1)(-1) + (1)(-1)\right] = -\frac{8}{3}\,g'^2 = -\frac{4}{3}\,g_s^2 \tag{14.85}$$

The formula tells us that there is an attractive force between the quarks forming 
a singlet meson. This is the strength of the color force that acts between $q$ and $\bar{q}$, 
corresponding to that of the Coulomb force in QED. Writing $\alpha_s = g_s^2/4\pi$, we have 

$$\frac{4}{3}\,\alpha_s\ (\text{color force in QCD}) \leftrightarrow \alpha\ (\text{Coulomb force in QED}) \tag{14.86}$$

Now we have reproduced the coefficient of the Coulomb-like force that was introduced 
in Eq. (14.60) to explain the charmonium spectrum. 
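The color counting above can be cross-checked by summing products of Gell-Mann matrix elements directly. The following is a sketch under stated assumptions: the standard Gell-Mann matrices with quark generators $\lambda_A/2$, antiquark generators $-\lambda_A^*/2$, and results quoted in units of $g_s^2$ (multiply by 2 to compare with the $g'^2$ units used in the text). The helper names are made up for this example.

```python
import math

S3 = 1.0 / math.sqrt(3.0)
# The eight Gell-Mann matrices, color order (R, G, B)
LAMBDA = [
    [[0, 1, 0], [1, 0, 0], [0, 0, 0]],
    [[0, -1j, 0], [1j, 0, 0], [0, 0, 0]],
    [[1, 0, 0], [0, -1, 0], [0, 0, 0]],
    [[0, 0, 1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, -1j], [0, 0, 0], [1j, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, 1, 0]],
    [[0, 0, 0], [0, 0, -1j], [0, 1j, 0]],
    [[S3, 0, 0], [0, S3, 0], [0, 0, -2 * S3]],
]
R, G, B = 0, 1, 2


def qq_factor(out1, out2, in1, in2):
    """sum_A (lambda_A/2)[out1][in1] * (lambda_A/2)[out2][in2], in units of g_s^2."""
    total = sum(lam[out1][in1] * lam[out2][in2] for lam in LAMBDA) / 4
    return complex(total).real


def singlet_meson_factor():
    """Expectation of F1.F2 in (R Rbar + G Gbar + B Bbar)/sqrt(3), antiquark generators -lambda*/2."""
    total = 0
    for lam in LAMBDA:
        for i in range(3):
            for j in range(3):
                total += lam[j][i] * (-lam[j][i].conjugate())
    return complex(total).real / 4 / 3


rr = qq_factor(R, R, R, R)
bb = qq_factor(B, B, B, B)
rb = qq_factor(R, B, R, B) + qq_factor(B, R, R, B)   # direct (g8) plus exchange (g4, g5)
singlet = singlet_meson_factor()

print(f"RR: {2 * rr:.4f} g'^2, BB: {2 * bb:.4f} g'^2, RB: {2 * rb:.4f} g'^2")
print(f"singlet q qbar: {singlet:.4f} g_s^2")
```

All three quark-quark combinations give $2/3$ in units of $g'^2$, and the singlet meson gives $-4/3$ in units of $g_s^2$, matching Eqs. (14.82) to (14.85).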

14.6.2 

Color Exchange Force 

The calculations we have done are intuitive and easy to understand but rather 
complicated. In general, we rely on group theory. The force acting on $qq$ or $q\bar{q}$ can be 
expressed as a potential. Expressing it in scattering matrix form, we have 

$$M(c + c' \to b + b') \sim g_s^2\sum_A \langle bb'|\,[\bar{\psi}_b(\lambda_A/2)_{bc}\psi_c]\,[\bar{\psi}_{b'}(\lambda_A/2)_{b'c'}\psi_{c'}]\,|cc'\rangle \sim g_s^2\sum_A \langle bb'|F_{1A}F_{2A}|cc'\rangle \tag{14.87}$$

where we have suppressed all irrelevant factors except color indices. $F_{1A}$, $F_{2A}$ are 
representation matrices of $SU(3)$ generators that act on particles 1 and 2. If we take 
the sum over color indices $A$, Eq. (14.87) reduces to 

$$\Big\langle \sum_A F_{1A}F_{2A} \Big\rangle = \langle F_1 \cdot F_2 \rangle = \frac{1}{2}\big\langle (F_1 + F_2)^2 - F_1^2 - F_2^2 \big\rangle = \frac{1}{2}\big\langle F^2 - F_1^2 - F_2^2 \big\rangle \tag{14.88}$$

which is expressed by the Casimir operator of $SU(3)$ [see Eq. (G.50) in Appendix G]. This is the 
$SU(3)$ extension of the isospin exchange force $t_1 \cdot t_2$ [see Eq. (13.27)]. If we assume 
that three-body forces are the sum of two-body forces, the sign and strength of its 
potential are proportional to 

$$C_F = \sum_{i<j} \langle F_i \cdot F_j \rangle = \frac{1}{2}\Big\langle F^2 - \sum_i F_i^2 \Big\rangle, \qquad F = \sum_i F_i \tag{14.89}$$

Values of $C_F$ [refer to Eq. (G.51)] are given in Table 14.6. By surveying the table, one 
notices that a strong attractive force acts on color singlets and qualitatively 
reproduces the feature that only color singlets are observed in nature. The force acting 
on $qq[3^*]$ has to be attractive, being a part of $qqq$ and making a singlet by coupling 
with $q[3]$. The coupling among $qqqq[3]$ is the same as for $qqq[1]$, meaning that 
attaching an extra $q$ to $qqq$ does not give a stronger attraction. Values of the force 
strength given in Table 14.6 are based on one-gluon exchange only. They cannot 
be considered as quantitatively correct as they do not take into account nonlinear 
couplings, which are special features of QCD. Nevertheless we are pleased to see 
that they give qualitatively correct results. 
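Equation (14.89) reduces the color factor to quadratic Casimir values, so it can be tabulated mechanically. The sketch below assumes the standard $SU(3)$ Casimir $C(p,q) = (p^2 + q^2 + pq + 3p + 3q)/3$ for the representation with Dynkin labels $(p, q)$; the selection of configurations is illustrative and is not claimed to reproduce the exact rows of Table 14.6.

```python
from fractions import Fraction


def casimir(p, q):
    """Quadratic Casimir of the SU(3) representation with Dynkin labels (p, q)."""
    return Fraction(p * p + q * q + p * q + 3 * p + 3 * q, 3)


def color_factor(total_rep, n_constituents):
    """C_F = (1/2) * (C(total) - n * C(3)); quarks and antiquarks both carry C = 4/3."""
    return (casimir(*total_rep) - n_constituents * casimir(1, 0)) / 2


CONFIGS = {
    "q qbar [1]": ((0, 0), 2),
    "q qbar [8]": ((1, 1), 2),
    "qq [3*]": ((0, 1), 2),
    "qq [6]": ((2, 0), 2),
    "qqq [1]": ((0, 0), 3),
    "qqq [8]": ((1, 1), 3),
    "qqq [10]": ((3, 0), 3),
    "qqqq [3]": ((1, 0), 4),
}

factors = {name: color_factor(rep, n) for name, (rep, n) in CONFIGS.items()}
for name, value in factors.items():
    print(f"{name:12s} C_F = {value}")
```

Negative values mean attraction. The singlets come out most attractive, $qq[3^*]$ is attractive, and $qqqq[3]$ equals $qqq[1]$, in line with the remarks in the text.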

14.6.3 

Spin Exchange Force 

Before we go into the details of the color force, we review the space-time structure 
of the force. From what we have studied so far, the color force acting on the quarks 
is like the Coulomb force, at least at distances comparable with the hadron size. 
This suggests that we may acquire some insight into the level structure by studying 
that of the hydrogen atom, which is a bound state as a result of the Coulomb force 
(Fig. 4.1). The levels are obtained by solving the Dirac equation, but the principal 
part can be obtained from the Schrödinger equation with a central force $\alpha/r$ to give 

$$E_n = -\frac{\alpha^2 \mu}{2 n^2}, \qquad n = 1, 2, \ldots \tag{14.90}$$

where $\mu$ is reduced mass. For hydrogen $m_1 = m_p \gg m_2 = m_e$ and $\mu \simeq m_e$. 

The solutions of the Schrödinger equation having the same principal quantum 
number $n$ but different $l$ and spin have the same energy, for instance (2S, 2P), 
(3S, 3P, 3D), ... are degenerate. Solving the Dirac equation, which takes into 
account spin-orbital angular momentum interaction (LS force) and spin-spin 
interaction (SS force), we find the levels split. Splitting due to the LS force is called 
the fine structure, and that due to spin-spin coupling the hyperfine structure. The 
LS force arises from the magnetic interaction between the magnetic moment $\mu_L$ 
produced by the circulating electron and the electron spin magnetic moment $\mu_e$. 
The Hamiltonian is given by 

$$H_{LS} \sim -\boldsymbol{\mu}_L \cdot \boldsymbol{\mu}_e \sim \left(\frac{e}{2m_e}\boldsymbol{L}\right)\cdot\left(\frac{e}{2m_e}\boldsymbol{s}\right) \sim \frac{\alpha}{m_e^2}\,\boldsymbol{L}\cdot\boldsymbol{s} \tag{14.91}$$

The hyperfine structure arises from the spin-spin interaction, which is a magnetic 
interaction produced by spin magnetic moments of the electron and proton: 

$$H_{SS} \sim -\boldsymbol{\mu}_e \cdot \boldsymbol{\mu}_p \sim \left(\frac{e}{2m_e}\boldsymbol{s}_e\right)\cdot\left(\frac{e}{2m_p}\boldsymbol{s}_p\right) \sim \frac{\alpha}{m_e m_p}\,\boldsymbol{s}_e\cdot\boldsymbol{s}_p \tag{14.92}$$

The interactions exchange the spin component of the interacting particle and are 
called spin exchange forces. As is clear from Eqs. (14.91) and (14.92), the different 
magnitude of the level splitting between the fine and hyperfine structures stems 
from the different masses of the electron and proton. In fact, the positronium 
shows the same structure as the hydrogen atom, but $m_p$ is replaced by $m_e$, thus 
the LS and SS forces generate approximately the same magnitude of splitting. 
Calculation of the hyperfine splitting using quantum mechanics gives (see for 
instance [63]) 

$$E_{SS} = \frac{8\pi\alpha}{3 m_i m_j}\,|\psi(0)|^2\,\boldsymbol{s}_i\cdot\boldsymbol{s}_j \tag{14.93}$$

$\psi(0)$ is the value of the wave function at the origin. Since 

$$\boldsymbol{s}_i\cdot\boldsymbol{s}_j = \frac{1}{2}\left[(\boldsymbol{s}_i + \boldsymbol{s}_j)^2 - \boldsymbol{s}_i^2 - \boldsymbol{s}_j^2\right] = \frac{1}{2}\left[s(s+1) - s_i(s_i+1) - s_j(s_j+1)\right] \tag{14.94}$$

where $s$ is the total spin, the spin exchange force produces the energy difference 
between different $s$, hence the hyperfine splitting. 

Since the gluon is a Lorentz vector just like the photon, we expect the same kind 
of forces to be generated by the gluon. That is to say, the spin-independent color 
electrostatic force is expected to generate the principal level splitting differentiated 
by $n$ (= 1, 2, ...) and the color magnetic force is expected to generate the fine 
structure, which is different depending on the value of $l$ and $s$ (not to be confused with 
the strangeness). The LS force is absent for $l = 0$ states. 

[Table 14.6 Color factor $\sum \langle F_i \cdot F_j \rangle$. Columns: Configuration, Multiplets, Strength. Table body lost.] 

14.6.4 

Mass Formulae of Hadrons 

Now we extend the level splitting force to include the color exchange. When the 
$SU(3)$ operator is written as $F\,(= F_1, F_2, \ldots, F_8)$, the gluon exchange includes both 
color and the (real) spin exchange force. The Hamiltonian of the two-quark 
interaction is expected to have the form 

$$H_{\rm color} \sim (F_a \cdot F_b)(s_a \cdot s_b), \qquad F_a \cdot F_b = \sum_{i=1}^{8} F_{ai}F_{bi}, \quad s_a \cdot s_b = \sum_{j=1}^{3} s_{aj}s_{bj} \tag{14.95}$$

Comparing this with the interaction of the hydrogen atom, we may interpret it as 
the color magnetic moment interaction that is generated by three color charges. 
Considering the mass and color charge strength, we see the Hamiltonian will have 
the form 

$$H = H_0 - A\sum_{a<b}\frac{\alpha_s}{m_a m_b}\,(F_a \cdot F_b)(s_a \cdot s_b) \tag{14.96}$$

where $m_a$, $m_b$ are the quark masses and $\alpha_s = g_s^2/4\pi$ is the color coupling 
constant. In the above expression, we considered only low lying states with orbital 
angular momentum $l = 0$, hence there is no LS force and we are dealing with 
hyperfine splittings. $H_0$ generates the confining force, which has principal quantum 
numbers. The color exchange term $(F_a \cdot F_b)$ was evaluated in Table 14.6. Since all 
the hadrons are assumed to be color singlets, constituent quarks in the meson on 
which the operator $(F_a \cdot F_b)$ acts are in a singlet state. For baryons, it acts on a pair 
out of three quarks, but as the pair forms a singlet with the remaining $q$ (3), the 
quark pair belongs to $3^*$. We obtain 

$$\langle F_a \cdot F_b \rangle = \begin{cases} -4/3 & \text{meson} \\ -2/3 & \text{baryon} \end{cases} \tag{14.97}$$

If the quark masses are degenerate ($m_u = m_d = m_s$), we can sum over spin 
components. For mesons 

$$\text{meson:}\quad \langle s_a \cdot s_b \rangle = \begin{cases} -3/4 & 0^- \\ \phantom{-}1/4 & 1^- \end{cases} \tag{14.98a}$$

$$\text{baryon:}\quad \sum_{a<b} \langle s_a \cdot s_b \rangle = \begin{cases} -3/4 & (1/2)^+ \\ \phantom{-}3/4 & (3/2)^+ \end{cases} \tag{14.98b}$$

Therefore, the mass value differs depending on the spin. The above calculation 
gives $m(1^-) > m(0^-)$, $m(3/2) > m(1/2)$ and the relative separation is given by 

$$m(1^-) - m(0^-) > m(3/2) - m(1/2) \tag{14.99}$$

which is consistent with observations. If there is no color exchange, the levels for 
mesons or baryons are inverted, contradicting the observations. 
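The spin sums of Eqs. (14.98) and the ordering in Eq. (14.99) follow from $s(s+1)$ bookkeeping. The snippet below is a sketch in units of $A\alpha_s/m^2$ with degenerate quark masses; the function names are illustrative, and the "no color" case simply replaces the color factor by $+1$ as a stand-in for a like-charge magnetic interaction.

```python
from fractions import Fraction


def spin_pair_sum(total_spin, n_quarks):
    """sum over pairs of <s_a.s_b> = (1/2) * [S(S+1) - n * 3/4] for n spin-1/2 quarks."""
    s = Fraction(total_spin)
    return (s * (s + 1) - n_quarks * Fraction(3, 4)) / 2


def mass_shift(color_factor, spin_sum):
    """Hyperfine shift -<F_a.F_b> * sum<s_a.s_b>, in units of A * alpha_s / m^2."""
    return -Fraction(color_factor) * spin_sum


MESON_COLOR = Fraction(-4, 3)
BARYON_COLOR = Fraction(-2, 3)

meson_split = (mass_shift(MESON_COLOR, spin_pair_sum(1, 2))
               - mass_shift(MESON_COLOR, spin_pair_sum(0, 2)))
baryon_split = (mass_shift(BARYON_COLOR, spin_pair_sum(Fraction(3, 2), 3))
                - mass_shift(BARYON_COLOR, spin_pair_sum(Fraction(1, 2), 3)))
no_color_split = (mass_shift(1, spin_pair_sum(1, 2))
                  - mass_shift(1, spin_pair_sum(0, 2)))

print(f"m(1-) - m(0-)     = {meson_split}")
print(f"m(3/2) - m(1/2)   = {baryon_split}")
print(f"without color sign = {no_color_split}")
```

With the color factors of Eq. (14.97) both splittings are positive and the meson one is larger; with a positive factor in their place the ordering flips, which is the inversion mentioned in the text.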

Problem 14.14 
Derive Eqs. (14.98). 

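If we assume that the masses of hadrons and quarks are large compared to their 
binding energy (nonrelativistic approximation), the hadron mass in the first 
approximation [$H_0$ in Eq. (14.96)] can be expressed as a sum over quark masses. 
Then we have 

$$\text{baryon:}\quad m_B = m_1 + m_2 + m_3 + B\left[\frac{\langle s_1 \cdot s_2 \rangle}{m_1 m_2} + \frac{\langle s_2 \cdot s_3 \rangle}{m_2 m_3} + \frac{\langle s_3 \cdot s_1 \rangle}{m_3 m_1}\right] = \sum_i m_i + B\sum_{i<j}\frac{\langle s_i \cdot s_j \rangle}{m_i m_j} \tag{14.100a}$$

$$\text{meson:}\quad m_M = m_1 + m_2 + C\,\frac{\langle s_1 \cdot s_2 \rangle}{m_1 m_2} \tag{14.100b}$$

For baryons there are a $(1/2)^+$ octet and a $(3/2)^+$ decuplet and considering the 
isospin degeneracy ($m_u \simeq m_d$), there are eight levels. Here, as an example, we 
calculate the masses of $\Lambda(1/2)$, $\Sigma(1/2)$, $\Sigma^*(3/2)$, which contain an $s$ quark. We 
set 1 = $s$, 2, 3 = $u$, $d$, and $m_2 = m_3 = m_u = m_d$, $m_1 = m_s$. Note, the total 
wave function including both flavor and spin should be totally symmetric. Since 
the quarks 2 and 3 make $I = 1$ ($\Sigma$, $\Sigma^*$) and $I = 0$ ($\Lambda$), the spin wave functions 
must be $s_{23} = 1$ ($I = 1$) and $s_{23} = 0$ ($I = 0$), respectively. For 

$$\sum_{i<j}\frac{s_i \cdot s_j}{m_i m_j} = \frac{s_2 \cdot s_3}{m_u^2} + \frac{s_1 \cdot (s_2 + s_3)}{m_u m_s} \tag{14.101}$$

we obtain the following: 

$$2(s_2 \cdot s_3) = (s_2 + s_3)^2 - s_2^2 - s_3^2 = s_{23}(s_{23} + 1) - 3/2$$

$$\therefore\ \langle s_2 \cdot s_3 \rangle = \begin{cases} \phantom{-}1/4 & s_{23} = 1\ (\Sigma, \Sigma^*) \\ -3/4 & s_{23} = 0\ (\Lambda) \end{cases} \tag{14.102a}$$

$$s_1 \cdot (s_2 + s_3) = (1/2)\{s^2 - s_1^2 - (s_2 + s_3)^2\} = (1/2)\{s(s+1) - 3/4 - s_{23}(s_{23}+1)\}$$

$$s(s+1) = \begin{cases} 15/4 & s = 3/2\ (\Sigma^*) \\ 3/4 & s = 1/2\ (\Sigma, \Lambda) \end{cases} \qquad s_{23}(s_{23}+1) = \begin{cases} 2 & s_{23} = 1\ (\Sigma, \Sigma^*) \\ 0 & s_{23} = 0\ (\Lambda) \end{cases}$$

$$\therefore\ \langle s_1 \cdot (s_2 + s_3) \rangle = \begin{cases} 1/2 & (\Sigma^*) \\ -1 & (\Sigma) \\ 0 & (\Lambda) \end{cases} \tag{14.102b}$$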

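Inserting Eqs. (14.102) in Eqs. (14.101) and (14.100a) we obtain 

$$m(\Sigma^*) = 2m_u + m_s + B\left(\frac{1}{4m_u^2} + \frac{1}{2m_u m_s}\right) \tag{14.103a}$$

$$m(\Sigma) = 2m_u + m_s + B\left(\frac{1}{4m_u^2} - \frac{1}{m_u m_s}\right) \tag{14.103b}$$

$$m(\Lambda) = 2m_u + m_s + B\left(-\frac{3}{4m_u^2}\right) \tag{14.103c}$$

Other baryons can be calculated similarly to get 

$$m(\Delta) = 3m_u + B\left(\frac{3}{4m_u^2}\right) \tag{14.104a}$$

$$m(N) = 3m_u + B\left(-\frac{3}{4m_u^2}\right) \tag{14.104b}$$

$$m(\Xi^*) = m_u + 2m_s + B\left(\frac{1}{4m_s^2} + \frac{1}{2m_u m_s}\right) \tag{14.104c}$$

$$m(\Xi) = m_u + 2m_s + B\left(\frac{1}{4m_s^2} - \frac{1}{m_u m_s}\right) \tag{14.104d}$$

$$m(\Omega) = 3m_s + B\left(\frac{3}{4m_s^2}\right) \tag{14.104e}$$

There are eight equations with three adjustable parameters. The mass formulae 
for mesons can be calculated in a similar manner. We compare our results with 
observed values in Fig. 14.29 and Table 14.7. 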
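The mass formulae can be evaluated with the fit parameters quoted with Table 14.7 ($m_u = m_d = 363$ MeV, $m_s = 538$ MeV, $B = 200\,m_u^2$ MeV for baryons; $m_u = m_d = 310$ MeV, $m_s = 483$ MeV, $C = 640\,m_u^2$ MeV for mesons). The code below is a sketch: the function names and the dictionary layout are illustrative, and the meson assignments use the spin factors of Eq. (14.98a).

```python
def baryon_masses(m_u, m_s, b_coeff):
    """Baryon masses (MeV) from Eqs. (14.103) and (14.104), with B = b_coeff * m_u^2."""
    big_b = b_coeff * m_u**2
    uu = big_b / (4.0 * m_u**2)     # B / (4 m_u^2)
    ss = big_b / (4.0 * m_s**2)     # B / (4 m_s^2)
    us = big_b / (m_u * m_s)        # B / (m_u m_s)
    return {
        "N": 3 * m_u - 3 * uu,
        "Delta": 3 * m_u + 3 * uu,
        "Lambda": 2 * m_u + m_s - 3 * uu,
        "Sigma": 2 * m_u + m_s + uu - us,
        "Sigma*": 2 * m_u + m_s + uu + us / 2,
        "Xi": m_u + 2 * m_s + ss - us,
        "Xi*": m_u + 2 * m_s + ss + us / 2,
        "Omega": 3 * m_s + 3 * ss,
    }


def meson_mass(m1, m2, spin_factor, c_big):
    """Meson mass (MeV) from Eq. (14.100b); spin_factor is -3/4 (0-) or 1/4 (1-)."""
    return m1 + m2 + c_big * spin_factor / (m1 * m2)


baryons = baryon_masses(m_u=363.0, m_s=538.0, b_coeff=200.0)

MU, MS = 310.0, 483.0
C_BIG = 640.0 * MU**2
mesons = {
    "pi": meson_mass(MU, MU, -0.75, C_BIG),
    "K": meson_mass(MU, MS, -0.75, C_BIG),
    "rho": meson_mass(MU, MU, 0.25, C_BIG),
    "K*": meson_mass(MU, MS, 0.25, C_BIG),
    "phi": meson_mass(MS, MS, 0.25, C_BIG),
}

for name, mass in {**baryons, **mesons}.items():
    print(f"{name:7s} {mass:7.0f} MeV")
```

With these inputs the nucleon comes out at 939 MeV and the pion at 140 MeV, and the remaining states land within a few tens of MeV of the experimental meson masses that survive in Table 14.7.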
Figure 14.29 Levels of the baryon splitting due to spin-spin coupling [101]. (a) Levels due to 
quark mass splitting, (b) (c) spin-dependent force resolves the degeneracy. 

Table 14.7 Mass values calculated including color-color and spin-spin couplings [296]. 

Columns: Baryon, Theory, Exp., Meson, Theory, Exp. [Baryon entries and theory values lost.] 

Meson   Exp. (MeV) 
$\phi$       1020 
$K^*$      892 
$\omega$       783 
$\rho$       776 
$\eta$       549 
$K$       496 
$\pi$       138 

For baryons: $m_u = m_d = 363$ MeV, $m_s = 538$ MeV, $B = 200\,m_u^2$ MeV 

For mesons: $m_u = m_d = 310$ MeV, $m_s = 483$ MeV, $C = 640\,m_u^2$ MeV 



Problem 14.15 
Derive Eqs. (14.104). 

Problem 14.16 
Calculate meson masses using Eq. (14.100b). 



The quark masses determined from baryons and mesons may seem different, 
but considering the simple approximation we used, they are surprisingly close. 
Note that those determined from baryon magnetic moments [$m_u$ = 338 MeV,
$m_d$ = 322 MeV, $m_s$ = 510 MeV, see Eq. (14.57)] are in between. These mass
formulae agree with the Gell-Mann-Okubo mass formula extended to SU(6) [192]

$$ m = A + BY + C[I(I+1) - Y^2/4] + DS(S+1) \qquad (14.105) $$

to first order in power expansion of $\Delta m = m_s - m_u$.

Mass Formula Including Orbital Angular Momentum We have to include the spin- 
orbit coupling (LS force) to calculate the masses of states with orbital angular mo- 
mentum L > 0. A representative formula, given by [117], has the form 

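$$ H = L(r_1, r_2, r_3) + \sum_i\left(m_i + \frac{p_i^2}{2m_i}\right) - \frac{2}{3}\alpha_s\sum_{i>j} S_{ij} \qquad (14.106a) $$

$$ S_{ij} = \frac{1}{|r|} - \frac{1}{2m_i m_j}\left\{\frac{p_i\cdot p_j}{|r|} + \frac{(r\cdot p_i)(r\cdot p_j)}{|r|^3}\right\} - \frac{\pi}{2}\delta^3(r)\left(\frac{1}{m_i^2} + \frac{1}{m_j^2}\right) $$

$$ \qquad - \frac{1}{m_i m_j}\left[\frac{8\pi}{3}(s_i\cdot s_j)\,\delta^3(r) + \frac{1}{|r|^3}\{3(s_i\cdot\hat r)(s_j\cdot\hat r) - (s_i\cdot s_j)\}\right] $$

$$ \qquad - \frac{1}{2|r|^3}\left[\frac{1}{m_i^2}(l_i\cdot s_i) - \frac{1}{m_j^2}(l_j\cdot s_j) + \frac{2}{m_i m_j}\{(l_i\cdot s_j) - (l_j\cdot s_i)\}\right] + (\text{relativistic correction}) \qquad (14.106b) $$

where $\hat r = r/|r|$, $l = r \times p$. The first term of Eq. (14.106a) gives the confinement
potential; phenomenologically, a harmonic oscillator potential is often used.
The second term is the mass and kinetic energy term and the third term $S_{ij}$ is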




the "Breit interaction" (see for instance [58]) commonly used in nuclear physics.
The second term in the first line of the Breit interaction is referred to as the Darwin
term and the third term as the zitterbewegung; the second line represents the
spin-spin interaction and the third line represents the spin-orbit interaction.
Phenomenologically, however, the observed baryon masses and widths are known to be
reproduced when the spin-orbit interaction is neglected and only the spin-spin
interaction is included. In the previous calculation, we used the first term in the
spin-spin interaction. As an example, Fig. 14.30 [204, 214] shows a comparison of
the calculation with the observed baryon mass spectrum with L = 1 and negative parity.

With $\Lambda^*(1/2)^-$ as the only exception, the agreement is remarkable. The number
of parameters was five.

Figure 14.30 A comparison of the baryon mass prediction and experimental
the nonrelativistic quark model [214]. Hatched boxes are observed data of
and the black bars represent the prediction.

It is not well understood why a nonrelativistic potential can reproduce the
observed data so well.³⁾ However, the phenomenological success in hadron
spectroscopy can certainly be considered strong evidence for the validity of the quark
model and QCD.



3) Because of the asymptotic freedom, which will be explained in Chap. 18, the high-energy
phenomena of QCD are fairly well understood, but the low-energy part of QCD needs
nonperturbative approaches and remains a little-explored area to this day.



15 

Weak Interaction 



All hadrons and leptons are known to take part in the weak interaction, but its effect 
is felt only when the strong and electromagnetic interactions are forbidden by some 
selection rules. Since weak interaction phenomena were hard to reproduce experi- 
mentally, the progress in clarifying the weak process was slow at the beginning. But 
its very weakness made the perturbative approach effective and clear connections 
of theoretical ideas with observations were possible. Fermi's theory, which was pro- 
posed in 1934 and adapted to V-A interaction in 1957 to take parity violation into 
account, reproduced observed experimental data well. However, it is based on an 
effective Lagrangian valid only in first order and, lacking self-consistency, has no 
predictive power for higher order corrections. Nonetheless, its similarity with the 
electromagnetic interactions inspired the pursuit of a self-consistent mathematical 
framework modeled on QED - the only self-contained gauge theory at that time 
- and finally led to the unified theory of the electromagnetic and weak forces. 

To help comprehend the logical flow that is described in the following, we start 
from known basic principles of the weak interaction, which serve as background 
knowledge, then follow in the footsteps of our great leaders, because it is instructive 
to learn how they gained insight from observations. It is especially illuminating to 
understand the similarities to and the differences from the electromagnetic force 
at each step. The purpose is to learn heuristic methods as well as to study the facts 
and logic of elementary particle physics in the weak force world. 



15.1

Ingredients of the Weak Force

The weak force is generated by the "weak charge", just as the electromagnetic force 
is generated by the electric charge. This means there is a "weak static force" as 
well as a "weak magnetic force", the former produced by the static weak charge 
and the latter by its current, i.e. the moving charge. Correspondingly, the force 
carrier (denoted as W) is a four-vector and couples to a four-current $j_W^\mu$ (the weak
current, with $H_{WI} = g_W j_W^\mu W_\mu$) generated by the motion of the weak charge carrier
(i.e. quarks and leptons) just as the photon is a Lorentz vector that couples to the
electric current ($H_{EM} = e j^\mu A_\mu$). The coupling constant $g_W$ is the strength of the
weak charge, corresponding to the unit electric charge e.

The first major difference is that the weak charge comes in two kinds, to be
denoted temporarily as $Q_{W1}$, $Q_{W2}$, which we call the "weak isospin doublet". For
example, the electron-neutrino has $Q_{W1}$ and the electron has $Q_{W2}$, and together
they form a weak doublet $(\nu_e, e^-)$. The term "weak doublet" is used to distinguish
it from the isospin doublets that appeared in the strong interaction. The naming
is not accidental, though. The generators of the nuclear force, the proton and
neutron, constitute an isospin doublet (p, n) and change their identity by the nuclear
force action (referred to as the exchange force), because the force carriers (i.e. the
pions) are charged. Like the pion, the W boson also comes in three kinds and
constitutes a weak isospin triplet $(W^+, Z^0, W^-)$.¹⁾ Since the lepton is a fermion and
fermion number is known to be conserved, we assign lepton number 1 to all
leptons and −1 to all antileptons. The sum of the lepton numbers is conserved in the
weak as well as in other interactions. The action and reaction of the weak force
(absorption and emission of the W) change the identity of the weak charge
carriers. For instance, by emitting $W^+$ (or by absorbing $W^-$), $\nu_e$ changes into $e^-$. The
six known leptons are classified into three doublets. A notable feature of the weak
force is that it changes the lepton only into itself or into the other member of the
same doublet. Namely, the three doublets in the lepton family $(\nu_e, e^-)$, $(\nu_\mu, \mu^-)$,
$(\nu_\tau, \tau^-)$ are independent. They do not mix with each other, which is referred to as
lepton flavor conservation.²⁾

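Quark doublets: $\begin{pmatrix} u \\ d \end{pmatrix}$, $\begin{pmatrix} c \\ s \end{pmatrix}$, $\begin{pmatrix} t \\ b \end{pmatrix}$

Lepton doublets: $\begin{pmatrix} \nu_e \\ e^- \end{pmatrix}$, $\begin{pmatrix} \nu_\mu \\ \mu^- \end{pmatrix}$, $\begin{pmatrix} \nu_\tau \\ \tau^- \end{pmatrix}$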

The situation in the quark sector is more complicated. Its known six members also 
constitute three doublets in the weak interaction. However, mixing among them, 
known as CKM (Cabibbo-Kobayashi-Maskawa) mixing, occurs. 

The second major difference between the weak and electromagnetic forces is 
their reach. The force range r is determined by the Compton wavelength of the 
force carrier. Since the masses of $W^\pm$ and $Z^0$ are

$$ M_W = 80.40 \pm 0.025\ \text{GeV}, \qquad M_Z = 91.186 \pm 0.002\ \text{GeV} \qquad (15.1) $$

$r$ is $\sim 2 \times 10^{-18}$ m. For practically all reactions we deal with in this chapter, the
energy in a process is much smaller than $m_W$. This means that the mass of W, Z
can be regarded as infinite or equivalently the force range as zero. Namely, the
weak force is effective only when the participating particles are in direct contact
with each other.

1) The reason for denoting the neutral member of the W family not as $W^0$ but as $Z^0$ is
because it is a mixture of $W^0$ with the photon, changing its identity and at the same
time acquiring a different mass as well as coupling strength from its charged partners.
But in this chapter the distinction of $W^0$ from $Z^0$ does not matter.

2) The neutrino oscillation, discovered in 1998 (to be discussed in Sect. 19.2.1), mixes them,
but mixing among the charged leptons has not been observed. In this book we assume
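The quoted range of about 2 × 10^-18 m can be checked from the reduced Compton wavelength, r = ħ/(Mc). The sketch below assumes ħc = 197.327 MeV fm (a standard value not given in the text); the helper name is illustrative.

```python
# Range of a force from the reduced Compton wavelength of its carrier,
# r = hbar / (M c) = (hbar c) / (M c^2).
HBAR_C_MEV_FM = 197.327  # hbar*c in MeV*fm (assumed standard value)


def force_range_m(mass_gev):
    """Return hbar/(M c) in metres for a carrier of mass `mass_gev` (GeV)."""
    range_fm = HBAR_C_MEV_FM / (mass_gev * 1.0e3)  # MeV*fm / MeV -> fm
    return range_fm * 1.0e-15                      # fm -> m


r_w = force_range_m(80.40)     # W boson, Eq. (15.1)
r_z = force_range_m(91.186)    # Z boson, Eq. (15.1)
r_pi = force_range_m(0.1396)   # charged pion, for comparison with the nuclear force

if __name__ == "__main__":
    print(f"W range:    {r_w:.2e} m")
    print(f"Z range:    {r_z:.2e} m")
    print(f"pion range: {r_pi:.2e} m")
```

The W range comes out roughly three orders of magnitude shorter than the pion-exchange range of about 1.4 fm, which is why the contact approximation works at MeV energies.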



15.2 

Fermi Theory 

15.2.1 
Beta Decay 

Neutrino Hypothesis The weak interaction was first discovered in nuclear beta (β)
decays back in 1896 by Becquerel. In alpha (α) and gamma (γ) decays discovered at
the same time, the emitted particles had a monochromatic energy spectrum, which
agreed with the difference of the energy between the parent and the daughter
nuclei. However, in β decay, the energy of the emitted β rays, i.e. of the electrons,
was continuous and there was no sign of other particles. Namely, the law of
conservation of energy seemed to be violated. Bohr was willing to abandon the law
of conservation of energy, but Pauli proposed in 1930 that a yet unknown
particle (neutrino) carries away the missing energy. In 1934, taking into account Pauli's
neutrino hypothesis, Fermi proposed the first weak interaction theory to explain
the phenomenon. Its formalism was made in analogy with the electromagnetic
interaction. The gamma decay process, in which a proton in a nucleus emits a photon
(Fig. 15.1a), can be described using an interaction Lagrangian

$$ \mathcal{L}_{EM} = -q j^\mu A_\mu = -e\,\bar p\gamma^\mu p\,A_\mu \qquad (15.2) $$

Here, $p$, $\bar p$, $A_\mu$ are field operators of the proton and the photon. Namely, in the
quantum field theoretical interpretation, the operator $p$ annihilates a proton and
$A_\mu$ creates a photon and at the same time $\bar p$ recreates a proton. The β decay is a
process in which a neutron in the nucleus is converted into a proton and at the
same time emits an electron-neutrino pair instead of the photon (Fig. 15.1b):⁴⁾

$$ n \to p + e^- + \bar\nu_e \qquad (15.4) $$

In terms of the field operators, the process can be described by a four-fermion
interaction Lagrangian

$$ \mathcal{L}_\beta = -G_\beta\, j_h^\mu j_{l\mu} = -G_\beta(\bar p\gamma^\mu n)(\bar e\gamma_\mu\nu) + \text{h.c.} \qquad (15.5) $$

3) The proton is not a Dirac particle, as it has have to be included from the ph 

a large anomalous magnetic moment and is logical viewpoint. Equation (15.2) 

with the anomalous magnetic moment 4) It is actual' 

fi p = g p e/2m p terminoloj 

X = &f^ ^-°" r V„ Ay ~ S V A„) (15.3) 
Figure 15.1 The weak interaction was conceived in analogy with the electromagnetic
interaction. (a) A field theoretical illustration of a photon emission. (b) Fermi interaction:
Instead of γ, an $(e^-\bar\nu_e)$ pair is emitted. (c) The β decay in the Standard Model. A d quark
in the neutron is converted to a u quark with the emission of $e^-\bar\nu_e$. Note the $W^\pm$
couples only to left-handed particles, hence the subscript L.

where h.c. stands for the hermitian conjugate and is added to make the Lagrangian
hermitian. We omit the subscript e from ν in the four-fermion interaction until
the difference between $\nu_e$ and $\nu_\mu$ is discovered. This was the first application of
Heisenberg and Pauli's quantized field theory. In the case of photon emissions,
quantum field theory could be considered as a reformulation of known formalism,
but in the β decay the electron and the neutrino were literally created inside the
nucleus. $G_\beta$ is called the Fermi coupling constant and is a measure of the weak
force strength. The Lagrangian is a product of two currents, hadronic and
leptonic, and is called current-current coupling. To distinguish them from the
electromagnetic current, they are called weak currents. The subscripts h, l attached to the
currents are abbreviations of hadronic or leptonic. Unlike the electromagnetic
current, which does not change the identity of particles, the weak currents change the
electric charge [$n \to p$ and $\nu \to e^-$ or (vacuum $\to e^-\bar\nu$)] and are called charged
currents. In this sense the electromagnetic current is a kind of neutral current.
In the Standard Model, the charged current couples to $W^\pm$:

$$ \mathcal{L}_{WI} = g_W\left[j_+^\mu W_\mu^+ + j_-^\mu W_\mu^-\right] + \text{h.c.} $$

$$ \qquad = g_W\left[(\bar u_L\gamma^\mu d_L)W_\mu^+ + (\bar e_L\gamma^\mu\nu_{eL})W_\mu^-\right] + \text{h.c.} \qquad (15.6) $$

Here, $W_\mu^{\pm}$ is a field operator that annihilates (creates) $W^{+(-)}$ bosons. Beta decay in
the Standard Model is the two-step process depicted in Fig. 15.1c, whose effective
Lagrangian can be expressed as [see for instance Eq. (7.12)]

$$ \mathcal{L}_{SM,\,\text{2nd order}} = g_W^2\,(\bar u_L\gamma^\mu d_L)\,\frac{1}{q^2 - m_W^2}\,(\bar e_L\gamma_\mu\nu_{eL}) \qquad (15.7) $$

where $q^2 = (p_d - p_u)^2$ is the momentum transfer squared. Since the energy
difference of the nucleus before and after the decay is a few MeV and can be neglected
compared to $m_W$ = 80 GeV, the Lagrangian can be rewritten as

$$ \mathcal{L}_{SM,\,\text{2nd order}} \approx -G_\beta\,(\bar u_L\gamma^\mu d_L)(\bar e_L\gamma_\mu\nu_{eL}), \qquad G_\beta = \frac{g_W^2}{m_W^2} \qquad (15.8) $$

which essentially reproduces the Fermi interaction, except that only left-handed 
particles participate in the Standard Model, as is indicated by subscript L to the 
field operators. The left-handed nature of the weak interaction will be explained 
later in connection with parity violation. The Lagrangian of this type is called the 
four-fermion interaction. 

Allowed Transitions To obtain the rate of β decay, it is necessary to calculate the
transition amplitude from an initial to a final state:

$$ T_{fi} = G_\beta\int d^4x\,\langle p|\bar p(x)\gamma^\mu n(x)|n\rangle\,\langle e\bar\nu|\bar e(x)\gamma_\mu\nu(x)|0\rangle \qquad (15.9) $$

For a while, we follow the historical development, treating the proton and neutron
as elementary particles. If we consider them as represented by plane waves, the
nucleonic part of Eq. (15.9) is

$$ \langle p|j_h^\mu(x)|n\rangle \sim \bar u(p')\gamma^\mu u(p)\,e^{-i(p-p')\cdot x} \qquad (15.10) $$

where $p$ and $p'$ are the four-momenta of the neutron and the proton. As the Q value
(energy difference before and after the reaction) of the β decay is at most a few MeV
and overwhelmingly small compared to the nuclear mass, the recoil of the nucleus
can be neglected. Referring to Table 4.2, only the time component survives in the
nonrelativistic limit and is proportional to 1:

$$ \int d^4x\,\langle p|\bar p(x)\gamma^\mu n(x)|n\rangle \sim \delta_{\mu0}\,u^\dagger(p')u(p)\,\delta^4(p'-p) = \delta_{\mu0}\langle1\rangle \qquad (15.11) $$

In practice, the proton and neutron are in a nucleus. The expectation value in
Eq. (15.11) should be calculated not by using the free nucleon wave functions but
by using initial and final wave functions of the nucleus. As the operation includes
a change of a neutron to a proton, the operator is not exactly 1 but $1\cdot\tau_+$, where
$\tau_\pm = (\tau_x \pm i\tau_y)/2$ is the raising (lowering) operator of the isospin.

As for the electron and neutrino, they must be treated relativistically because
their mass is much smaller than typical β-decay energies. Namely, the leptonic
current is

$$ \langle e\bar\nu|j_l^\mu(x)|0\rangle \sim \bar u(p_e)\gamma^\mu v(p_\nu)\,e^{i(p_e+p_\nu)\cdot x} \qquad (15.12) $$

Therefore, the transition matrix element becomes, after integration over the time 
variable, 

$$ T_{fi} = 2\pi\delta(\Delta - E_e - E_\nu)\,G_\beta\int d^3x\,\Phi_F^*(x)\Phi_I(x)\,e^{-i(p_e+p_\nu)\cdot x} \times \left[\bar u(p_e)\gamma^0 v(p_\nu)\right] \qquad (15.13) $$

$$ \Delta = M(Z,A) - M(Z+1,A) $$

where $\Phi_I$, $\Phi_F$ are wave functions of the nucleus before and after the decay,
respectively, and $\Delta$ is the mass difference of the parent and daughter nucleus. As
$|p_e|, |p_\nu| \sim$ a few MeV, the de Broglie wavelength of the electron and neutrino is
$\sim 10^{-13}$ m, which is much larger than the nuclear size $\sim 10^{-14}$ m. This means
$(p_e + p_\nu)\cdot x \ll 1$ in the overlap region of the nuclear wave functions. Therefore
the higher powers in the expansion

$$ e^{-i(p_e+p_\nu)\cdot x} = 1 - i(p_e+p_\nu)\cdot x + \cdots \qquad (15.14) $$

can be neglected in the first approximation. The allowed processes under this
condition are called "allowed transitions". If some selection rule is effective and the
first term does not contribute, one needs to consider the higher order terms
(forbidden transition), but we do not discuss them here as they are more relevant for
nuclear physics. For the allowed transition, the nuclear part

$$ \langle1\rangle = \int d^3x\,\Phi_F^*(x)\Phi_I(x) \qquad (15.15) $$

is separated from the leptonic part and is a constant that does not include any
dynamical variables.

ft Value Calculating the decay rate from Eq. (15.13), we have



$$ d\Gamma = G_\beta^2|\langle1\rangle|^2\sum_{\text{spin}}L^{00}\;2\pi\delta(\Delta - E_e - E_\nu)\,\frac{d^3p_e}{(2\pi)^3 2E_e}\frac{d^3p_\nu}{(2\pi)^3 2E_\nu} \qquad (15.16a) $$

$$ \sum_{\text{spin}}L^{00} = \sum_{r,s}\bar u_r(p_e)\gamma^0 v_s(p_\nu)\,\bar v_s(p_\nu)\gamma^0 u_r(p_e) = \text{Tr}\left[\not{p}_e\gamma^0\not{p}_\nu\gamma^0\right] $$

$$ \qquad = 4\left[2p_e^0 p_\nu^0 - g^{00}(p_e\cdot p_\nu)\right] = 4E_eE_\nu(1 + v_e\cos\theta) \qquad (15.16b) $$

where $v_e = p_e/E_e$ is the electron velocity and $\theta$ is the angle between the electron
and the neutrino. $\sum_{\text{spin}}L^{\mu\nu}$ is given in Eq. (6.143). Integration over the phase space
using

$$ \int\delta(\Delta - E_e - E_\nu)\,p_\nu^2\,dp_\nu = p_\nu E_\nu = (\Delta - E_e)^2 \qquad (15.17) $$

gives

$$ d\Gamma = G_\beta^2|\langle1\rangle|^2(1 + v\cos\theta)\,\frac{1}{(2\pi)^5}\,pE(\Delta - E)^2\,dE\,d\Omega_e\,d\Omega_\nu \qquad (15.18) $$

Here, we have put $p = p_e$, $E = E_e$, $v = v_e$. From the expression we know that
the electron energy spectrum is wholly determined by the phase space volume. We
define

$$ f = \int F(Z,E)\,pE(\Delta - E)^2\,dE \qquad (15.19) $$

$F(Z, E)$ is a correction factor due to the electron Coulomb wave function and is a
known function [280]. When the electron is nonrelativistic it is approximated by



For light to medium elements $F \approx 1$, $f$ grows in proportion to $\sim\Delta^5$ and for a small
change of $\Delta$ (i.e. Q-value of the decay) the lifetime changes drastically from $10^{-8}$
to 100 s. If we use the half lifetime $t_{1/2} = \ln 2/\Gamma$ the total decay rate is written as

$$ \Gamma = \frac{G_\beta^2|\langle1\rangle|^2}{2\pi^3}\,f = \frac{\ln 2}{t_{1/2}} \qquad (15.21) $$

We define the ft value by

$$ ft_{1/2} = \frac{2\pi^3\ln 2}{G_\beta^2|\langle1\rangle|^2} \qquad (15.22) $$

As $f$ and $|\langle1\rangle|$ are calculable using the nuclear shell model and $t_{1/2}$ is an observable, we
can determine $G_\beta$.

Fermi and Gamow-Teller Transitions In allowed transitions, the orbital angular
momentum of the lepton pair relative to the nucleus vanishes and, if there is no
spin flip in the nuclear transition, the angular momentum transfer from the initial
to the final state of the nucleus is $\Delta J = 0$ with no parity change. This is called
the Fermi transition. In actual β decays, transitions with the angular momentum
transfer equal to one ($\Delta J = 0, \pm1$, but $0 \to 0$ is forbidden⁵⁾) with no parity change
(Gamow-Teller transition) have been observed, hence the original Lagrangian
proposed by Fermi was not sufficient to explain all the observed β decays.
Furthermore, parity violation was discovered in 1957 and its effect must also be taken into
account. The most general form of the four-fermion interaction Lagrangian for β
decay can be written as follows:

$$ \mathcal{L}_\beta = -G_\beta\sum_i\left[(\bar p\,\Gamma_i\,n)(C_i\,\bar e\,\Gamma_i\,\nu + C_i'\,\bar e\,\Gamma_i\gamma^5\nu) + \text{h.c.}\right] \qquad (15.23) $$

$$ \Gamma_i = 1\ (S),\quad \gamma^\mu\ (V),\quad \sigma^{\mu\nu}\ (T),\quad \gamma^\mu\gamma^5\ (A),\quad i\gamma^5\ (P) $$

where S, V, T, A and P represent scalar, vector, tensor, axial vector and pseudoscalar,
respectively. By setting $C_V = 1$ and $C_i = C_i' = 0$ for all others, Eq. (15.23) reduces
to the original Fermi interaction Lagrangian Eq. (15.5). For more general
arguments, interactions including derivatives have to be considered, but we omit them
as the inclusion of derivatives changes the energy spectrum and there is no sign of
such terms. Since $C_i$, $C_i'$ are complex numbers, there are 20 parameters in total.
Using the transformation property of each term (see Appendix F), one sees that
$C_i' = 0$ for P invariance, $C_i$ ($C_i'$) are real (pure imaginary) for C invariance and all
$C_i$, $C_i'$ must be real for T invariance.

5) See Problem 13.1 (3).

Problem 15.1*

Prove the above statement.

In the real world, P and C are maximally violated and T (or CP if CPT
invariance holds) invariance is almost true. In the following discussions, we assume T
invariance until proven otherwise and set all $C_i$, $C_i'$ as real, reducing the number
of parameters to 10. For a while we proceed assuming P, C invariance as well, and
set $C_i' = 0$. In this case, the number of parameters is 5. In the approximation
neglecting nuclear recoils, the transition matrix elements can be calculated using
Table 4.2, and we obtain

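$$ T_{fi} \sim G_\beta\Big[\langle1\rangle\{C_S\,\bar u(p_e)v(p_\nu) + C_V\,\bar u(p_e)\gamma^0 v(p_\nu)\} $$

$$ \qquad + \langle\sigma\rangle\cdot\{C_A\,\bar u(p_e)\gamma^0\sigma v(p_\nu) + C_T\,\bar u(p_e)\sigma v(p_\nu)\}\Big] \qquad (15.24a) $$

$$ \langle1\rangle = \int d^3x\,\Phi_F^*(x)\,\tau_+\,\Phi_I(x) \qquad (15.24b) $$

$$ \langle\sigma\rangle = \int d^3x\,\Phi_F^*(x)\,\sigma\tau_+\,\Phi_I(x) \qquad (15.24c) $$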

Resultant expressions for the decay rates have the same form as Eqs. (15.18) and
(15.21) but the matrix elements squared $|\langle1\rangle|^2$ and the angular correlation
coefficient 1 are replaced by [156, 280]

$$ |\langle1\rangle|^2 \to C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2 \qquad (15.25a) $$

$$ C_F^2 = |C_S|^2 + |C_V|^2 + \frac{2m_e}{E_e}C_SC_V \qquad (15.25b) $$

$$ C_{GT}^2 = |C_A|^2 + |C_T|^2 - \frac{2m_e}{E_e}C_AC_T \qquad (15.25c) $$

$$ 1 + v\cos\theta \to 1 + \lambda v\cos\theta $$

$$ \lambda = \frac{(-|C_S|^2 + |C_V|^2)|\langle1\rangle|^2 + (1/3)(-|C_A|^2 + |C_T|^2)|\langle\sigma\rangle|^2}{C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2} \qquad (15.25d) $$

If the parity-violating terms are included, all the coefficients are replaced by

$$ |C_i|^2 \to |C_i|^2 + |C_i'|^2 \qquad (15.26) $$

Later, $|C_i'| = |C_i|$ was clarified, and consequently Eqs. (15.25) are equally valid
after the discovery of parity violation if we use $G_\beta/\sqrt{2}$ instead of $G_\beta$. The reader
may have noticed that P-type interaction is missing. It does not contribute to the
allowed transitions. This is because P-type coupling is equivalent to derivative
coupling in the low-energy limit (see Table 4.2 and discussions in Sect. 13.6.1); the
matrix element is proportional to the recoil momentum of the nucleus, hence can
be neglected. When the interaction type is S or V, the selection rule is the same
as V, namely $\Delta J = 0$, $\Delta P = 0$, and is referred to as the Fermi transition. When
the interaction is A or T, the electron and the neutrino carry away the angular
momentum 1 generated by the spin flip nuclear transition. This is referred to as the
Gamow-Teller transition, in which $\Delta J = 0, \pm1$, $\Delta P = 0$.



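Prove that if $F(Z, E) \approx 1$, $E_e \gg m_e$, then $f = \Delta^5/30$ and the total decay rate
becomes

$$ \Gamma = \frac{G_\beta^2\Delta^5}{60\pi^3}\left(C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2\right) \qquad (15.27) $$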
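The coefficient 1/30 can be checked quickly. The sketch below assumes F = 1 and a massless electron (p = E), so that f is the integral of E^2 (Δ − E)^2 from 0 to Δ; the exact rational arithmetic and the midpoint-rule cross-check are illustrative helpers, not part of the text.

```python
from fractions import Fraction


def f_coefficient():
    """Coefficient c in f = c * Delta**5 for F = 1 and m_e = 0.

    Substituting E = x * Delta turns the integral into
    Delta**5 * int_0^1 (x**2 - 2*x**3 + x**4) dx.
    """
    return Fraction(1, 3) - Fraction(2, 4) + Fraction(1, 5)


def f_numeric(delta, n=20000):
    """Midpoint-rule estimate of int_0^delta E^2 (delta - E)^2 dE."""
    h = delta / n
    total = 0.0
    for i in range(n):
        e = (i + 0.5) * h
        total += e**2 * (delta - e)**2
    return total * h


if __name__ == "__main__":
    print("exact coefficient:", f_coefficient())
    print("numeric f(2.0):   ", f_numeric(2.0), "vs", 2.0**5 / 30)
```

Dividing the coefficient by the 2π^3 of the total rate gives the factor 1/(60π^3) in front of Δ^5.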



Determination of the Interaction Type When the initial and final states of nuclear
transitions are known, the mixing rate of the Fermi and Gamow-Teller transitions

$$ F = \frac{C_F^2|\langle1\rangle|^2}{C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2} \qquad (15.28) $$

is a function of $|C_{GT}/C_F|$. If the ft value is measured as a function of F, $|C_{GT}/C_F|$
can be determined [233]. The experimental values are given by

$$ G_\beta = (1.14730 \pm 0.00064)\times10^{-5}\ \text{GeV}^{-2} $$

$$ |C_{GT}/C_F| = 1.2601 \pm 0.0025 \qquad (15.29) $$

When one interaction type dominates, the angular correlation coefficient $\lambda$ is given
by

$$ \lambda = -1\ (S), \quad = +1\ (V), \quad = 1/3\ (T), \quad = -1/3\ (A) \qquad (15.30) $$

Measuring the angular coefficient $\lambda$ for various nuclei, and plotting it as a function
of mixing parameter F, one can determine which of the interactions are effective
(Fig. 15.2). Historically, the interaction type was determined this way. The result is

$$ C_S = C_T = 0, \qquad C_V = 1, \qquad C_A \approx -1.260\,C_V \qquad (15.31) $$

The sign of $C_A$ was determined by angular correlation of $(e\nu)$ and of the neutrino
with respect to the spin of free polarized neutrons [82, 83].
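The dependence of the electron-neutrino correlation on the interaction type can be sketched numerically. The function below assumes the form of the coefficient λ in Eq. (15.25d) with real couplings and with the interference terms proportional to m_e/E_e dropped (they vanish for a pure interaction type); the argument names are illustrative.

```python
def angular_correlation(c_s=0.0, c_v=0.0, c_t=0.0, c_a=0.0,
                        fermi_sq=1.0, gt_sq=0.0):
    """Electron-neutrino angular correlation coefficient lambda.

    fermi_sq and gt_sq stand for |<1>|^2 and |<sigma>|^2.
    Interference (Fierz) terms ~ m_e/E_e are neglected here.
    """
    numerator = ((-c_s**2 + c_v**2) * fermi_sq
                 + (-c_a**2 + c_t**2) * gt_sq / 3.0)
    denominator = (c_s**2 + c_v**2) * fermi_sq + (c_a**2 + c_t**2) * gt_sq
    return numerator / denominator


# Pure interaction types reproduce Eq. (15.30).
lam_s = angular_correlation(c_s=1.0)
lam_v = angular_correlation(c_v=1.0)
lam_t = angular_correlation(c_t=1.0, fermi_sq=0.0, gt_sq=1.0)
lam_a = angular_correlation(c_a=1.0, fermi_sq=0.0, gt_sq=1.0)

# Mixed V and A case with the couplings of Eq. (15.31) and the
# free-neutron matrix elements |<1>|^2 = 1, |<sigma>|^2 = 3.
lam_neutron = angular_correlation(c_v=1.0, c_a=-1.260, fermi_sq=1.0, gt_sq=3.0)

if __name__ == "__main__":
    print(lam_s, lam_v, lam_t, lam_a, lam_neutron)
```

For the neutron the V and A contributions nearly cancel, giving a small negative coefficient of about −0.1.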

Beta Decay of the Neutron When the initial and final states belong to the same
isospin multiplet, the values of $\langle1\rangle$, $\langle\sigma\rangle$ can be calculated without knowing the
nuclear wave functions. For the neutron β decay, $|p\rangle$, $|n\rangle$ are eigenstates of isospin
I = 1/2,

$$ \langle1\rangle^2 = 1, \qquad \langle\sigma\rangle^2 = 3 \qquad (15.32) $$

Figure 15.2 Electron-neutrino angular correlation coefficients plotted against the mixing
parameter F of the decays in the allowed transitions [280].

As $\Delta = m_n - m_p = 1.29$ MeV is the same order as the electron mass $m_e$ =
0.511 MeV, the expression for $\Gamma$ in Eq. (15.27) is not usable, but integration of $f$
and consideration of the weak magnetism, which will be explained later, gives a
value that agrees well with the experimental value ($\tau_n = 885.7 \pm 0.8$ s).
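A rough numerical version of this estimate is sketched below. It assumes F(Z, E) = 1 (no Coulomb correction), no weak magnetism or radiative corrections, Δ = 1.2933 MeV, ħ = 6.58212 × 10^-22 MeV s, and the rate Γ = G_β^2 f (1 + 3|C_GT/C_F|^2)/(2π^3); these inputs and the helper names are assumptions made for illustration, so only approximate agreement with the measured lifetime should be expected.

```python
import math

M_E = 0.510999               # electron mass, MeV
DELTA = 1.2933               # m_n - m_p, MeV (assumed value)
G_BETA = 1.14730e-5 * 1e-6   # Eq. (15.29), converted from GeV^-2 to MeV^-2
C_GT_OVER_C_F = 1.2601       # Eq. (15.29)
HBAR = 6.58212e-22           # MeV s (assumed standard value)


def phase_space_f(delta=DELTA, m_e=M_E, n=20000):
    """int p E (delta - E)^2 dE from m_e to delta, in MeV^5, with F = 1."""
    h = (delta - m_e) / n
    total = 0.0
    for i in range(n):
        e = m_e + (i + 0.5) * h              # midpoint rule
        p = math.sqrt(e * e - m_e * m_e)
        total += p * e * (delta - e)**2
    return total * h


def neutron_lifetime_s():
    """Lifetime from Gamma = G^2 f (<1>^2 + r^2 <sigma>^2) / (2 pi^3)."""
    f = phase_space_f()
    strength = 1.0 + 3.0 * C_GT_OVER_C_F**2  # <1>^2 = 1, <sigma>^2 = 3
    gamma = G_BETA**2 * f * strength / (2.0 * math.pi**3)  # MeV
    return HBAR / gamma


f_in_me_units = phase_space_f() / M_E**5
tau_n = neutron_lifetime_s()

if __name__ == "__main__":
    print(f"f / m_e^5 = {f_in_me_units:.3f}")
    print(f"tau_n     = {tau_n:.0f} s")
```

Under these simplifications the result lands within roughly ten percent of the measured lifetime; the omitted corrections account for the remaining difference.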

15.2.2 

Parity Violation 



Probably the most shocking event in the history of particle physics was the
discovery of parity violation in 1957. Today no conservation law is sacred; each is
regarded as something to be checked by observation. At that time, however, it was
beyond imagination that such a fundamental law as space-time symmetry could be
broken. The first indication was the so-called τ-θ puzzle. The particle, known as the
K meson today, decays to 3π (τ particle) at one time and to 2π (θ particle) at another.
Both of them were known to have spin s = 0. Since the pion is known to have spin-parity
$J^P = 0^-$, to make J = 0 in the 2π combination, the orbital angular momentum
has also to be zero, resulting in positive parity. In the 3π system, taking $l$ as the
orbital angular momentum of 2π and L as that of the third π relative to the center
of mass of the 2π, $l + L = J = 0$ must hold. This means $l = L$ and the parity
is $P = (-1)^{3+l+L} = -1$. So τ and θ must be different particles. However, the
mystery was that they had the same mass and the same lifetime, indicating they
were one and the same particle.
Lee and Yang investigated past experiments and found that there was ample
evidence of parity conservation in the strong and electromagnetic interactions but
no evidence in the weak interaction. They proposed several experiments to test the
validity [258].



How to Test Parity Violation The momentum p changes its sign by parity
transformation but the angular momentum $J \sim r \times p$ does not. Therefore if an observable
such as $J\cdot p$ is found in a decay process, it is evidence of parity violation. Let p be
the momentum of decay electrons from a polarized nucleus having angular
momentum J. Then $J\cdot p$ is proportional to the cosine of the electron emission angle
with respect to the polarization axis. Therefore, by observing β decay electrons
from polarized nuclei, one can test the existence of the term $J\cdot p$. It appears as
up-down asymmetry with respect to a plane perpendicular to the polarization axis.



Wu's Experiment C.S. Wu, who was at the same university as Lee and Yang, made
polarized $^{60}$Co by cooling it down to 0.01 K and measured the angular
distribution of the decay electrons. Figure 15.3a shows her apparatus [392]. By applying
a weak magnetic field (a few hundred gauss⁶⁾), electrons in the outer shell of
the atom can be polarized. The nucleus is polarized dynamically by spin-spin
interaction with the electron. The degree of nuclear polarization was determined
by measuring simultaneously the anisotropy of its γ radiation in the transition
$^{60}$Ni$(4^+) \to 2^+ \to 0^+$:

$$ \varepsilon = \frac{W(\pi/2) - W(0)}{W(\pi/2)} \qquad (15.33) $$

Here, $W(\theta)$ is the relative intensity of the emitted gamma ray at angle $\theta$ with respect to
the polarization axis. Since $^{60}$Co$(5^+) \to {}^{60}$Ni$(4^+)$ is a Gamow-Teller transition, we
obtain the angular distribution by setting $C_i = C_i' = 0$, $i \neq A$ in Eq. (15.23). The
resultant angular distribution formula is

$$ S(\theta) \sim 1 + APv\cos\theta, \qquad A = \frac{2\,\text{Re}(C_AC_A'^*)}{|C_A|^2 + |C_A'|^2} \qquad (15.34) $$

where P is the polarization of the Co and v the electron velocity. The experimental
result was P = 0.65, $v \approx 0.6$, $APv = -0.38$, which means

$$ A = -1 \pm 0.1 \ \Rightarrow\ C_A' \approx -C_A \qquad (15.35) $$

Therefore the parity violation is maximal. It also shows the maximum C violation,
because $C_A'$ is not pure imaginary and has a magnitude as large as that of $C_A$. It was
later confirmed that for V type β decay also

$$ C_V' \approx -C_V \qquad (15.36) $$

6) One gauss (1 Gs) is $10^{-4}$ tesla ($10^{-4}$ T).

Figure 15.3 (a) Apparatus used to discover parity violation. (b) Observed data. $^{60}$Co$(5^+)$
β decays to $^{60}$Ni$(4^+)$ followed by gamma transitions $4^+ \to 2^+ \to 0^+$. The agreement
between the γ anisotropy and the asymmetry of the electron is evidence that the asymmetry
exists and is proportional to the polarization [392].

Inserting Eqs. (15.31), (15.35) and (15.36) in Eq. (15.23), one can express the
Lagrangian for β decay as

$$ \mathcal{L}_\beta = -\frac{G_\beta}{\sqrt{2}}\left[\bar p\gamma^\mu(1 - 1.26\gamma^5)n\right]\left[\bar e\gamma_\mu(1 - \gamma^5)\nu\right] \qquad (15.37) $$

The factor $1/\sqrt{2}$ is inserted in order to use the same $G_\beta$ as before the discovery of
parity violation. The Lagrangian (15.37) contains V and A type with approximately
equal strength and opposite sign and is commonly referred to as V-A type. We have
now obtained an important characteristic of the weak interaction. Because of $1 - \gamma^5$,
as far as the lepton is concerned only components with left chirality contribute to
the interaction. For the notion of chirality, see Sect. 4.3.5. We defer consideration of
the hadrons (or quarks), where the axial vector coupling differs by as much as 26%,
until we discuss the conserved vector current (CVC) hypothesis in Sect. 15.5.2.
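The arithmetic behind the "maximal" asymmetry in Wu's experiment is short enough to spell out. The sketch below takes the quoted values P = 0.65, v ≈ 0.6 and APv = −0.38, and assumes the normalized form A = 2 Re(C_A C_A'*)/(|C_A|^2 + |C_A'|^2) for the asymmetry coefficient; the function name is illustrative.

```python
def asymmetry_coefficient(c_a, c_a_prime):
    """A = 2 Re(C_A C_A'^*) / (|C_A|^2 + |C_A'|^2) (assumed normalized form)."""
    c_a = complex(c_a)
    c_a_prime = complex(c_a_prime)
    numerator = 2.0 * (c_a * c_a_prime.conjugate()).real
    return numerator / (abs(c_a)**2 + abs(c_a_prime)**2)


# Measured product A*P*v divided by polarization and electron velocity.
A_MEASURED = -0.38 / (0.65 * 0.6)

# Limiting cases of the coupling pattern.
a_parity_conserving = asymmetry_coefficient(1.0, 0.0)   # C_A' = 0
a_maximal = asymmetry_coefficient(1.0, -1.0)            # C_A' = -C_A
a_imaginary = asymmetry_coefficient(1.0, 1.0j)          # C_A' pure imaginary

if __name__ == "__main__":
    print(f"A from data: {A_MEASURED:.3f}")
    print(a_parity_conserving, a_maximal, a_imaginary)
```

The measured value sits at the edge of the allowed range |A| ≤ 1, which is reached only for C_A' = −C_A; a purely imaginary C_A', as C invariance would require, gives no asymmetry at all.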

15.2.3 

π Meson Decay

We determined the coupling constants of S, V, T and A from β decays, but as
the P type does not contribute in allowed transitions, we need to investigate pion
decay. Let us first calculate the decay rate using the interaction Lagrangian (15.37)
as determined by β decay. Almost 100% of the pions decay as

$$ \pi^- \to \mu^- + \bar\nu, \qquad \pi^+ \to \mu^+ + \nu \qquad (15.38) $$

with lifetime [311]

$$ \tau = (2.603 \pm 0.0005)\times10^{-8}\ \text{s} \qquad (15.39) $$

A conspicuous feature of pion decay is that the branching ratio of $\pi \to e + \nu$ is
extremely small:

$$ BR(\pi^\pm \to e^\pm\nu) = \frac{\Gamma(\pi^\pm \to e^\pm\nu)}{\Gamma(\pi^\pm \to \text{all})} = (1.230 \pm 0.004)\times10^{-4} \qquad (15.40) $$

Considering its larger phase space volume, one would naively expect a larger decay
rate than for $\pi \to \mu\nu$, an expectation not fulfilled by observation. To investigate
the reason, let us write down the variables in the reaction

$$ \pi(k) \to l(p) + \nu(q) \qquad (15.41) $$

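Since the muon has the same properties as the electron except for its mass, we
infer its emission can be treated in the same way as the β decay. Therefore, we
simply make the replacement $e \to \mu$ in Eq. (15.37) and the interaction Lagrangian
should have the form

$$ \mathcal{L}_{WI} = -\frac{G_\beta}{\sqrt{2}}\left[\bar p\gamma^\mu(1 - 1.26\gamma^5)n\right]\left[\bar\mu\gamma_\mu(1 - \gamma^5)\nu\right] \qquad (15.42) $$

The transition amplitude becomes

$$ T_{fi} = \langle\mu\nu|-\!\int d^4x\,\mathcal{L}_{WI}|\pi\rangle = (2\pi)^4\delta^4(k - p - q)\,\frac{G_\beta}{\sqrt{2}}\,\langle0|\bar p\gamma^\mu(1 - 1.26\gamma^5)n|\pi\rangle\,\bar u(p)\gamma_\mu(1 - \gamma^5)v(q) \qquad (15.43) $$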

The hadronic part contains corrections due to strong interaction (wave functions
made by quarks) and cannot be calculated easily from first principles, but using
symmetry arguments some restrictions can be applied. From Lorentz invariance,
the hadronic part can be expressed as

$$ \langle0|\bar p\gamma^\mu(1 - 1.26\gamma^5)n|\pi\rangle = -1.26\,\langle0|\bar p\gamma^\mu\gamma^5 n|\pi\rangle = -ik^\mu f_\pi \qquad (15.44) $$

The first term in the parentheses does not contribute because the pion has negative
parity. The restriction comes from the vector nature of the current and that the pion
momentum is the only vector that appears in the process. $f_\pi$ is referred to as the
pion decay constant. From energy-momentum conservation and the Dirac equation

$$ k_\mu\langle\mu, \nu|\bar\mu\gamma^\mu(1 - \gamma^5)\nu|0\rangle = (p + q)_\mu\,\bar u(p)\gamma^\mu(1 - \gamma^5)v(q) = m_\mu\,\bar u(p)(1 - \gamma^5)v(q) \qquad (15.45) $$

Therefore the transition matrix is written as

$$ T_{fi} = -i\,(2\pi)^4\delta^4(k - p - q)\,\frac{G_\beta}{\sqrt{2}}\,f_\pi m_\mu\,\bar u(p)(1 - \gamma^5)v(q) \qquad (15.46) $$

The decay rate can be calculated using the γ-matrix algebra given in Appendix C,
and becomes

$$ \Gamma(\pi \to \mu\nu) = \frac{G_\beta^2 f_\pi^2}{8\pi}\,\frac{m_\mu^2(m_\pi^2 - m_\mu^2)^2}{m_\pi^3} \qquad (15.47) $$

Since $G_\beta$ is already given by β decay, the measurement of the decay rate gives

$$ f_\pi = 131.74 \pm 0.15\ \text{MeV} \approx 0.94\,m_\pi \qquad (15.48) $$

The decay rate $\Gamma(\pi \to e\nu)$ is obtained simply by replacing $m_\mu \to m_e$,

$$ \therefore\ \frac{\Gamma(\pi \to e\nu)}{\Gamma(\pi \to \mu\nu)} = \frac{m_e^2(m_\pi^2 - m_e^2)^2}{m_\mu^2(m_\pi^2 - m_\mu^2)^2} = 1.284\times10^{-4} \qquad (15.49) $$

Including the radiative corrections, the theoretical value becomes $1.233\times10^{-4}$ [231]
and agreement with the experimental value Eq. (15.40) is very good.
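The helicity-suppressed ratio and the order of magnitude of the pion lifetime can be checked with a few lines. This sketch assumes the standard V-A width Γ(π → lν) = G_β^2 f_π^2 m_l^2 (m_π^2 − m_l^2)^2 / (8π m_π^3), the values of G_β and f_π quoted in Eqs. (15.29) and (15.48), and standard particle masses and ħ (not given in the text); radiative corrections are ignored.

```python
import math

M_PI = 139.570        # charged pion mass, MeV (assumed standard value)
M_MU = 105.658        # muon mass, MeV (assumed standard value)
M_E = 0.510999        # electron mass, MeV
G_BETA = 1.14730e-11  # Eq. (15.29) converted to MeV^-2
F_PI = 131.74         # pion decay constant, MeV, Eq. (15.48)
HBAR = 6.58212e-22    # MeV s (assumed standard value)


def pion_width(m_l):
    """Tree-level width of pi -> l nu in MeV for charged-lepton mass m_l."""
    prefactor = G_BETA**2 * F_PI**2 / (8.0 * math.pi)
    return prefactor * m_l**2 * (M_PI**2 - m_l**2)**2 / M_PI**3


ratio = pion_width(M_E) / pion_width(M_MU)
lifetime = HBAR / (pion_width(M_MU) + pion_width(M_E))

if __name__ == "__main__":
    print(f"Gamma(e nu)/Gamma(mu nu) = {ratio:.4e}")
    print(f"pion lifetime            = {lifetime:.3e} s")
```

The ratio depends only on the three masses, so it is insensitive to the values chosen for G_β and f_π; the lifetime does depend on them and is only expected to agree with Eq. (15.39) at the level of a few percent.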

Why the decay rate is proportional to the mass squared of the charged lepton can be understood as follows. In the rest frame of the $\pi^+$, the neutrino and the charged lepton fly away in opposite directions (Fig. 15.4). Since the neutrino has zero mass, the factor $(1-\gamma^5)$ forces the neutrino helicity to be 100% negative. (Note, in $\pi^-$ decay the antineutrino is produced, which has 100% positive helicity.) Since the pion spin is zero, angular momentum conservation forces the $\ell^+$ to have negative helicity, too. As $\ell^+$ is an antiparticle, the negative helicity component is suppressed by a factor $m_\ell/p$ in the negative chirality state (see arguments in Sect. 4.3.5). Therefore, if $m_\ell = 0$, the reaction is forbidden completely. It survives owing to the finite lepton mass but with suppression factor $\sim m_\ell/p$. The origin of the helicity suppression is $1-\gamma^5$.

That the lepton in the weak interaction has the factor $(1-\gamma^5)$ can also be confirmed by the direct observation of the muon helicity in the $\pi\to\mu\nu$ decay. Experimental data [140] show

$$0.9959 < |h(\mu^+)| < 1.0 \tag{15.50}$$

Exclusion of P-Type Interaction The pion decay rate also severely constrains the existence of the P-type interaction. If the Lagrangian is of P ($\Gamma = i\gamma^5$) type, the decay amplitude becomes

$$T_{fi} \sim G_\beta C_P\,\langle 0|\bar p\,i\gamma^5 n|\pi\rangle\,\bar u(p)\,i\gamma^5(1-\gamma^5)v(q) = G_\beta C_P\,\tilde f_\pi\,\bar u(p)(1-\gamma^5)v(q) \tag{15.51}$$

Note, no lepton mass appears in the expression. This is because the P-type interaction flips the chirality (see next section) and the suppression factor disappears. Therefore, if the P-type dominates, the rate will be determined largely by the phase space volume and the electron decay rate will be larger than the muon decay rate. Quantitatively, the decay rate ratio when the Lagrangian contains an extra P-type interaction is given by

$$\frac{\Gamma(\pi\to e\nu)}{\Gamma(\pi\to\mu\nu)} = \left|\frac{1-(m_e/m_\pi)^2}{1-(m_\mu/m_\pi)^2}\right|^2\left|\frac{C_P\tilde f_\pi - (m_e/m_\pi)\,C_{V-A}f_\pi}{C_P\tilde f_\pi - (m_\mu/m_\pi)\,C_{V-A}f_\pi}\right|^2 \tag{15.52}$$

Comparing the formula with the experimental value, one can easily understand that the P-type interaction is strongly excluded. In order not to contradict the experimental value we need $|C_P|^2/|C_{V-A}|^2 \lesssim 10^{-4}$.

Summarizing the result, one may say that the characteristic of the V-A interaction manifests itself most conspicuously in the pion decay mode. Historically, the determination of the weak interaction type took a long and meandering road. At one time erroneous experiments puzzled people as to which of the solutions V, A or S, T, P should be adopted. Logically, however, it is much clearer to grasp various features if one simply looks at the helicity of the decay leptons. Simply from the fact that the decay lepton has negative helicity [$h(\nu) = -1$, $h(\ell) = -v_\ell$] while that of the antilepton is opposite [$h(\bar\nu) = +1$, $h(\bar\ell) = +v_\ell$], one can conclude that the parity is maximally violated and that

$$C_S = C_T = C_P = 0, \qquad C_A = -C_V\,^{7)}, \qquad C_i' = -C_i \quad (i = V, A) \tag{15.53}$$

which will be discussed in the next section.



15.3 

Chirality of the Leptons 

15.3.1 

Helicity and Angular Correlation 

The significance of Eq. (15.37) cannot be overestimated. If the neutrino has vanishing mass, $(1-\gamma^5)\nu$ implies a two-component neutrino and that the chirality of the lepton that appears in the weak interaction is always negative. This means the helicity of the emitted $e^-$, $\bar\nu$ must be $-v$, $+1$ (for $e^+$, $\nu$, the opposite is true). We notice that the V, A-type interaction conserves the chirality, but S, T, P inverts it. To see this we note that the $\gamma$ matrices have the following properties:

$$\Gamma_i(1-\gamma^5) = (1+\gamma^5)\Gamma_i \qquad i = V, A \tag{15.54a}$$

$$\Gamma_i(1-\gamma^5) = (1-\gamma^5)\Gamma_i \qquad i = S, T, P \tag{15.54b}$$


This equation means that the measurement of the helicity of the emitted lepton can distinguish S, T, P from V, A. Once one accepts the fact that the neutrino has negative chirality, the fact that the angular correlation coefficient has value $\lambda = 1$ for V and $\lambda = -1/3$ for A [Eq. (15.25d)] can be understood as follows. In the V-type interaction $\Delta J = 0$ and the $e\bar\nu$ pair has total angular momentum $J = 0$. Since the helicity of the electron is negative and that of $\bar\nu$ is positive, they tend to fly in the same direction to have $J = 0$ (Fig. 15.5). Namely they want to have $\theta = 0$, hence $\lambda > 0$. For the A-type interaction, $\Delta J = 1$ and, the total spin of $e\bar\nu$ being 1, they want to fly in opposite directions ($\theta = \pi$), hence $\lambda < 0$ (Fig. 15.5). The factor 1/3 reflects the spin degree of freedom.

Figure 15.5 Explanation of angular correlation between $e$ and $\nu$. The negative chirality forces the electron to have $h(e^-) = -1$ and $h(\bar\nu) = +1$. In the Fermi transition (a), $\Delta J = 0$ and the two leptons have total spin zero, therefore they align themselves in the same direction, while in the Gamow-Teller transition (b), $\Delta J = 1$ and the opposite is true. The asymmetry in Co decay (c) can be explained in the same way.
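The statement that V and A conserve chirality while S, T and P invert it rests on whether each bilinear matrix $\Gamma_i$ anticommutes or commutes with $\gamma^5$. This can be verified numerically. The sketch below builds the Dirac matrices in the Dirac representation with plain Python lists (the representation, the helper names and the choice $\sigma^{\mu\nu} = \tfrac{i}{2}[\gamma^\mu,\gamma^\nu]$ for the tensor are assumptions of this illustration) and checks $\Gamma_i(1-\gamma^5) = (1+\gamma^5)\Gamma_i$ for V, A and $\Gamma_i(1-\gamma^5) = (1-\gamma^5)\Gamma_i$ for S, T, P.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def add(a, b, sign=1):
    return [[x + sign * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, s):
    return [[s * x for x in row] for row in a]

def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]

I4 = block(I2, Z2, Z2, I2)
# gamma^0 and gamma^1..3 in the Dirac representation
GAMMA = [block(I2, Z2, Z2, scale(I2, -1))] + [block(Z2, s, scale(s, -1), Z2) for s in SIGMA]
# gamma^5 = i gamma^0 gamma^1 gamma^2 gamma^3
GAMMA5 = scale(matmul(matmul(GAMMA[0], GAMMA[1]), matmul(GAMMA[2], GAMMA[3])), 1j)

LEFT = add(I4, GAMMA5, sign=-1)   # 1 - gamma^5
RIGHT = add(I4, GAMMA5)           # 1 + gamma^5

def conserves_chirality(g):
    """True if g (1 - gamma5) == (1 + gamma5) g, the V, A behavior."""
    return close(matmul(g, LEFT), matmul(RIGHT, g))

def flips_chirality(g):
    """True if g (1 - gamma5) == (1 - gamma5) g, the S, T, P behavior."""
    return close(matmul(g, LEFT), matmul(LEFT, g))

vector = GAMMA
axial = [matmul(g, GAMMA5) for g in GAMMA]
scalar = [I4]
pseudoscalar = [scale(GAMMA5, 1j)]
tensor = [scale(add(matmul(GAMMA[m], GAMMA[n]), matmul(GAMMA[n], GAMMA[m]), sign=-1), 0.5j)
          for m in range(4) for n in range(m + 1, 4)]

print("V, A conserve chirality:", all(conserves_chirality(g) for g in vector + axial))
print("S, T, P flip chirality: ", all(flips_chirality(g) for g in scalar + pseudoscalar + tensor))
```

Because $\bar\psi(1+\gamma^5) = \overline{(1-\gamma^5)\psi}$, the first relation says that both leptons in a V or A bilinear carry the same chirality, while the second says they carry opposite chirality.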

In a similar manner, the reason for the asymmetry $A = -1$ in Wu's cobalt experiment can be explained. In the process Co($5^+$) $\to$ Ni($4^+$) (Fig. 15.5c), $\Delta J = 1$ and the total spin of $e\bar\nu$ has to be 1. They have to be emitted in opposite directions and their total spin has to be aligned along the same direction as the parent Co's spin. This means $A < 0$.



15.3.2 

Electron Helicity 

The helicity of the electron can be measured after converting the longitudinal polarization to transverse polarization. There are several methods. We list a few here.

1. The electrostatic deflector changes the momentum but does not change the spin direction (Fig. 15.6).

2. Applying both the electric and the magnetic field perpendicular to the electron motion in such a way as to keep the electron's path straight, the magnetic field can induce spin rotation according to

$$i\frac{d\boldsymbol{s}}{dt} = [\boldsymbol{s}, H] = [\boldsymbol{s}, -\boldsymbol{\mu}\cdot\boldsymbol{B}] = i|\mu|\,\boldsymbol{\sigma}\times\boldsymbol{B} \tag{15.56}$$

3. Use Coulomb scattering as described in Sect. 6.7.4.



The transverse polarization can be measured using Mott scattering with nuclei. The electrostatic interaction produces symmetric scattering but the magnetic (spin-orbit coupling $H = -g\,\boldsymbol{s}\cdot\boldsymbol{\ell}$) interaction produces left-right asymmetry in the scattering. The magnetic field is induced by the motion of the electron and acts on the spin magnetic moment of the electron, because, in the electron rest frame, the target nucleus produces an electric current.

Figure 15.6 Measurement of the electron helicity. The helicity is determined by first converting the longitudinal polarization to transverse polarization and then measuring the left-right asymmetry of the Mott scattering. Here, an electrostatic deflector (a) [...] The magnetic interaction of the sp[...] asymmetry in the scattering and it[...] can be calculated using QED.

Figure 15.7 (a) Scattering by the Coulomb force is symmetric. (b) Scattering by the spin-orbit coupling force is asymmetric.

Figure 15.8 [...] electron helicity from allowed $\beta$ decays [...]

The reason for the asymmetry production can be understood classically as follows. As is shown in Fig. 15.7b, let us define the x-axis in the direction of the electron motion. Assume the electron is polarized in the z direction. When the electron is on a trajectory with $y > 0$, $\boldsymbol{\ell} = \boldsymbol{r}\times\boldsymbol{p}$ is along the $-z$ direction, whereas when the electron is on a trajectory with $y < 0$, $\boldsymbol{\ell} = \boldsymbol{r}\times\boldsymbol{p}$ is along the $+z$ direction, and the Hamiltonian changes its sign. In other words, the force (produced by a nonuniform magnetic field) is either attractive or repulsive depending on $y > 0$ or $y < 0$, and always bends the electron in the same direction, producing the asymmetry.

Measurements showed the electron's helicity to be $h(e^-) = -v$ and that of the positron $h(e^+) = +v$ and confirmed the negative chirality of the decay electron as shown in Fig. 15.8 [147, 280].



15.4 

The Neutrino 



15.4.1 

Detection of the Neutrino

The neutrino is the only particle that does not interact strongly or electromagnetically. The weak interaction, literally, couples weakly to matter, meaning not only that its strength is weak but also that it occurs very seldom. It took nearly 30 years after Pauli's prediction before the neutrino was actually detected by Reines and Cowan in 1956 [105, 330]. Today the neutrino is mass produced in reactors as well as in accelerators and even a neutrino telescope has been set up to watch supernovae. However, Reines and Cowan's method of detecting the neutrino is still valid and is being used in a forefront experiment to confirm neutrino oscillation. Therefore, it is worthwhile describing it here. The scheme of their detector is shown in Fig. 15.9. The process used is an inverse $\beta$ decay and the produced secondary particles were detected using the following reactions:

$$\bar\nu_e + p(u) \to (e^+ + W^-) + p(u) \to e^+ + n(d), \qquad e^+ + e^- \to \gamma + \gamma \tag{15.57a}$$

$$n + {}^{108}\mathrm{Cd} \to {}^{109}\mathrm{Cd}^* \to {}^{109}\mathrm{Cd} + \gamma \tag{15.57b}$$



where we have denoted participating quarks in parentheses. Neutrinos produced in a reactor were used for the detection. They are actually antineutrinos produced in the $\beta$ decays of fission products. Because of lepton number conservation, they can only be converted to positrons by (virtually) emitting $W^-$, and the target proton (actually a u quark in the proton) is converted to a neutron (d quark) by absorbing $W^-$. As the positron is an antielectron, it is annihilated immediately by an electron in matter, producing two gamma rays with total energy $2m_e$. The recoil neutron is nonrelativistic and is scattered many times by matter before it is captured by a Cd nucleus, thereby emitting another gamma ray. A time as long as $10^{-4}$ s will have passed and the neutron "totters" $\sim 1$ m. The gamma rays kick electrons (Compton scattering) in matter and the recoil electrons, in turn, emit bremsstrahlung photons, creating pairs. These electrons are detected by scintillation counters. The signal of the neutrino reaction is thus the occurrence of a prompt signal produced by the positron followed by a delayed signal produced by neutron capture. The experiment was carried out at the Savannah River Plant in South Carolina, USA.

Figure 15.9 Layout of the apparatus used to detect the neutrino in 1956 (liquid scintillator viewed by photomultipliers). The antineutrino produced in $\beta$ decays of nuclear fission products reacts with a proton and emits a positron and a neutron. The positron immediately annihilates an electron in matter, emitting two $\gamma$ rays. The slow neutron loiters for a while, eventually being captured by a Cd nucleus to emit another $\gamma$ [105, 330].

The detector consisted of a water tank sandwiched by liquid scintillators, which were viewed by photomultipliers. The tank contained about 200 liters of water with 45 kg of cadmium chloride (CdCl$_2$) dissolved in it. It was calculated that about $2\times10^{14}$ neutrinos were emitted per 1 kW of the reactor power and the flux injected into the detector was $5\times10^{13}\ \mathrm{s^{-1}\,cm^{-2}}$. The intensity was 100 times that of the solar neutrino, but the reaction rate was extremely small and suffered severely from false signals created by cosmic rays. To this day, the cosmic ray background remains a serious problem to be considered in designing an experiment. This was especially true for the first experiment. Eventually, taking the difference between measurements with the reactor in operation and not in operation, the experimenters confirmed an event rate of one per 20 min.

15.4.2 

Mass of the Neutrino 

The neutrino mass, i.e. whether it vanishes or not, has been a central issue in 
clarifying the properties of the neutrino. It still is, because a nonvanishing neutrino 
mass is a sign that the Standard Model is incomplete. There is evidence that it is 
finite because neutrino oscillation, in which a neutrino changes its flavor during 
flight, has been observed (see Sect. 19.2.1). However, the oscillation is sensitive 
only to the mass difference and determination of its absolute value is a forefront 
research goal as it offers a clue to new physics. 

A most promising reaction that may be able to determine the neutrino mass directly is the nuclear $\beta$ decay with a small Q value. The energy spectrum of the decay electron is obtained by integrating Eq. (15.18) over angles:

$$d\Gamma = \frac{G_\beta^2}{2\pi^3}|M|^2F(Z,E_e)\,p_eE_e\,p_\nu E_\nu\,dE_e \tag{15.58}$$

where we have used a more general expression $|M|^2F(Z,E)$, instead of $|\langle 1\rangle|^2$, that includes the Coulomb correction factor $F(Z,E)$ and recovered the correct neutrino variables to discuss the mass of the neutrino. We have already shown that $|M|^2$ does not contain the energy variable in an allowed transition. If $m_\nu \neq 0$,

$$p_\nu E_\nu = (E_0-E_e)\sqrt{(E_0-E_e)^2-m_\nu^2}, \qquad E_0 = M(Z,A) - M(Z+1,A) - m_e \tag{15.59}$$

Figure 15.10 (a) Spectrum of the ele[...] (c) Kurie plot with smearing. Smearin[...]



We define a spectral function referred to as a Kurie plot by

$$K(E) = \left[\frac{d\Gamma/dE_e}{F(Z,E_e)\,p_eE_e}\right]^{1/2} \tag{15.60}$$

If $m_\nu = 0$, $K(E)$ is proportional to $E_0 - E_e$ and describes a straight line.

Figure 15.10a depicts a whole electron spectrum from $\beta$ decays. For the $m_\nu$ determination, only electrons near the end point are important. If $m_\nu \neq 0$ the Kurie plot deviates from a straight line and is vertical at the end point (Fig. 15.10b), but after convolution with the experimental resolution ($\sigma$), the curve becomes blurred (Fig. 15.10c). As $m_\nu$ can be determined more accurately with a smaller value of $E_0$, the tritium isotope (T) with $E_0 = 18.6$ keV is usually chosen as the $\beta$ source. Many experiments have been carried out, so far in vain. The result has always been consistent with zero mass, albeit with a progressively improved upper limit.
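To see how a finite neutrino mass distorts the Kurie plot near the end point, one can evaluate the neutrino factor $p_\nu E_\nu = (E_0-E_e)\sqrt{(E_0-E_e)^2-m_\nu^2}$ directly. In the sketch below the factor $F(Z,E_e)\,p_eE_e$ is taken as already divided out and all constants are dropped, so the function returns $\sqrt{p_\nu E_\nu}$; the 10 eV test mass and the function names are purely illustrative assumptions, and no experimental resolution is applied.

```python
import math

E0 = 18.6e3  # end-point energy in eV (tritium value quoted in the text)

def neutrino_phase_space(e_electron, m_nu, e0=E0):
    """p_nu * E_nu in eV^2; zero beyond the kinematic limit E0 - m_nu."""
    e_nu = e0 - e_electron
    if e_nu <= m_nu:
        return 0.0
    return e_nu * math.sqrt(e_nu**2 - m_nu**2)

def kurie(e_electron, m_nu, e0=E0):
    """Kurie function up to a constant factor (F * p_e * E_e already divided out)."""
    return math.sqrt(neutrino_phase_space(e_electron, m_nu, e0))

for below_endpoint in (50.0, 20.0, 12.0, 10.0, 5.0):
    e = E0 - below_endpoint
    print(f"E0 - E = {below_endpoint:5.1f} eV   "
          f"K(m=0) = {kurie(e, 0.0):7.3f}   K(m=10 eV) = {kurie(e, 10.0):7.3f}")
```

For $m_\nu = 0$ the function is exactly the straight line $E_0 - E_e$; for the illustrative nonzero mass it falls below the line and reaches zero already at $E_0 - m_\nu$.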



Troizk-Mainz Experiment The previous arguments are valid for early experiments, in which the mass sensitivity was no better than 10-100 eV. Modern neutrino experiments have resolutions close to or better than a few eV, so that the atomic or molecular binding energies (a few eV) of the final states have to be considered (see [308] for a review), because the $\beta$-ray source used for the experiment is not a bare tritium nucleus but is prepared as a neutral tritium atom or molecule T$_2$. In the following experiments the molecular gas of tritium T$_2$, whose level structure is well understood, was used. Taking into account various final states of the daughter products [($^3$He-T)$^+$ + e], the $\beta$-ray energy spectrum is modified to

$$d\Gamma \sim p_eE_e\,p_\nu E_\nu = p_eE_e(E_0-E_e)\sqrt{(E_0-E_e)^2-m_\nu^2}$$

$$\to\ p_eE_e\sum_i P_i\,(E_0-V_i-E_e)\sqrt{(E_0-V_i-E_e)^2-m_\nu^2} \tag{15.61}$$

where $V_i$ is the excitation energy of a final state and $P_i$ is its probability. The recoil energy of the daughter is included in $E_0$. Note that the above formula tells us that the experimental observable is $m_\nu^2$, not $m_\nu$. The spectrum shape is shown in Fig. 15.10c, which includes smearing due to the excitation energy of the daughters.

In the early measurements, a magnetic spectrometer was used to determine the 
momentum of the decay electrons, but it only measures the transverse component 
of the momentum relative to the magnetic field lines and smearing due to scatter- 
ing cannot be avoided, thus making it hard to resolve the contributions of the final 
states. Modern detectors use an electrostatic spectrometer, which measures the to- 
tal energy of the electron. It filters and passes only a small portion of the high-end 
spectrum, which is the region most sensitive to the mass value. 

Here we describe the basic principles of the electrostatic filter of the Troizk [54] and Mainz [243, 379] experiments, which use MAC-E (Electrostatic filter with Magnetic Adiabatic Collimator). A schematic layout is shown in Fig. 15.11. Two superconducting solenoids provide the guiding magnetic field. Beta particles, which are emitted from the tritium source in the strong field $B_S$ inside the left solenoid, are guided magnetically into the spectrometer along the magnetic field lines and detected by a detector, which is located at the other end in the strong field $B_D$. HV (high voltage) electrodes generate a retarding (accelerating) graded electric field in the left (right) half of the spectrometer and act as a filter to pass only high-energy $\beta$ particles determined by the voltage $U_0$.

From the source, $\beta$ particles are emitted isotropically. When a particle is emitted at angle $\theta$, its transverse momentum relative to the magnetic field line causes it to describe a circular motion, and it proceeds forward in a spiral along the field line. The angular frequency of the circular motion is given by the cyclotron frequency $\omega_c = eB/\gamma m$, where $m$ is the electron mass and $\gamma$ the relativistic Lorentz factor. The orbital angular momentum is a constant of motion for a constant magnetic field. The magnetic field of the spectrometer is not uniform but strong at both ends and weakest ($B = B_{min}$) in the central plane of the spectrometer. However, as long as the adiabatic condition

$$\frac{1}{B}\frac{dB}{dt} \ll \omega_c \tag{15.62}$$

is satisfied, the orbital angular momentum is still a constant of the motion. The orbital angular momentum $L$ (times $e/2m$) is a product of the magnetic moment $\mu$ created by the circular motion and the relativistic $\gamma$ factor and can be expressed as

$$\frac{e}{2m}L = \gamma\mu = \gamma IS = \gamma\,\frac{e\omega_c}{2\pi}\,\pi\rho^2 = \frac{p_T^2}{2mB} = \frac{\gamma+1}{2}\,\frac{E_T}{B} \tag{15.63}$$

where $I$ is the circular current, $S$ the area of the circle, $\rho$ its radius and $E_T$ the transverse energy of the electron. In the nonrelativistic approximation, which is valid for the tritium $\beta$ decay,

$$\mu = \frac{E_T}{B} = \mathrm{constant}\,^{8)} \tag{15.64}$$

Figure 15.11 Principle of the MAC-E filter: Two superconducting solenoids produce a magnetic guiding field. The tritium source is placed in the left solenoid, while the detector sits in the right solenoid. Between the two solenoids there is an electrostatic retardation system consisting of several cylindrical electrodes. In the central plane perpendicular to [...] (Label at the bottom of the figure: adiabatic transformation $E_\perp \to E_\parallel$.)



Equation (15.64) means the transverse energy diminishes in proportion to the strength of the magnetic field. Without the electric field, the total energy $E = E_\parallel + E_T$ is a constant of the motion, namely, the spectrometer transfers the transverse energy to the longitudinal energy, as is illustrated at the bottom of Fig. 15.11. At the central plane (analyzing plane), $B_{min}/B_S = 3\times10^{-4}\ \mathrm{T}/6\ \mathrm{T}$ and all the $\beta$ particles are almost parallel to the axis of the two solenoidal coils. The electric field produces the retarding field and pushes all the particles back except those with energy higher than the maximum potential $eU_0$. The right half of the spectrometer, on the other hand, accelerates the particles that have survived the electric filter and delivers them to the detector.
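The adiabatic invariance of $E_T/B$ fixes how much transverse energy can remain at the analyzing plane, and hence how sharply the electrostatic barrier can cut. The sketch below applies the nonrelativistic relation $E_T/B = \mathrm{constant}$ to the field values quoted above. The function names are illustrative, and treating the worst case (emission at $90^\circ$ with the full end-point energy) as a measure of the filter width is an assumption of this sketch rather than a statement from the text.

```python
import math

B_SOURCE = 6.0        # T, field at the tritium source (from the text)
B_MIN = 3.0e-4        # T, field at the analyzing plane (from the text)
E_ENDPOINT = 18.6e3   # eV, tritium end-point energy (from the text)

def transverse_energy_at(b_field, e_t_source, b_source=B_SOURCE):
    """Transverse energy after adiabatic transport, using E_T / B = constant."""
    return e_t_source * b_field / b_source

def longitudinal_energy_at(b_field, e_total, theta_deg, b_source=B_SOURCE):
    """Longitudinal energy at b_field for emission angle theta (no electric field)."""
    e_t_source = e_total * math.sin(math.radians(theta_deg)) ** 2
    return e_total - transverse_energy_at(b_field, e_t_source, b_source)

# Worst case: an end-point electron emitted at 90 degrees to the field line.
residual = transverse_energy_at(B_MIN, E_ENDPOINT)
print(f"B_min / B_S = {B_MIN / B_SOURCE:.1e}")
print(f"largest residual transverse energy = {residual:.2f} eV")
print(f"E_parallel at analyzing plane (60 deg emission) = "
      f"{longitudinal_energy_at(B_MIN, E_ENDPOINT, 60.0):.3f} eV")
```

With these numbers less than 1 eV of an 18.6 keV electron's energy is still transverse at the analyzing plane, whatever the emission angle.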



* = Vb = » H£ 
Figure 15.12 Squared neutrino mass values obtained from tritium $\beta$ decay in the period 1990-2005 plotted against the year of publication.

Practically all the $\beta$ particles that are directed toward the detector, except those with a tiny fraction ($B_{min}/B_S$) of $E_T$, are accepted. Therefore the geometrical acceptance of the source is nearly $2\pi$.

Figure 15.12 shows the history of neutrino mass measurements. The current best value is [311]

$$m_\nu^2 = -1.1 \pm 2.4\ \mathrm{eV}^2 \tag{15.66}$$



Since from the oscillation data m < 0.05 eV is inferred (see Sect. 19.2.1), the abso- 
lute mass measurement should be aimed at least at this value when future experi- 
ments are being planned. 

15.4.3 

Helicity of the Electron Neutrino 



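The helicity of the neutrino cannot be measured directly, but it can be determined by observing the recoil nucleus as it emits a neutrino following electron capture [179]. The reaction used is described in Fig. 15.13a.

$$e^- + {}^{152}\mathrm{Eu}\,(J=0) \to {}^{152}\mathrm{Sm}^*\,(J=1) + \nu_e, \qquad {}^{152}\mathrm{Sm}^* \to {}^{152}\mathrm{Sm} + \gamma \tag{15.67}$$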



In the first reaction, Eu at rest absorbs an electron in an orbital angular momentum S state (K-capture), therefore the spin of the recoiling Sm* on emitting a neutrino is aligned in the same direction as the electron spin and opposite to the neutrino spin. If the momentum of the emitted $\gamma$ is along the spin direction of the parent Sm*, the $\gamma$ preserves the parent's helicity. Therefore by measuring the $\gamma$ helicity, the helicity of the neutrino can be determined. The trick of this ingenious experiment is to pick out exclusively the forward-going $\gamma$ by using resonance scattering. Resonance scattering occurs only when the incident $\gamma$ has the right energy to raise Sm to the resonant level 963 keV above the ground state

$$\gamma + {}^{152}\mathrm{Sm} \to {}^{152}\mathrm{Sm}^* \to {}^{152}\mathrm{Sm} + \gamma \tag{15.68}$$

This means the energy of the incoming $\gamma$ ray has to be slightly larger than 963 keV to give the recoil energy to the target. This extra energy is supplied by the moving Sm*, which emits the gamma ray in the forward direction. The helicity of the $\gamma$ can be determined by passing it through magnetized iron as the absorption rate depends on the value of the helicity. When the electron spin in the iron atom is antiparallel to the $\gamma$ spin, it can absorb the $\gamma$ by flipping its spin, but when the spin is parallel it cannot flip its spin, hence there is less absorption. The experiment has demonstrated the negative helicity of the neutrino and confirmed the weak interaction type as represented by Eq. (15.37).

Figure 15.13 A schem[...] helicity of the neutrino [...]ing apparatus. (a) Eu($0^-$) captures an electron, emits a neutrino, is excited to Sm*($1^-$) and subsequently decays to Sm, emitting $\gamma$ rays. Only those $\gamma$'s that are emitted in the same direction as the recoil momentum have enough energy to scatter and excite another Sm to a resonant state 963 keV above the ground state. This is possible because the half-life of 15[...]ing of the level, thus allowing the $\gamma$ energy to reach the resonance energy. The helicity of the $\gamma$ can be measured using the different absorption in passing through magnetized iron. (b) The apparatus contains $^{152}$Eu in the magnetized iron. Emitted $\gamma$ rays are scattered by Sm$_2$O$_3$ below and detected by a NaI scintillation counter [179].
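The size of the energy mismatch that the moving Sm* has to make up can be estimated from the nuclear recoil energy $E_R = E_\gamma^2/2Mc^2$. A nucleus at rest loses $E_R$ when emitting the photon and the target needs an additional $E_R$ when absorbing it, so the photon falls short of resonance by about $2E_R$. The sketch below uses the 963 keV level energy from the text; approximating the nuclear mass by $A$ atomic mass units, and the function name, are assumptions of this illustration.

```python
U_KEV = 931494.10242   # keV per atomic mass unit (standard value, assumed here)

def nuclear_recoil_energy_ev(e_gamma_kev, mass_number):
    """Recoil energy E_gamma^2 / (2 M c^2) in eV, with M approximated as A * u."""
    mass_kev = mass_number * U_KEV
    return 1.0e3 * e_gamma_kev**2 / (2.0 * mass_kev)

recoil = nuclear_recoil_energy_ev(963.0, 152)
deficit = 2.0 * recoil   # recoil lost on emission plus recoil needed on absorption
print(f"recoil energy per emission or absorption = {recoil:.2f} eV")
print(f"total shortfall for a source at rest     = {deficit:.2f} eV")
```

The shortfall is only a few eV out of nearly 1 MeV, which is why a modest Doppler shift from the recoiling Sm* is enough to restore the resonance for forward-emitted photons.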

15.4.4 

The Second Neutrino $\nu_\mu$

The decay electron from the muon has a continuous energy spectrum, indicating that two neutral particles (neutrinos) are emitted.

$$\mu^- \to e^- + \bar\nu_e + \nu_\mu \tag{15.69}$$

The initial question was whether they are the same two neutrinos ($\nu_e$, $\bar\nu_e$) as appear in nuclear $\beta$ decays or two different neutrinos ($\bar\nu_e$, $\nu_\mu$).

If $\nu_\mu$ is identical to $\nu_e$ the muon should be able to decay to $e + \gamma$ through the process depicted in Fig. 15.14, because the second neutrino ($\nu_\mu$) couples to the muon and the first to the electron. The calculated decay rate was within experimental reach but the process has not been observed to this day [311]

$$\mathrm{BR}(\mu\to e+\gamma) < 1.2\times10^{-11} \tag{15.70}$$

Figure 15.14 $\mu\to e+\gamma$ can occur through this process if the $\nu_\mu$ is identical or mixes with $\nu_e$, giving a much higher decay rate than observed.

If $\nu_\mu \neq \nu_e$, the process in Fig. 15.14 is absent and i[...]plained. The hypothesis can be tested by making decay [...] react with matter. If $\nu_\mu \neq \nu_e$, electrons will not be produced by $\nu_\mu N$ reactions.

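$$\begin{aligned}
\nu_\mu + n &\to \mu^- + p, &\qquad \nu_\mu + n &\nrightarrow e^- + p\\
\bar\nu_\mu + p &\to \mu^+ + n, &\qquad \bar\nu_\mu + p &\nrightarrow e^+ + n
\end{aligned} \tag{15.72a}$$

$$\begin{aligned}
\nu_e + n &\to e^- + p, &\qquad \nu_e + n &\nrightarrow \mu^- + p\\
\bar\nu_e + p &\to e^+ + n, &\qquad \bar\nu_e + p &\nrightarrow \mu^+ + n
\end{aligned} \tag{15.72b}$$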

An experiment carried out at BNL in 1962 confirmed the hypothesis [112]. Only muons were produced and no electrons were observed in the reaction using neutrinos from $\pi$ decay. This was the first evidence that there exists a new quantum number (one might call it "e-ness" and "mu-ness" to distinguish the two kinds of leptons) that is conserved in the leptonic weak interaction. Later, the tau lepton $\tau$ and its associated neutrino were discovered in the $e^- + e^+$ reaction and "tau-ness" was added. Today, they are generically called lepton flavor.



15.5 

The Universal V-A Interaction 

15.5.1 
Muon Decay 

The muon was discovered in 1937 by Anderson and Neddermeyer [293] and was mistaken for the meson predicted by Yukawa because it had about the right mass ($m_\mu \approx 200m_e$). The confusion lasted for ten years until finally the pion was discovered in 1947 by Powell [80] (see Fig. 13.5). But after the pion had been identified as the right Yukawa meson, the muon remained a mystery. Except for its mass, the muon had identical properties to the electron and may well be called a "heavy electron". Except for the fact that it can decay into an electron and neutrinos, all the reaction formulas for the muon may be reproduced from those of the electron simply by replacing its mass. Nobody could explain why it should exist. The perplexity was exemplified by Rabi's words: "Who ordered it?" Although its existence itself was a mystery, its properties have provided essential clues in clarifying the characteristics of the weak interaction. Precision experiments on the muon properties provide a sensitive test of the Standard Model and remain a hot topic to this day.

Decay Rate The muon decays practically 100% of the time to an electron and two neutrinos. Here we investigate its decay rate and see what it tells us. Let us define the kinematical variables as

$$\mu^-(p_1) \to e^-(p_4) + \bar\nu_e(p_2) + \nu_\mu(p_3) \tag{15.73}$$

The interaction Lagrangian may be written as

$$\mathcal{L}_{WI} = -\frac{G_\mu}{\sqrt{2}}\,[\bar\nu_\mu\gamma^\mu(1-\gamma^5)\mu]\,[\bar e\gamma_\mu(1-\gamma^5)\nu_e] + \mathrm{h.c.} \tag{15.74}$$

Here, the Fermi coupling constant is denoted as $G_\mu$ for the sake of later discussions. The transition amplitude is given by

$$T_{fi} = (2\pi)^4\delta^4(p_1-p_2-p_3-p_4)\,\frac{G_\mu}{\sqrt{2}}\,[\bar u(p_3)\gamma^\mu(1-\gamma^5)u(p_1)]\,[\bar u(p_4)\gamma_\mu(1-\gamma^5)v(p_2)] \tag{15.75}$$

If the muon is not polarized, taking the spin average of the initial and the sum of all the final state spins, we can express the decay rate as

$$d\Gamma = \frac{1}{2m_\mu}|M|^2\,dLIPS \tag{15.76a}$$

$$|M|^2 = 4G_\mu^2\,K^{\mu\nu}(p_1,p_3)\,K_{\mu\nu}(p_2,p_4) \tag{15.76b}$$

where

$$dLIPS = (2\pi)^4\delta^4(p_1-p_2-p_3-p_4)\,\frac{d^3p_2}{(2\pi)^32E_2}\,\frac{d^3p_3}{(2\pi)^32E_3}\,\frac{d^3p_4}{(2\pi)^32E_4} \tag{15.76c}$$

$$\begin{aligned}
K^{\mu\nu}(p_1,p_3) &= \frac{1}{4}\sum_{\mathrm{spin}}[\bar u(p_3)\gamma^\mu(1-\gamma^5)u(p_1)]\,[\bar u(p_3)\gamma^\nu(1-\gamma^5)u(p_1)]^*\\
&= 2\left[\{p_1^\mu p_3^\nu + p_1^\nu p_3^\mu - g^{\mu\nu}(p_1\cdot p_3)\} + i\epsilon^{\mu\nu\rho\sigma}p_{1\rho}p_{3\sigma}\right]\\
&= P^{\mu\nu}(p_1,p_3) + Q^{\mu\nu}(p_1,p_3)
\end{aligned} \tag{15.77}$$

[see Appendix Eq. (C.10d)]. Using

$$\begin{aligned}
P^{\mu\nu}(p_1,p_3)\,P_{\mu\nu}(p_2,p_4) &= 8\{(p_1\cdot p_2)(p_3\cdot p_4) + (p_1\cdot p_4)(p_2\cdot p_3)\}\\
Q^{\mu\nu}(p_1,p_3)\,Q_{\mu\nu}(p_2,p_4) &= 8\{(p_1\cdot p_2)(p_3\cdot p_4) - (p_1\cdot p_4)(p_2\cdot p_3)\}\\
P^{\mu\nu}Q_{\mu\nu} &= Q^{\mu\nu}P_{\mu\nu} = 0
\end{aligned} \tag{15.78}$$

we obtain

$$K^{\mu\nu}(p_1,p_3)\,K_{\mu\nu}(p_2,p_4) = 16\,(p_1\cdot p_2)(p_3\cdot p_4) \tag{15.79}$$

If Eq. (15.79) is inserted in Eqs. (15.76), the expression for the decay rate becomes

$$|M|^2 = 64\,G_\mu^2\,(p_1\cdot p_2)(p_3\cdot p_4) \tag{15.80a}$$

$$d\Gamma = \frac{64\,G_\mu^2\,(p_1\cdot p_2)(p_3\cdot p_4)}{2m_\mu(2\pi)^5}\,\frac{d^3p_2\,d^3p_3\,d^3p_4}{8E_2E_3E_4}\,\delta^4(p_1-p_2-p_3-p_4) \tag{15.80b}$$
Problem 15.3

When $m_2 = m_3 = 0$, show

$$\int\frac{d^3p_2}{2E_2}\,\frac{d^3p_3}{2E_3}\;p_2^\mu p_3^\nu\,\delta^4(P-p_2-p_3) = \frac{\pi}{24}\left(g^{\mu\nu}P^2 + 2P^\mu P^\nu\right) \tag{15.81}$$

Hint: From Lorentz invariance the equation can be expressed as $Ag^{\mu\nu} + BP^\mu P^\nu$. Calculate $A$ and $B$ in the center of mass frame.

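Using Eq. (15.81), we have

$$d\Gamma = \frac{G_\mu^2}{48\pi^4m_\mu}\,\frac{d^3p_4}{E_4}\left[P^2\,(p_1\cdot p_4) + 2\,(P\cdot p_1)(P\cdot p_4)\right], \qquad P = p_1 - p_4 \tag{15.82}$$

If we neglect the electron mass, the electron energy takes its minimum value 0 in the muon rest frame, and its maximum value $m_\mu/2$ when the momenta of the two neutrinos are both in the opposite direction to that of the electron. Putting $p_1 = (m_\mu, 0, 0, 0)$ and setting $m_e = 0$ we obtain the electron energy spectrum

$$\frac{d\Gamma}{dE_e} = \frac{G_\mu^2m_\mu^2}{12\pi^3}\,E_e^2\left(3 - \frac{4E_e}{m_\mu}\right) \tag{15.83}$$

This reproduces the experimental data (Fig. 15.15) quite well.
Integrating over the energy, we obtain the total decay rate

$$\Gamma = \frac{G_\mu^2m_\mu^5}{192\pi^3} \tag{15.84}$$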

Problem 15.4 
Derive Eq. (15.83). 

Problem 15.5
Show for $m_e \neq 0$ Eq. (15.84) is modified to

$$\Gamma = \frac{G_\mu^2m_\mu^5}{192\pi^3}\,f\!\left(\frac{m_e^2}{m_\mu^2}\right), \qquad f(y) = 1 - 8y + 8y^3 - y^4 - 12y^2\ln y \tag{15.85}$$

The precise expression including the radiative correction is given by [53, 333]

Figure 15.15 Energy spectrum of the decay electrons from $\mu\to e\nu\bar\nu$ (horizontal axis: positron momentum in MeV/c). The line includes the radiative correction with the Michel parameter $\rho = 3/4$ [44].


Inserting the observed muon lifetime [311]

$$\tau = \frac{1}{\Gamma} = (2.197019 \pm 0.000021)\times10^{-6}\ \mathrm{s} \tag{15.87}$$

we find the coupling constant is determined as

$$G_\mu \sim 10^{-5}\ \mathrm{GeV}^{-2} \tag{15.88}$$

which is almost identical with $G_\beta$ [Eq. (15.29)] obtained from nuclear $\beta$ decays. This was the first indication that the coupling constant of the weak interaction was universal. For later discussion we list the two coupling constants and compare them:

$$G_\mu = (1.16637 \pm 0.00001)\times10^{-5}\ \mathrm{GeV}^{-2} \tag{15.89a}$$

$$G_\beta = (1.14730 \pm 0.00064)\times10^{-5}\ \mathrm{GeV}^{-2} \tag{15.89b}$$

The small but significant difference between the two coupling constants will be 
discussed later. 
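The extraction of $G_\mu$ from the muon lifetime can be reproduced numerically. The sketch below inverts the tree-level rate $\Gamma = G_\mu^2 m_\mu^5 f(m_e^2/m_\mu^2)/192\pi^3$, with $f(y)$ as in Problem 15.5, and also checks by direct integration that the spectrum shape $2x^2(3-2x)$ with $x = 2E_e/m_\mu$ is normalized to one. The lifetime and the two quoted coupling constants come from the text; the masses, $\hbar$ and all names are assumptions of this illustration. Radiative corrections are left out, so the result is expected to fall slightly below the quoted $G_\mu$.

```python
import math

HBAR = 6.582119569e-25    # GeV s (standard value, assumed)
M_MU = 0.1056583755       # GeV (standard value, assumed)
M_E = 0.51099895e-3       # GeV (standard value, assumed)
TAU_MU = 2.197019e-6      # s, muon lifetime quoted in the text
G_MU_QUOTED = 1.16637e-5  # GeV^-2, quoted in the text
G_BETA = 1.14730e-5       # GeV^-2, quoted in the text

def phase_space_factor(y):
    """Electron-mass correction f(y) with y = (m_e / m_mu)^2."""
    return 1.0 - 8.0 * y + 8.0 * y**3 - y**4 - 12.0 * y**2 * math.log(y)

def fermi_constant_from_lifetime(tau, m_mu=M_MU, m_e=M_E):
    """Invert Gamma = G^2 m_mu^5 f(y) / (192 pi^3), tree level only."""
    width = HBAR / tau
    f = phase_space_factor((m_e / m_mu) ** 2)
    return math.sqrt(192.0 * math.pi**3 * width / (m_mu**5 * f))

def integrated_spectrum(n=1000):
    """Midpoint integral of the normalized shape 2 x^2 (3 - 2x) over 0 <= x <= 1."""
    h = 1.0 / n
    return sum(2.0 * x * x * (3.0 - 2.0 * x) * h for x in ((i + 0.5) * h for i in range(n)))

g_tree = fermi_constant_from_lifetime(TAU_MU)
print(f"G_mu (tree level)  = {g_tree:.5e} GeV^-2")
print(f"G_mu (quoted)      = {G_MU_QUOTED:.5e} GeV^-2")
print(f"G_beta / G_mu      = {G_BETA / G_MU_QUOTED:.4f}")
print(f"spectrum integral  = {integrated_spectrum():.6f}")
```

The tree-level value differs from the quoted one at the level of a few parts per thousand, which is the size of effect the radiative correction is expected to account for, while the ratio $G_\beta/G_\mu$ shows the two couplings agreeing to within 2%.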
Michel Parameters In calculating the muon decay rate, we assumed the V-A interaction. Historically, however, the most general expression [Eq. (15.23)] was used to determine the interaction type. In the electron energy spectrum from polarized muons, angular correlations with the polarization axis vary according to the interaction type. Its general form is expressed in terms of Michel parameters [115, 141, 274]. The usefulness of the parameters lies in that they can be applied not only to the muon but also to the tau or any new lepton to determine its interaction type. Development of surface muons 9) has enabled precise experiments using polarized muons, providing a very effective test bench to check the validity of the Standard Model and beyond. The distribution of the decay electron from the polarized muon $\mu^\mp$ can be expressed in the approximation neglecting the electron mass as

$$\frac{1}{\Gamma}\frac{d^2\Gamma}{dx\,d\cos\theta} = 2x^2\left[3(1-x) + \frac{2}{3}\rho\,(4x-3) \mp P_\mu\,\xi\cos\theta\left\{(1-x) + \frac{2}{3}\delta\,(4x-3)\right\}\right] \tag{15.90}$$

where $P_\mu$ is the muon polarization, $x = 2E_e/m_\mu$, and $\theta$ is the angle between the muon spin and the electron momentum. Note that the term with the polarization is the parity-violating term. $\rho$, $\xi$, $\delta$ are the Michel parameters [141]. In order not to make the equation too complicated, we restrict the interaction type to V and A in the following. The general formula can be found in [311]. The Lagrangian can be written in the form



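$$\begin{aligned}
\mathcal{L} = -\frac{G_\mu}{\sqrt{2}}\Big[\;&\bar e\gamma^\mu(1-\gamma^5)\nu_e\,\big(\bar\nu_\mu\gamma_\mu[g_{LL}(1-\gamma^5) + g_{LR}(1+\gamma^5)]\mu\big)\\
+\;&\bar e\gamma^\mu(1+\gamma^5)\nu_e\,\big(\bar\nu_\mu\gamma_\mu[g_{RL}(1-\gamma^5) + g_{RR}(1+\gamma^5)]\mu\big)\Big] + \mathrm{h.c.}
\end{aligned} \tag{15.91}$$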



The Michel parameters are expressed as 

_ 3 |g LL | 2 + |g RR | 2 A _ 3 |g LL | 2 -|g Rf 



(15.92) 



In the V-A or the Standard Model, $g_{LL} = 1$, $g_{LR} = g_{RL} = g_{RR} = 0$, which gives

$$\rho = \delta = \frac{3}{4}, \qquad \xi = 1 \tag{15.93}$$

If we adopt V-A as proven for the electron, we can set $g_{RL} = g_{RR} = 0$ and test only the muon part: $g_{LL} = 0$, $g_{LR} = 1$ for the V+A type and $g_{LL} = g_{LR}$ ($g_{LL} = -g_{LR}$) for the pure V (A) type, giving $\rho = 0$ (V+A), 3/8 (V), 3/8 (A). The data show [311]

$$\rho = 0.7509 \pm 0.0010 \tag{15.94a}$$

$$\delta = 0.7495 \pm 0.0012 \tag{15.94b}$$

$$|\xi P_\mu| = 1.0007 \pm 0.0035 \tag{15.94c}$$

$|P_\mu|$ is the polarization of the muon and should be 100% for muons obtained from $\pi\to\mu\nu$ in the V-A interaction. An experimental analysis using the surface muon beam gives [40]

$$|g_{LL}| > 0.888, \quad |g_{LR}| < 0.060, \quad |g_{RL}| < 0.110, \quad |g_{RR}| < 0.033 \qquad (90\%\ \mathrm{CL}) \tag{15.95}$$

In summary, the data confirm the V-A-type Lagrangian with high accuracy.

9) If matter is irradiated with pions, muons produced in $\pi\to\mu\nu$ by slow pions stopping in the matter have a definite energy. In particular, those from the surface of the matter have a clean monochromatic spectrum.
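How the Michel parameters shape the decay distribution can be explored numerically. The sketch below assumes the normalized form $\frac{1}{\Gamma}\frac{d^2\Gamma}{dx\,d\cos\theta} = 2x^2[3(1-x) + \tfrac{2}{3}\rho(4x-3) \mp P_\mu\xi\cos\theta\{(1-x) + \tfrac{2}{3}\delta(4x-3)\}]$ with the upper sign for $\mu^-$, neglects the electron mass, and integrates it on a midpoint grid. The function names, the `muon_charge` argument and the grid size are choices made for this illustration only.

```python
def michel_distribution(x, cos_theta, rho=0.75, xi=1.0, delta=0.75,
                        polarization=1.0, muon_charge=-1):
    """Normalized d^2Gamma / (dx dcos_theta); muon_charge = -1 for mu-, +1 for mu+."""
    isotropic = 3.0 * (1.0 - x) + (2.0 / 3.0) * rho * (4.0 * x - 3.0)
    anisotropic = (1.0 - x) + (2.0 / 3.0) * delta * (4.0 * x - 3.0)
    return 2.0 * x**2 * (isotropic + muon_charge * polarization * xi * cos_theta * anisotropic)

def integrate(func, n=400):
    """Midpoint rule over 0 <= x <= 1 and -1 <= cos_theta <= 1.

    Returns (total, forward, backward), where forward means cos_theta > 0.
    """
    hx, hc = 1.0 / n, 2.0 / n
    forward = backward = 0.0
    for i in range(n):
        x = (i + 0.5) * hx
        for j in range(n):
            c = -1.0 + (j + 0.5) * hc
            weight = func(x, c) * hx * hc
            if c > 0.0:
                forward += weight
            else:
                backward += weight
    return forward + backward, forward, backward

total, fwd, bwd = integrate(michel_distribution)           # V-A, fully polarized mu-
asymmetry = (fwd - bwd) / total
total_v, _, _ = integrate(lambda x, c: michel_distribution(x, c, rho=0.375))

print(f"normalization (V-A)           = {total:.5f}")
print(f"forward-backward asymmetry    = {asymmetry:+.5f}")
print(f"normalization with rho = 3/8  = {total_v:.5f}")
```

The normalization does not depend on $\rho$, so $\rho$ changes only the shape of the energy spectrum, while $\xi$ and $\delta$ control the angular asymmetry; for V-A values and full polarization the integrated forward-backward asymmetry has magnitude 1/6.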

15.5.2 

CVC Hypothesis 

Isospin Current The form of the interaction determined by the nuclear and muon decay can be combined to give

$$\mathcal{L}_{WI} = -\frac{G_\beta}{\sqrt{2}}\,[\bar p\gamma^\mu(1 - 1.26\gamma^5)n]\,[\bar e\gamma_\mu(1-\gamma^5)\nu_e + \bar\mu\gamma_\mu(1-\gamma^5)\nu_\mu] \tag{15.96}$$

As far as the vector part of the weak current is concerned, the coupling constant of the muon decay ($G_\mu$) and that of nuclear $\beta$ decay ($G_\beta$) agree within 2%. The $n\to p$ transformation is really a $d\to u$ transformation and is influenced by the quark wave functions that form the proton bound state. Namely, the coupling here includes the strong interaction correction. Based on the near identity of the strengths, it was suggested that the weak vector current is a conserved current, namely

$$\partial_\mu J_W^{+\mu} = \partial_\mu(\bar p\gamma^\mu n) = \partial_\mu(\bar\psi\gamma^\mu\tau_+\psi) = 0, \qquad \tau_+ = \frac{\tau_1 + i\tau_2}{2} \tag{15.97}$$

The inference is based on the analogy of the identical strength of the electric charge for the electron and the proton. The proton, interacting strongly, couples to the electromagnetic current not only as a single proton but also through pion clouds as depicted in Fig. 15.16a, and yet the strength of the charge is identical to that of the electron, which does not have such a complication. The identity is guaranteed by current conservation:

$$\partial_\mu J_{em}^\mu = \partial_\mu(\bar p\gamma^\mu p) = \partial_\mu\left[\bar\psi\gamma^\mu\left(\frac{1+\tau_3}{2}\right)\psi\right] = 0 \tag{15.98a}$$

Figure 15.16 Diagrams to show the universality of the coupling constant. The strength of the electromagnetic current is invariant under the strong interaction effect as depicted in (a) because of the current conservation. Similarly, the CVC hypothesis makes the weak charge invariant under the influence of strong interaction corrections (b). If ($e^-\bar\nu$) is replaced by $W^-$, the correspondence is more apparent.


Here, in moving from the second equality to the third, we used

$p = \frac{1 + \tau_3}{2} \psi$ , $\psi = \begin{bmatrix} p \\ n \end{bmatrix}$ , $\tau_3 = \begin{bmatrix} 1 & 0 \\ 0 & -1 \end{bmatrix}$ (15.98b)

Equations (15.98) have the form of Nishijima-Gell-Mann's rule in a differential
expression and $(1 + \tau_3)/2$ corresponds to $Y/2 + I_3$. Comparing (15.97) and (15.98),
we see the $\tau_3$ part of the electromagnetic current and the weak vector current
($\propto \tau_\pm$) have the same form. This suggests they are just different components of
the same isospin current

$j_I^\mu = \bar\psi \gamma^\mu \left( \frac{\boldsymbol{\tau}}{2} \right) \psi$ , $\partial_\mu j_I^\mu = 0$ (15.99)

This is referred to as the CVC (conserved vector current) hypothesis [167].



Chiral Invariance The coefficient of the axial vector current ($\gamma^\mu \gamma^5$) of the nucleon
differs from that of the lepton by 26%. If this is regarded as the result of the strong
interaction correction, it is logical to believe that, at the quark level, the couplings
must be equal. This goes one step beyond the stage where the nuclear $\beta$ decay
$(Z, A) \to (Z + 1, A)$ was considered as $n \to p$: $\beta$ decay should be considered
as a $d \to u$ process. At this most fundamental level, it is conceivable that the weak
interaction is described by

$\mathcal{L}_{WI} = \frac{G}{\sqrt{2}} [\bar u \gamma^\mu (1 - \gamma^5) d] [\bar e \gamma_\mu (1 - \gamma^5) \nu_e]$ (15.100)

This is referred to as the V-A interaction. Later Adler [5] and Weisberger [384] used
the PCAC (partially conserved axial vector current) hypothesis 10) and succeeded in
deriving $C_A / C_V = 1.26$ in terms of the $\pi$-N scattering cross section, thereby confirming
the above conjecture. Equation (15.100) has a simple and beautiful form and
agrees with what can be derived theoretically from the chiral invariance [355, 356].
It requires that the equation of motion should be invariant under the chiral
transformation

$\psi \to \psi' = \exp(-i\gamma^5 \alpha)\psi$ , $\psi^\dagger \to \psi'^\dagger = \psi^\dagger \exp(i\gamma^5 \alpha)$ (15.102)

Note, it is a gauge transformation but changes the phase of the left- and right-handed
components in the opposite direction. The V-A type was also inferred from
the two-component neutrino theory [144] and mass inversion requirement [342].
Namely, the V-A interaction has a certain theoretical foundation as a basic starting
point of the equation of motion. Since

$\psi_L = \frac{1}{2}(1 - \gamma^5)\psi$ , $\psi_R = \frac{1}{2}(1 + \gamma^5)\psi$ , $\psi = \psi_L + \psi_R$ (15.103)

the Lagrangian for the free Dirac particle has the form

$\mathcal{L}_{free} = \bar\psi (i\gamma^\mu \partial_\mu - m)\psi$

$= i\bar\psi_L \gamma^\mu \partial_\mu \psi_L + i\bar\psi_R \gamma^\mu \partial_\mu \psi_R - m(\bar\psi_L \psi_R + \bar\psi_R \psi_L)$ (15.104)

The second equality can be derived using Eq. (15.103) and the anticommutativity of
$\gamma^5$ with all other $\gamma^\mu$'s. It is easy to verify that the vector and axial vector currents
are invariant under the chiral transformation, i.e. $\bar\psi' \gamma^\mu \psi' = \bar\psi \gamma^\mu \psi$, $\bar\psi' \gamma^5 \gamma^\mu \psi' = \bar\psi \gamma^5 \gamma^\mu \psi$.
However, the mass term transforms as

$\bar\psi \psi \to \bar\psi' \psi' = \psi^\dagger e^{i\alpha\gamma^5} \gamma^0 e^{-i\alpha\gamma^5} \psi$

$= \bar\psi e^{-2i\alpha\gamma^5} \psi = \cos 2\alpha \, \bar\psi\psi - i \sin 2\alpha \, \bar\psi \gamma^5 \psi$ (15.105)

that is, it is not invariant. Namely, chiral invariance requires the vanishing of the
fermion mass, which, in turn, makes the left- and right-handed components
independent, as shown by Eq. (15.104), in the absence of interactions to couple left-
and right-handed components. One notices that a Yukawa-type interaction
$\mathcal{L}_{Yukawa} \sim (g\bar\psi\psi + g'\bar\psi\gamma^5\psi)\phi$ mixes the L-R components but current-type interactions such
as $\mathcal{L}_{EM} \sim e\bar\psi\gamma^\mu\psi A_\mu$, or the four-fermion interaction, do not. The chiral transformation
can be considered as independent gauge transformations for left and right
components and is referred to as the chiral gauge transformation:

$\psi_L \to \psi'_L = e^{-i\alpha_L}\psi_L$ , $\psi_R \to \psi'_R = e^{-i\alpha_R}\psi_R$ (15.106)
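
The behaviour of the mass term under the chiral transformation can be checked numerically. The sketch below is only an illustration: it assumes the Dirac representation ($\gamma^0 = \mathrm{diag}(1, 1, -1, -1)$, $\gamma^5$ exchanging upper and lower components), an arbitrary test spinor and an arbitrary angle, and the helper names are hypothetical.

```python
import math


def gamma0(psi):
    # Dirac representation (assumed): gamma^0 = diag(1, 1, -1, -1)
    return [psi[0], psi[1], -psi[2], -psi[3]]


def gamma5(psi):
    # Dirac representation (assumed): gamma^5 swaps upper and lower components
    return [psi[2], psi[3], psi[0], psi[1]]


def bar_product(chi, psi):
    # chi-bar psi = chi^dagger gamma^0 psi
    return sum(c.conjugate() * p for c, p in zip(chi, gamma0(psi)))


def chiral_rotate(psi, alpha):
    # exp(-i gamma^5 alpha) = cos(alpha) - i gamma^5 sin(alpha), since (gamma^5)^2 = 1
    g5psi = gamma5(psi)
    return [math.cos(alpha) * a - 1j * math.sin(alpha) * b
            for a, b in zip(psi, g5psi)]


psi = [0.3 + 0.1j, -0.7j, 1.2 - 0.4j, 0.5 + 0.9j]   # arbitrary test spinor
alpha = 0.37                                        # arbitrary angle
psi_prime = chiral_rotate(psi, alpha)

lhs = bar_product(psi_prime, psi_prime)
rhs = (math.cos(2 * alpha) * bar_product(psi, psi)
       - 1j * math.sin(2 * alpha) * bar_product(psi, gamma5(psi)))
print(abs(lhs - rhs))
```

The two sides agree to rounding error, while the transformed mass term differs from the untransformed one for generic $\alpha$, which is the non-invariance stated in the text.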

10) ... one may say it is almost conserved. In a loose sense, this is what PCAC means. In a more
rigorous definition, it is expressed as [162]

$\partial_\mu j_A^\mu = f_\pi m_\pi^2 \phi_\pi$ (15.101)

which is conserved in the limit $m_\pi \to 0$. Here, $f_\pi$ is the decay constant defined in
Eq. (15.44) and $\phi_\pi$ is the pion field.

Problem 15.6

Show that $\alpha$ in Eq. (15.105) and $\alpha_L$, $\alpha_R$ in Eq. (15.106) are connected by
$\alpha = (\alpha_R - \alpha_L)/2$.

The fact that the weak Lagrangian has the form that is required by chiral gauge 
invariance has a fundamental significance for an understanding of the weak inter- 
action. It means that the fermion mass has to vanish and that the left- and right- 
handed components are independent, i.e. different particles. Indeed, if the weak 
force is generated by the weak charge, only the left-handed component carries it.
The CVC hypothesis and chiral invariance provide an important clue for the unifi- 
cation of the electromagnetic and weak forces. 

Experimental Test of CVC 

(i) Weak Magnetism: Using Lorentz invariance, a general expression for the proton
electromagnetic current can be written as

$J_{em}^\mu = \langle p' | \bar p \gamma^\mu p | p \rangle$

$= \bar u(p') \left[ F_{p1}(q^2)\gamma^\mu + \frac{\kappa_p}{2m_p} F_{p2}(q^2) i\sigma^{\mu\nu} q_\nu \right] u(p)$ (15.107)

$q^2 = (p - p')^2$ , $F_{p1}(0) = F_{p2}(0) = 1$

If the proton is a point particle, i.e. a Dirac particle without higher order
corrections, $F_{p1}(q^2) = 1$, $F_{p2}(q^2) = 0$. $\kappa_p$ is the anomalous magnetic moment of
the proton in units of the nuclear magneton $e/2m_p$. Note that the term proportional to
$q^\mu = p^\mu - p'^\mu$ vanishes by current conservation and that proportional to $p^\mu + p'^\mu$
can be expressed by the above two terms using the Dirac equation for $u(p')$, $u(p)$
[see Eq. (8.5)]. The electromagnetic current of the neutron can be obtained by
changing the subscript $p \to n$. Since the neutron has vanishing electric charge
but finite anomalous magnetic moment, $F_{n1}(0) = 0$, $F_{n2}(0) = 1$. Comparing
Eq. (15.107) with Eq. (15.98), we see the $\tau_3$ part of Eq. (15.98) may be written as

$J_{em}^\mu = \bar\psi \left[ F_{V1}(q^2)\gamma^\mu + \frac{\kappa_V}{2m_p} F_{V2}(q^2) i\sigma^{\mu\nu} q_\nu \right] \frac{\tau_3}{2} \psi$ (15.108a)



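$\kappa_V F_{V2} = \kappa_p F_{p2} - \kappa_n F_{n2}$ , $\kappa_V = \kappa_p - \kappa_n$ (15.108b)

$\mu_p = 1 + \kappa_p = 2.792847351(28)$ , $\mu_n = \kappa_n = -1.91302473(45)$ (15.108c)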

where the number in parentheses after each experimental value denotes the error in the
last two digits. The CVC hypothesis means the weak charged current Eq. (15.97) is
the same as the terms in square brackets of Eq. (15.108a). Namely, the vector part
of the weak current should be expressed as

$J_W^{+\mu} = \bar\psi \left[ F_{V1}(q^2)\gamma^\mu + \frac{\kappa_V}{2m_p} F_{V2}(q^2) i\sigma^{\mu\nu} q_\nu \right] \tau_+ \psi$ (15.109)

Therefore, if the second term (referred to as weak magnetism) in the square brackets
of Eq. (15.109) can be shown to be the same as the anomalous magnetic moment
of the nucleon, the CVC hypothesis is proved. As the term is proportional to
$q$, it does not contribute to the allowed transition of $\beta$ decay and one has to turn to
the forbidden transition to discuss the matter. Since we do not want to go into too
many details of nuclear theory, we do not discuss the CVC hypothesis in $\beta$ decay
any further here. Historically, however, the weak magnetism was measured [259]
and provided the first evidence of the CVC hypothesis.

Another test is to perform high-energy neutrino scattering by a proton ($\bar\nu_l + p \to l^+ + n$)
and see if it has the same $q^2$ dependence as $ep$ elastic scattering ($e + p \to e + p$) [9].
Here, we consider the effect of CVC in pion decay.
(ii) $\pi^+ \to \pi^0 + e^+ + \nu$: The electromagnetic current of $\pi^\pm$ is proportional to
the third component of the isospin by the Nishijima-Gell-Mann rule. Considering
Lorentz invariance and separating the coupling constant, one finds

$J_{em}^\mu = \langle \pi^+(k') | j_{EM}^\mu(x) | \pi^+(k) \rangle$

$= [F_1(q^2)(k'^\mu + k^\mu) + F_2(q^2)(k'^\mu - k^\mu)] \langle \pi^+ | I_3 | \pi^+ \rangle e^{-i(k - k') \cdot x}$ , $q^2 = (k' - k)^2$ (15.110)

where $k$, $k'$ are the initial and final momenta of the pion. The form factor $F_1$ is
normalized to become 1 in the static limit ($q^2 = 0$). Current conservation requires

$\partial_\mu J_{em}^\mu = 0$ $\therefore$ $F_1(q^2)(k'^2 - k^2) + F_2(q^2)(k' - k)^2 = 0$ (15.111)

Substitution of $k'^2 = k^2 = m_\pi^2$ makes $F_2(q^2) = 0$. Therefore the electromagnetic
current is expressed as

$J_{em}^\mu = (k'^\mu + k^\mu) F_1(q^2) \langle f | I_3 | i \rangle e^{-i(k - k') \cdot x}$ (15.112)

The CVC hypothesis requires

$J_W^\mu = (k'^\mu + k^\mu) F_1(q^2) \langle f | I_- | i \rangle$ (15.113)

with $\langle \pi^0 | I_- | \pi^+ \rangle = \sqrt{2}$, since $I_- |1, +1\rangle = \sqrt{2} |1, 0\rangle$ [Eqs. (E.3), (E.10)]. For the $\pi^+$
decay $q^2$ is small and the form factor can be approximated by $F_1(q^2) \simeq F_1(0) = 1$.
We have the matrix element

$M(\pi^+ \to \pi^0 e^+ \nu) = \frac{G_\beta}{\sqrt{2}} \sqrt{2} (k'^\mu + k^\mu) \bar u(p_e) \gamma_\mu (1 - \gamma^5) v(p_\nu)$ (15.114a)

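Then the decay rate can be calculated as

$d\Gamma = (2\pi)^4 \delta^4(k - k' - p_e - p_\nu) \frac{\sum |M|^2}{2m_{\pi^+}} \frac{d^3k'}{(2\pi)^3 2\omega'} \frac{d^3p_e}{(2\pi)^3 2E_e} \frac{d^3p_\nu}{(2\pi)^3 2E_\nu}$ (15.114b)

The result is given in [104, 222]:

$\Gamma = \frac{G_\beta^2 \Delta^5}{30\pi^3} \left( 1 - \frac{\Delta}{2m_{\pi^+}} \right)^3 R\left( \frac{m_e}{\Delta} \right) = 0.393 \ \mathrm{s}^{-1}$ (15.115)

The observed value is $0.394 \pm 0.01 \ \mathrm{s}^{-1}$ and agrees with the CVC hypothesis very well.

Problem 15.7

As $\Delta = m_{\pi^+} - m_{\pi^0} \ll m_\pi$, we may neglect the $\pi^0$ momentum, which gives
$k^\mu + k'^\mu = (m_{\pi^+} + m_{\pi^0})\delta_{\mu 0}$. In this approximation, show that Eq. (15.114b) becomes

$\Gamma = \frac{G_\beta^2 \Delta^5}{30\pi^3} \frac{(1 - \Delta/2m_{\pi^+})^2}{1 - \Delta/m_{\pi^+}} R\left( \frac{m_e}{\Delta} \right)$ (15.116)

and agrees with Eq. (15.115) within a few %.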



15.6 

Strange Particle Decays 



15.6.1 

$\Delta S = \Delta Q$ Rule



Particles carrying strangeness may not have the same interaction as the familiar
hadrons composed of u and d quarks. So we shall see first if we can apply the same
rules to phenomena associated with strange particles if we replace the d quark by
the s quark. As the hadronic part of the Lagrangian of the $\beta$ decay was expressed as

$j^{+\mu} \sim \bar u \gamma^\mu (1 - \gamma^5) d$ (15.117)

it is natural to think that the decay of strange particles is induced by

$j^{+\mu} \sim \bar u \gamma^\mu (1 - \gamma^5) s$ (15.118)

with the leptonic part unchanged. Under this assumption, the relative ratio of $K^\pm \to e^\pm + \nu$
and $K^\pm \to \mu^\pm + \nu$ can be calculated in the same way as for the pion,
leading to the formula corresponding to Eq. (15.49):

$\frac{\Gamma(K \to e\nu)}{\Gamma(K \to \mu\nu)} = \frac{m_e^2}{m_\mu^2} \frac{(m_K^2 - m_e^2)^2}{(m_K^2 - m_\mu^2)^2}$ (15.119)

That the formula gives a value which agrees well with the experimental value
$(2.45 \pm 0.11) \times 10^{-5}$ supports the above assumption. If the hadronic weak current is
expressed by Eq. (15.118), it is an operator that annihilates s and creates u (and vice
versa). Since u carries $I = I_3 = 1/2$ and s carries $S = -1$, $I = 0$, we expect that in the
strangeness-changing reaction the relations

$s \to u$ : $Q: -\frac{1}{3} \to \frac{2}{3}$ , $I: 0 \to \frac{1}{2}$ , $S: -1 \to 0$ (15.120)

hold, which means that the quantum numbers of the hadronic part undergo

$\Delta S = \pm 1$ , $\Delta S = \Delta Q$ , $\Delta I = 1/2$ (15.121)

This is also obtained from Nishijima-Gell-Mann's law:

$Q = I_3 + \frac{1}{2}(B + S)$ (15.122)
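
The helicity-suppressed ratio of the two leptonic kaon decays discussed above can be evaluated in a few lines. This is a tree-level sketch only: the mass values are assumed inputs, radiative corrections are ignored, and the function name `helicity_ratio` is introduced here for illustration.

```python
# Particle masses in MeV (assumed input values).
M_E = 0.51099895
M_MU = 105.6583755
M_K = 493.677


def helicity_ratio(m_parent, m_e=M_E, m_mu=M_MU):
    """Tree-level Gamma(P -> e nu) / Gamma(P -> mu nu) for a pseudoscalar P."""
    mass_factor = (m_e / m_mu) ** 2
    phase_space = ((m_parent**2 - m_e**2) / (m_parent**2 - m_mu**2)) ** 2
    return mass_factor * phase_space


ratio_kaon = helicity_ratio(M_K)
print(f"{ratio_kaon:.3e}")
```

The result is of order $2.6 \times 10^{-5}$, about one standard deviation from the quoted experimental value before any radiative correction is applied.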
Consider the list of observed leptonic decays of the strange particles

$K^\pm \to \pi^0 + e^\pm + \nu$ (15.123a)

$K_L^0 \to \pi^\pm + e^\mp + \nu$ (15.123b)

$\Sigma^- \to n + e^- + \bar\nu$ (15.123d)

The regularity is obvious. They all satisfy $\Delta S = \Delta Q$. On the other hand, a reaction
with $\Delta S = -\Delta Q$ such as

$\Sigma^+ \to n + e^+ + \nu$ (15.124)

has not been observed and its decay rate relative to (15.123d) is below 0.009. It is
easy to understand if we think in terms of the constituent quarks. As $\Sigma^- = dds$, s
can be changed to u by emitting $W^-$, converting $\Sigma^-$ to $n = udd$, but $\Sigma^+ = uus$
can only be changed to uuu and not to $n = udd$. $K_L^0$ can be obtained either from
$K^0$ or $\bar K^0$ and it is a mixture of $S = 1$ ($d\bar s$) and $S = -1$ ($s\bar d$). Therefore, it should
be able to decay to both $\pi^\mp l^\pm \nu$. However, once $K_L$'s parent is identified as $K^0$
or $\bar K^0$, one of them violates the $\Delta S = \Delta Q$ rule and should be forbidden. One can
define an observable

$x = \frac{(\Delta S = -\Delta Q) \ \mathrm{rate}}{(\Delta S = \Delta Q) \ \mathrm{rate}} = \frac{\langle \pi^- l^+ \nu | T | \bar K^0 \rangle}{\langle \pi^- l^+ \nu | T | K^0 \rangle}$ (15.125)

where $K_L \to \pi^- l^+ \nu$ in the numerator and denominator are separately obtained
from $\bar K^0$ and $K^0$. The measured value gives [311]

$\mathrm{Re}(x) = -0.0018 \pm 0.006$ , $\mathrm{Im}(x) = -0.0012 \pm 0.002$ (15.126)

and is consistent with 0. A more comprehensive treatment is given in the next chapter.

15.6.2 

$\Delta I = 1/2$ Rule

We turn now to the $\Delta I = 1/2$ rule. It is easy to understand it in leptonic or semileptonic
decays in Eq. (15.123), but consider hadronic decays such as

$\Lambda \to p + \pi^-$ , $\Lambda \to n + \pi^0$ (15.127)

The transition amplitude may be written as

$T_{fi} = G_s \langle p | \bar u \gamma^\mu (1 - \gamma^5) s | \Lambda \rangle \langle \pi^- | \bar d \gamma_\mu (1 - \gamma^5) u | 0 \rangle$ (15.128)

where $G_s$ has been used instead of $G_\beta$, since it is to be determined by experiment.
The above expression contains both a $\Delta I = 1/2$ ($\bar u s$) and a $\Delta I = 1$ ($\bar d u$) part,
allowing both $\Delta I = 1/2$ and $\Delta I = 3/2$ transitions. Experimentally, however, the
$\Delta I = 1/2$ rule seems to be respected in hadronic decays as well as leptonic decays.
If one requires the rule, we should have

$\frac{\Gamma(\Lambda \to n\pi^0)}{\Gamma(\Lambda \to p\pi^-) + \Gamma(\Lambda \to n\pi^0)} = \frac{1}{3}$ (15.129)

Correcting for the phase space difference, the right-hand side becomes 0.345,
which is to be compared with the experimental value $0.358 \pm 0.005$.

The decay $K^+ \to \pi^+ \pi^0$ ($I = 2$) is a $\Delta I = 3/2$ process (Problem 15.8). It is
much less common than the $\Delta I = 1/2$ process $K^0 \to \pi\pi$ ($I = 0$):

(15.130) 



r(K° 



The validity of the $\Delta I = 1/2$ rule in hadronic decay is believed to originate from the
dynamics of the strong interaction, but it is not well understood yet and remains
a problem for QCD to solve.

Problem 15.8

Show that $\pi^+ \pi^0$ cannot have $I = 1$ when $J = 0$. Therefore, if $\Delta I = 1/2$ is strict,
$K^+ \to \pi^+ \pi^0$ is forbidden.

Problem 15.9

(a) Using the $\Delta I = 1/2$ rule show that

$\frac{\Gamma(K_S \to \pi^+\pi^-)}{\Gamma(K_S \to \pi^0\pi^0)} = \frac{\Gamma(K_L \to \pi^+\pi^-)}{\Gamma(K_L \to \pi^0\pi^0)} = 2$ (15.131a)

$\frac{\Gamma(\Xi^- \to \Lambda\pi^-)}{\Gamma(\Xi^0 \to \Lambda\pi^0)} = 2$ (15.131b)

(b) Assuming $a_+$, $a_-$, $a_0$ are decay amplitudes for $\Sigma^+ \to n\pi^+$, $\Sigma^- \to n\pi^-$,
$\Sigma^+ \to p\pi^0$, show that

Hint: Assume the existence of a hypothetical particle, the "spurion", carrying $I = 1/2$
and consider the decay as the scattering with the spurion where the isospin is
conserved (see Sect. 13.3.1 for isospin analysis).



15.6.3 

Decay Rate As a representative of decay processes satisfying $|\Delta S| = 1$, we pick
out $K^+ \to \pi^0 l^+ \nu_l$ ($l = e, \mu$). The reaction is similar to $\pi^+ \to \pi^0 e\nu$, Eq. (15.114a),
as far as the kinematics is concerned. Therefore, denoting the coupling constant as
$G_s$, the decay amplitude is given by

$T_{fi} = \frac{G_s}{\sqrt{2}} S^\mu l_\mu$

$S^\mu = \langle \pi^0(k_\pi) | \bar s \gamma^\mu (1 - \gamma^5) u | K^+(k_K) \rangle$

$= f_+(q^2)(k_K^\mu + k_\pi^\mu) + f_-(q^2)(k_K^\mu - k_\pi^\mu)$

$l_\mu = \bar u(p_\nu) \gamma_\mu (1 - \gamma^5) v(p_e)$ (15.132)

As both K and $\pi$ have negative parity, the axial vector current does not contribute.
$f_\pm$ are functions of $q^2 = (k_K - k_\pi)^2$ and are real if T invariance holds. As

$k_K - k_\pi = p_e + p_\nu$ , $k_K + k_\pi = 2k_K - (p_e + p_\nu)$ (15.133)

and multiplication of the leptonic current by the lepton momenta produces $m_l$,
$f_-$ is multiplied by $m_l$. Therefore, when $l = e$, the $f_-$ term can be neglected.
As an extension of the CVC hypothesis, let us assume that the vector part of the
strangeness-changing weak current is conserved, too. Then just like the isospin
current [SU(2) symmetry] was induced from CVC, SU(3) symmetry can be applied
here and $f_+(0)$ becomes $1/\sqrt{2}$. As SU(3) symmetry is not exact, the correction
changes the value slightly ($1 \to 0.982 \pm 0.008$ [263]). Since $q^2$ is a small number,
the form factor is usually parametrized as

$f_\pm(q^2) = f_\pm(0) [1 + \lambda_\pm (q^2/m_\pi^2)]$ (15.134)

in the experimental analysis. In the $K_{\mu 3}$ decay, both $f_+$ and $f_-$ can be determined,
but in $K_{e3}$ decay only $f_+$ is determined. We discuss $K_{e3}$ only here. The electron
energy spectrum and ($\pi^0 \nu$) angular correlation can be calculated from Eq. (15.132)

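$d\Gamma = (2\pi)^4 \delta^4(k_K - k_\pi - p_e - p_\nu) \frac{\sum |T_{fi}|^2}{2m_K} \frac{d^3k_\pi}{(2\pi)^3 2\omega_\pi} \frac{d^3p_e}{(2\pi)^3 2E_e} \frac{d^3p_\nu}{(2\pi)^3 2E_\nu}$ (15.135a)

$\sum |T_{fi}|^2 = 2G_s^2 |f_+|^2 \mathrm{Tr}[(\not p_e - m) \not k_K (1 - \gamma^5) \not p_\nu \not k_K (1 - \gamma^5)]$

$= 16 G_s^2 |f_+|^2 [2(k_K \cdot p_e)(k_K \cdot p_\nu) - m_K^2 (p_e \cdot p_\nu)]$ (15.135b)

Electron Spectrum To obtain the electron spectrum, we have to integrate over $k_\pi$
and $p_\nu$. We set $f_+(q^2) = f_+(0)$ in the following calculation. We need to calculate

$\int \frac{d^3p_\nu}{2E_\nu} \frac{d^3k_\pi}{2\omega_\pi} p_\nu^\mu \delta^4(p_\nu + k_\pi - a) = A a^\mu$ , where $a = k_K - p_e$ (15.136a)

The equality is deduced from Lorentz invariance. To find A, we multiply by $a_\mu$ on
both sides and, making use of $a^2 - m_\pi^2 = 2(a \cdot p_\nu)$, we obtain

$A = \frac{a^2 - m_\pi^2}{2a^2} \int \frac{d^3p_\nu}{2E_\nu} \frac{d^3k_\pi}{2\omega_\pi} \delta^4(p_\nu + k_\pi - a)$ (15.136b)

The integration can be performed easily in a frame where $\boldsymbol{a} = 0$:

$\int \frac{d^3p_\nu}{2E_\nu} \frac{d^3k_\pi}{2\omega_\pi} \delta^4(p_\nu + k_\pi - a) = \int \frac{d^3p_\nu}{4E_\nu \omega_\pi} \delta(E_\nu + \omega_\pi - a^0) = \frac{\pi}{2} \frac{a^2 - m_\pi^2}{a^2}$ (15.136c)

As a result we obtain

$A = \frac{\pi}{4} \left( \frac{a^2 - m_\pi^2}{a^2} \right)^2$ (15.136d)

Inserting Eq. (15.136d) in Eq. (15.135) gives

$d\Gamma = \frac{G_s^2 |f_+|^2}{(2\pi)^5 m_K} [2(k_K \cdot p_e)(k_K \cdot a) - m_K^2 (p_e \cdot a)] \, \pi \left( \frac{a^2 - m_\pi^2}{a^2} \right)^2 \frac{d^3p_e}{E_e}$ (15.137)

Figure 15.17 Comparison of V, S, T interactions in $K^+ \to \pi^0 e^+ \nu_e$. (a) The energy
spectrum of the decay electron [132]. Points: experimental data; curves: (a) vector spectrum
(with radiative corrections), (b) scalar spectrum, (c) tensor spectrum. (b) $\pi^0 \nu_e$ angular
correlation between $\pi^0$ and $\nu$ in the dilepton center of mass system [75].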

Using $a = k_K - p_e$, and evaluating variables in the kaon rest frame, we find

$2(k_K \cdot p_e)(k_K \cdot a) - m_K^2 (p_e \cdot a) = m_K^2 E_e (m_K - 2E_e)$ (15.138a)

$a^2 = m_K (m_K - 2E_e)$ (15.138b)

$a^2 - m_\pi^2 = 2m_K (E_{max} - E_e)$ , $E_{max} = \frac{m_K^2 - m_\pi^2}{2m_K}$ (15.138c)

where $E_{max}$ is the maximum energy of the decay electron and we have set $m_e = 0$.
Taking into account Eqs. (15.138), we finally obtain the energy spectrum of the
decay electron:

$\frac{d\Gamma}{dE_e} = \frac{G_s^2 |f_+|^2 m_K}{2\pi^3} \frac{E_e^2 (E_{max} - E_e)^2}{m_K - 2E_e}$ (15.139)

This formula agrees with experimental data (Fig. 15.17a).

We have confirmed that the vector-type interaction (with maximal parity-violating
component, of course) reproduces the experimental data very well. In deriving
Eq. (15.139), we assumed $f_+(q^2)$ = constant.

Form Factors of $K_{l3}$ How can we derive information on $f_+(q^2)$? Since
$q^2 = (k_K - k_\pi)^2 = m_K^2 + m_\pi^2 - 2m_K\omega_\pi$ is a function of only $\omega_\pi$, information on $f_+(q^2)$ can
be obtained by observing the energy spectrum of the pion. It can be calculated by
performing a similar calculation as for the electron spectrum. The formula is

$\frac{d\Gamma}{d\omega_\pi} = \frac{G_s^2 |f_+|^2}{12\pi^3} m_K k_\pi^3$ (15.140)

Then from $K_{e3}$ data one can obtain $\lambda_+$ and from $K_{\mu 3}$ data, $\lambda_-$. However, recently,
it has become more conventional to use [311]

$f_0(q^2) = f_+(q^2) + f_-(q^2) \frac{q^2}{m_K^2 - m_\pi^2}$ (15.141a)

$f_0(q^2) = f_0(0) \left( 1 + \lambda_0 \frac{q^2}{m_\pi^2} \right)$ (15.141b)

The present values of the observed $\lambda_+$, $\lambda_0$ [311] are given by

$\lambda_+ = 0.0296 \pm 0.0006$

$\lambda_0 = 0.0196 \pm 0.0012$ (15.142)

$\mathrm{Im}\,\xi = \mathrm{Im}\left( \frac{f_-}{f_+} \right) = -0.006 \pm 0.008$

The information on $\mathrm{Im}\,\xi$ comes from the T (time reversal) violating transverse
polarization $P_T$ of $\mu$ [226], where

$P_T \simeq \mathrm{Im}\,\xi \, \frac{m_\mu}{m_K} \frac{|p_\mu|}{E_\mu + |p_\mu| \cos\theta_{\mu\nu} - m_\mu^2/m_K}$ (15.143)

Interaction Type In Fig. 15.17, theoretical spectra for the interaction types S (scalar)
and T (tensor) are also given. They can be calculated starting from

$T_{fi} = \frac{G_s}{\sqrt{2}} \left[ 2m_K f_S \{\bar e (1 - \gamma^5) \nu\} + \frac{2f_T}{m_K} k_K^\mu k_\pi^\nu \{\bar e \sigma_{\mu\nu} (1 - \gamma^5) \nu\} \right]$ (15.144)

instead of Eq. (15.132). The angular correlation between $\pi^0$ and $\nu$ in the dilepton
center of mass frame is given by [75]



Figure 15.17b shows the $\pi^0 \nu_e$ angular correlation. It is clear from the figure that the V-A
interaction is equally valid for strangeness-changing weak interactions.

Strength of Strangeness-Changing Weak Decays The total decay rate can be obtained
from Eq. (15.139) by integrating over the electron energy:

$\Gamma_{K_{e3}} = \frac{G_s^2 f_+^2 m_K^5}{768\pi^3} f\left( \frac{m_\pi^2}{m_K^2} \right)$ (15.146)

$f(y) = 1 - 8y + 8y^3 - y^4 - 12y^2 \ln y$

$f(m_\pi^2/m_K^2) = 0.579$ for $m_\pi/m_K = 0.273$. Comparing Eq. (15.146) with the experimental
value $\tau_K = (1.2380 \pm 0.0021) \times 10^{-8}$ s, $\mathrm{BR}(K^+ \to \pi^0 e^+ \nu_e) = 0.0508 \pm 0.0005$, gives



Problem 15.10 

Derive Eqs. (15.140) and (15.146). 
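
The phase-space function $f(y)$ of Eq. (15.146) and the size of the strangeness-changing coupling can be checked numerically. The sketch below is an illustration under stated assumptions: it takes the massless-electron spectrum shape $E_e^2 (E_{max} - E_e)^2 / (m_K - 2E_e)$, constant $f_+ = 1/\sqrt{2}$, and assumed values for $\hbar$, $m_K$ and $G_\mu$; radiative, form-factor-slope and SU(3)-breaking corrections are ignored, and all function names are hypothetical.

```python
import math


def f_phase_space(y):
    """f(y) = 1 - 8y + 8y^3 - y^4 - 12 y^2 ln y, as quoted with Eq. (15.146)."""
    return 1 - 8 * y + 8 * y**3 - y**4 - 12 * y**2 * math.log(y)


def spectrum_integral(y, n=2000):
    """Simpson integral of x^2 (1 - y - x)^2 / (1 - x) over 0 <= x <= 1 - y.

    Here x = 2 E_e / m_K and y = (m_pi / m_K)^2; the integrand is the assumed
    massless-electron spectrum shape in dimensionless form. n must be even.
    """
    upper = 1 - y
    h = upper / n

    def g(x):
        return x**2 * (upper - x) ** 2 / (1 - x)

    total = g(0.0) + g(upper)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(i * h)
    return total * h / 3


y = 0.273**2
f_value = f_phase_space(y)
numeric = 12 * spectrum_integral(y)   # should reproduce f(y)

# Rough extraction of G_s * f_+ from the quoted lifetime and branching ratio.
HBAR = 6.582119569e-25   # GeV s (assumed constant)
M_K = 0.493677           # GeV (assumed mass)
G_MU = 1.1663787e-5      # GeV^-2 (assumed muon-decay coupling)
gamma_ke3 = 0.0508 / 1.2380e-8 * HBAR                  # partial width in GeV
gs_fplus = math.sqrt(gamma_ke3 * 768 * math.pi**3 / (M_K**5 * f_value))
ratio = gs_fplus * math.sqrt(2) / G_MU                 # uses f_+(0) = 1/sqrt(2)
print(f_value, numeric, gs_fplus, ratio)
```

With these inputs the numerical integral reproduces $f(y)$, and the ratio $G_s/G_\mu$ comes out between 0.2 and 0.25, i.e. $G_s^2$ is roughly 1/20 of $G_\mu^2$.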



15.6.4 

Cabibbo Rotation 

We have learned that the V-A interaction derived from the muon and nuclear $\beta$
decays is equally valid for strangeness-changing decays. However, the measured
coupling constant $G_s^2$ is approximately 1/20 of $G_\mu^2$ and is very small. This is also
true for other strangeness-changing decays. In other words, the coupling is universal
in the sense that the strength is equal for all strangeness-changing processes,
yet the universality does not apply between reactions with $\Delta S = 0$ and $\Delta S \neq 0$.
This very annoying situation was saved by the introduction of the Cabibbo rotation [84].
It frequently happens in quantum mechanics that a good quantum number
changes depending on what kind of interaction is active. For example, the orbital
angular momentum is a good quantum number when the central force is active
and spherical harmonic wave functions are suitable for the treatment of dynamics.
However, when spin-orbit coupling is active, the total angular momentum is the
good quantum number and the wave functions are mixtures of differing orbital
angular momentum. Cabibbo thought that when the weak force is active, a state
suitable for dynamic treatment may not necessarily be the mass eigenstate that is
observable as a particle. The state suitable for describing the dynamical behavior of
the particle is actually a mixture of mass eigenstates

$d' = \cos\theta_c \, d + \sin\theta_c \, s$ (15.148)

and the particle that participates in the weak interaction is not d but d', which is
referred to as the "weak eigenstate". Cabibbo proposed that the weak current should
be expressed as

$j_W^\mu = \bar u \gamma^\mu (1 - \gamma^5) d'$ (15.149)

This is referred to as Cabibbo rotation. According to this idea, the Fermi coupling
constant should be universal, but in experiments only the mass eigenstates are
picked up. Consequently, only a component of the particle that is contained in the
weak eigenstate is measured. The fraction is given by the square of the sine or cosine
of the Cabibbo rotation angle. In the strangeness-changing process the fraction is given by
$\sin^2\theta_c$, changing $G_\mu \to G_s = G_\mu \sin\theta_c$. From $K_{e3}$ data [75]

$\sin\theta_c = 0.220 \pm 0.002$ (15.150)

From $\Lambda$ $\beta$ decay [72], it was determined as

$\sin\theta_c = 0.231 \pm 0.003$ (15.151)

and on the average,

$\sin\theta_c = 0.221 \pm 0.002$ 11) (15.152)

Strictly speaking, the above value should be restricted to vector coupling only, but
other measurements including the axial current also confirmed the same value. In
summary, the V-A interaction can be applied equally well to all the strangeness-changing
weak interactions by introducing a single parameter, the "Cabibbo angle" $\theta_c$.

Cabibbo's proposal means that the coupling constant $G_\beta$ of the nuclear $\beta$ decay
is actually $G_\beta = G_\mu \cos\theta_c$. No Cabibbo rotation is necessary for the leptonic
current. Since there was a small but significant difference between $G_\beta$ and $G_\mu$, as was
shown in Eq. (15.89), identifying the difference as $\cos\theta_c$ gives the same value of
the Cabibbo angle and

$\cos\theta_c = 0.973 \pm 0.002$ 12) (15.153)

From now on we use $G_F$ to denote the strength of the universal four-Fermi interactions
instead of $G_\mu$ in honor of Fermi.
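
A quick consistency check of the two measured values is to see how close $\sin^2\theta_c + \cos^2\theta_c$ comes to unity. The sketch below uses the averages quoted in Eqs. (15.152) and (15.153) and assumes uncorrelated errors with first-order propagation, which is an assumption made here for illustration.

```python
import math

sin_c, d_sin = 0.221, 0.002   # Eq. (15.152)
cos_c, d_cos = 0.973, 0.002   # Eq. (15.153)

total = sin_c**2 + cos_c**2
# Uncorrelated first-order error propagation (an assumption).
d_total = math.sqrt((2 * sin_c * d_sin) ** 2 + (2 * cos_c * d_cos) ** 2)
pull = (total - 1.0) / d_total
print(f"{total:.4f} +/- {d_total:.4f}, pull = {pull:.2f}")
```

Under these assumptions the sum deviates from 1 by roughly one standard deviation, consistent with a rotation that closes within two generations.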

11) When the quark family is extended to three generations including (c, s), (t, b), the Cabibbo
rotation becomes the Kobayashi-Maskawa (KM) matrix. In this case $\sin\theta_c$ should be replaced
by $V_{12}$ or $V_{us}$. Numerically, however, the difference is very small.

12) For the same reason as in footnote 11, $\cos\theta_c$ should be replaced by $V_{11}$ or $V_{ud}$. From the
unitarity of the Kobayashi-Maskawa matrix

$|V_{11}|^2 + |V_{12}|^2 + |V_{13}|^2 = 1$ (15.154)

Hence $|V_{11}|^2 + |V_{12}|^2$ need not necessarily become unity, but it accidentally happened
that $|V_{13}|^2 \ll |V_{11}|^2, |V_{12}|^2$, and the Cabibbo rotation, which closes within the two
generations, worked well.

15.7

Flavor Conservation



15.7.1

GIM Mechanism



A careful reader might have already noticed that so far we have talked only about
the charged current weak interaction, i.e. that mediated by the charged $W^\pm$, where the
participating members change their electric charge. It is natural to imagine that
there is a neutral current interaction, and it is compulsory if the weak force carrier
W constitutes an SU(2) triplet, as is the case in the Standard Model. Indeed, the
existence of $Z^0$ and its associated reactions was among the major predictions of the
Glashow-Weinberg-Salam model, and subsequent experimental confirmation
established its validity. Historically and phenomenologically, the need for the neutral
current was not strong. The effect of the ordinary (strangeness conserving) neutral
current (i.e. $j_{NC}^\mu \sim \bar u \gamma^\mu u$, $\bar d \gamma^\mu d$) is hidden by much stronger electromagnetic
currents, hence it is not surprising that their effect was not observed. Moreover, there
was strong evidence of the absence (or suppression to be exact) of the strangeness-changing
neutral current. For instance, consider the kaon decays in Table 15.1 that
are induced by the charged current (CC) and the corresponding neutral current
interaction (NC). Their Feynman diagrams are depicted in Fig. 15.18. Table 15.1
suggests the absence of $Z^0$ for the diagrams (b) and (d) in Fig. 15.18. However,
there was no reason to believe in the nonexistence of the strangeness-changing
neutral current and it was a mystery at that time. In 1970 Glashow, Iliopoulos and
Maiani [174] proposed the existence of the fourth quark, and that it should make a
doublet with s', an orthogonal state to the Cabibbo-rotated d':

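$d' = d\cos\theta_c + s\sin\theta_c$ , $s' = -d\sin\theta_c + s\cos\theta_c$ (15.155)

The Cabibbo-rotated d' makes a neutral current

$\bar d' d' = \bar d d \cos^2\theta_c + \bar s s \sin^2\theta_c + \sin\theta_c \cos\theta_c (\bar s d + \bar d s)$ (15.156)

where the space-time structure was omitted in the above equation. Equation (15.156)
contains the strangeness-changing current $\sim \bar s d + \bar d s$, which should induce diagrams
(b) and (d) of Fig. 15.18. But if a state expressed as in Eq. (15.155) exists, the
neutral current that $\bar s' s'$ makes

$\bar s' s' = \bar d d \sin^2\theta_c + \bar s s \cos^2\theta_c - \sin\theta_c \cos\theta_c (\bar s d + \bar d s)$ (15.157)

cancels the strangeness-changing part of $\bar d' d'$ and together with $\bar d' d'$ the neutral
current becomes

$\bar d' d' + \bar s' s' = \bar d d + \bar s s$ (15.158)

Table 15.1 Suppression of the strangeness-changing neutral current.

CC                            BR                 NC                            BR
$K^+ \to \mu^+ \nu_\mu$       63.54 ± 0.14%      $K_L^0 \to \mu^- \mu^+$       $(6.84 \pm 0.11) \times 10^{-9}$
$K^+ \to \pi^0 e^+ \nu_e$     5.08 ± 0.05%       $K^+ \to \pi^+ \nu \bar\nu$   $1.5^{+1.3}_{-0.9} \times 10^{-10}$ a)

a) The experimental value is for $K^+ \to \pi^+ + \sum_i \nu_i \bar\nu_i$.

Figure 15.18 The charged current and strangeness-changing neutral current decays of the
kaon: (a) $K^+ \to \mu^+ \nu_\mu$, (b) $K_L^0 \to \mu^- \mu^+$, (c) $K^+ \to \pi^0 e^+ \nu_e$ and (d) $K^+ \to \pi^+ \nu \bar\nu$.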



The absence of the strangeness-changing current is explained. This is called the 
GIM mechanism. There had been other predictions of the fourth quark as early as 
1964, but its existence was seriously considered only after the GIM proposal. The 
charm quark was subsequently discovered in 1974. 
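
The cancellation behind the GIM mechanism is just the orthogonality of the Cabibbo rotation, and it can be made explicit in a few lines. In the sketch below the neutral current is represented by the 2 × 2 coefficient matrix of $\bar q_i q_j$ with $q = (d, s)$; the function names and the input $\sin\theta_c = 0.221$ are choices made here for illustration.

```python
import math


def neutral_current_matrix(theta_c):
    """Coefficients M[i][j] of q_i-bar q_j in d'-bar d' + s'-bar s', q = (d, s).

    With (d', s') = U (d, s) and U the Cabibbo rotation, M = U^T U.
    """
    c, s = math.cos(theta_c), math.sin(theta_c)
    u = [[c, s],      # d' =  d cos(theta_c) + s sin(theta_c)
         [-s, c]]     # s' = -d sin(theta_c) + s cos(theta_c)
    return [[sum(u[k][i] * u[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def d_prime_only(theta_c):
    """Same coefficients when only d'-bar d' is present (no partner s')."""
    c, s = math.cos(theta_c), math.sin(theta_c)
    row = [c, s]
    return [[row[i] * row[j] for j in range(2)] for i in range(2)]


theta_c = math.asin(0.221)
with_gim = neutral_current_matrix(theta_c)
without_gim = d_prime_only(theta_c)
print(without_gim[0][1], with_gim[0][1])
```

Without the s' term the off-diagonal (strangeness-changing) coefficient is $\sin\theta_c \cos\theta_c$; once s' is included it vanishes and the diagonal coefficients become 1.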

The reason why the strangeness-changing neutral current exists, albeit very
suppressed, is the higher order charged current processes (Fig. 15.19). In the
Standard Model, $G_F \sim g_W^2/m_W^2$, and the value of $g_W$ is comparable with that of
e. 13) Therefore the contribution of higher order diagrams (Fig. 15.19) would be on
the order of $\alpha$, but the observed values are much smaller. This is because the GIM
mechanism is at work in the higher order contributions. There are processes
depicted in Fig. 15.19(b) or (d) corresponding to Fig. 15.19(a) or (c) that are obtained
by replacing c with u. Since the coupling of W to ud or cs is $\sim \cos\theta_c$ and that to
us or cd $\sim \sin\theta_c$ with differing sign, the contributions of (a) and (b) or (c) and (d)
are equal but with opposite sign. If the mass of the charm quark were the same as
that of the up quark, the cancellation would be perfect and the decays would not
happen. Assuming $m_c \gg m_u$ the decay amplitude can be calculated and gives [154]

$M(K^0 \to \mu\bar\mu) \sim \alpha G_F (\cdots)$ (15.159)

13) To be exact, $G_F/\sqrt{2} = g_W^2/(8m_W^2)$, ...

Figure 15.19 GIM mechanism at work in the higher order corrections. (a) and (b) are
diagrams for $K^0 \to \mu^- \mu^+$, (c) and (d) for ... If $m_c = m_u$, cancellation due to the GIM
mechanism should be complete and the above processes would be forbidden. The mass
difference makes a finite contribution to (a) + (b) or (c) + (d).



Adjusting the mass value to fit the experimental value, one can obtain $m_c \sim 1.5$ GeV.
The mass prediction was done before the charm quark was discovered.
The significance of the c quark discovery was not just the confirmation of the
fourth quark, it also established the doublet structure of the weak interaction and
at the same time explained the absence of the strangeness-changing neutral current.
Unlike the s quark, the c quark was predicted and anticipated. The discovery
was an important milestone in the construction of the Standard Model. In summary,
there is strong evidence for the absence of the strangeness-changing neutral
current (SCNC) and there is no compelling reason to believe in the existence of the
strangeness-conserving neutral current, at least from a phenomenological point of
view.


15.7.2 
Kobayashi-Maskawa Matrix 

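We know today that there are six quarks in three doublets with hierarchical structure
in the sense that the mass of the quarks increases with the generation (see
Table 19.1). The Cabibbo rotation is generalized to a 3 × 3 unitary transformation
matrix, which is referred to as the Cabibbo-Kobayashi-Maskawa (or CKM) matrix [239].
Namely, the W boson couples to the three doublets

$\begin{bmatrix} u \\ d' \end{bmatrix}$ , $\begin{bmatrix} c \\ s' \end{bmatrix}$ , $\begin{bmatrix} t \\ b' \end{bmatrix}$ (15.160a)

with equal strength, and they are mixtures of the mass eigenstate $D = (d, s, b)$:

$D' = \begin{bmatrix} d' \\ s' \\ b' \end{bmatrix} = U_{CKM} \begin{bmatrix} d \\ s \\ b \end{bmatrix} = U_{CKM} D$ (15.160b)

The unitarity of the CKM matrix is a GIM mechanism extended to the three generations:

$\bar d' d' + \bar s' s' + \bar b' b' = \bar D' D' = \bar D U^\dagger U D = \bar D D = \bar d d + \bar s s + \bar b b$ (15.161)

There is no flavor-changing neutral current of any kind; this is not limited to
strangeness only. Though there are nine independent variables in the 3 × 3 unitary
matrix, considering the quark field's freedom to change its phase, the number
is reduced to four: three rotation angles $\theta_{12}$, $\theta_{23}$, $\theta_{13}$ and one phase $\delta$. It is
conventional to express the CKM matrix in the form

$U_{CKM} = R_{23} I_D R_{13} I_D^\dagger R_{12}$

$= \begin{bmatrix} 1 & 0 & 0 \\ 0 & c_{23} & s_{23} \\ 0 & -s_{23} & c_{23} \end{bmatrix} \begin{bmatrix} c_{13} & 0 & s_{13} e^{-i\delta} \\ 0 & 1 & 0 \\ -s_{13} e^{i\delta} & 0 & c_{13} \end{bmatrix} \begin{bmatrix} c_{12} & s_{12} & 0 \\ -s_{12} & c_{12} & 0 \\ 0 & 0 & 1 \end{bmatrix}$

$= \begin{bmatrix} c_{12} c_{13} & s_{12} c_{13} & s_{13} e^{-i\delta} \\ -s_{12} c_{23} - c_{12} s_{23} s_{13} e^{i\delta} & c_{12} c_{23} - s_{12} s_{23} s_{13} e^{i\delta} & s_{23} c_{13} \\ s_{12} s_{23} - c_{12} c_{23} s_{13} e^{i\delta} & -c_{12} s_{23} - s_{12} c_{23} s_{13} e^{i\delta} & c_{23} c_{13} \end{bmatrix}$ (15.162)

$I_D = \mathrm{diag}(1, 1, e^{i\delta})$ , $s_{12} \simeq 0.220$ , $s_{23} \simeq 0.04$ , $s_{13} \simeq 0.004$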

Here, $R_{ij}$ is a rotation matrix on the ij plane and $c_{ij}$, $s_{ij}$ stand for $\cos\theta_{ij}$, $\sin\theta_{ij}$.
The Cabibbo angle is $\theta_c = \theta_{12}$, but the 2 × 2 Cabibbo rotation matrix worked well
because $c_{13} \simeq 1$, $s_{13} \simeq 0$. However, the $b \to u$ transition [actually $B^-(b\bar u) \to \pi^0(u\bar u) + e^- + \bar\nu_e$
etc.] has been observed, showing $s_{13} \neq 0$.
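
The factorized form of the CKM matrix can be multiplied out numerically to confirm unitarity and a few individual elements. This is a minimal sketch in plain Python: the mixing angles are the approximate values quoted with Eq. (15.162), while the phase value `delta = 1.2` and the helper names are arbitrary assumptions made here for illustration.

```python
import cmath
import math


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def dagger(a):
    return [[a[j][i].conjugate() for j in range(3)] for i in range(3)]


def ckm(s12, s23, s13, delta):
    """U = R23 * I_D * R13 * I_D^dagger * R12 with I_D = diag(1, 1, e^{i delta})."""
    c12, c23, c13 = (math.sqrt(1 - s * s) for s in (s12, s23, s13))
    r12 = [[c12, s12, 0], [-s12, c12, 0], [0, 0, 1]]
    r13 = [[c13, 0, s13], [0, 1, 0], [-s13, 0, c13]]
    r23 = [[1, 0, 0], [0, c23, s23], [0, -s23, c23]]
    phase = cmath.exp(1j * delta)
    i_d = [[1, 0, 0], [0, 1, 0], [0, 0, phase]]
    i_d_dag = [[1, 0, 0], [0, 1, 0], [0, 0, phase.conjugate()]]
    u = r23
    for m in (i_d, r13, i_d_dag, r12):
        u = matmul(u, m)
    return u


s12, s23, s13, delta = 0.220, 0.04, 0.004, 1.2   # delta is an arbitrary test value
u = ckm(s12, s23, s13, delta)
unit = matmul(dagger(u), u)
max_dev = max(abs(unit[i][j] - (1 if i == j else 0))
              for i in range(3) for j in range(3))
print(max_dev, abs(u[0][1]), abs(u[0][2]))
```

The product $U^\dagger U$ equals the identity to rounding error, which is the statement that no flavor-changing neutral current survives, and $|U_{13}| = s_{13}$ is tiny compared with $|U_{12}|$.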

It is worth noting that the motivation for introducing the CKM matrix was 
not merely an extension from two to three generations. Indeed, Kobayashi and 
Maskawa proposed it in 1973 before the discovery of the charm quark. Its moti- 
vation was to introduce a complex phase to explain CP violation, which will be 
explained in the next chapter in more detail. 

15.7.3 



The $\tau$ lepton was discovered in 1975 [318] and is one of the few elementary particles
that can be observed in its bare form. The decay of the $\tau$ has many features in
common with muon decay, and is suitable for investigating the universality of the
weak interaction. We shall describe a few properties of $\tau$ and how to determine
them, then we discuss general features of the weak interaction in the lepton sector.

Production If there is enough energy ($\sqrt{s} > 2m_\tau$), a $\tau$ pair can be produced from
the reaction

$e^- + e^+ \to \tau^- + \tau^+$

Figure 15.20 Topology of $e^- + e^+ \to \tau^- + \tau^+$ production.
$\tau$ decays to $l + \bar\nu_l + \nu_\tau$ or $\nu_\tau$ + hadrons. As $\tau$'s are short lived,
only decay products of the pair are observed. A typical topology ...

Decay patterns of the $\tau$ lepton are written as

$\tau^\pm \to \nu_\tau + (W^\pm)$

$(W^\pm) \to l^\pm + \nu_l$ ($l = e, \mu$)

$(W^\pm) \to q_i \bar q_j \to$ hadrons : $q_i \bar q_j = u\bar d, u\bar s$, etc.

where ($W$) is virtual. As the $\tau$ lepton has a very short lifetime ($(2.910 \pm 0.015) \times 10^{-13}$ s),
what are observed are decay products of $\tau^- \tau^+$ pairs (Fig. 15.20). If both
taus decay leptonically, the topology consists of an acoplanar 14) lepton pair ($e^- e^+$,
$e^\mp \mu^\pm$, $\mu^- \mu^+$) and it is easily recognized. If only one $\tau$ decays leptonically, the
topology is a single lepton in one hemisphere and a few hadrons in the other
hemisphere, another combination easy to recognize.

Mass and Spin The production cross section of a spin 1/2 particle is given by [see
Eq. (7.38)]

$$
\sigma_{1/2} = \frac{4\pi\alpha^2}{3s} \tag{15.165}
$$

The equation is common to $ee \to \mu\mu, \tau\tau$ if $s \gg (2m_l)^2$. Immediately above the
threshold, however, the effect of the produced mass or nonrelativistic correction is
necessary. If we write the velocity and the Lorentz factor of the particle as $\beta = p_\tau/E_\tau$,
$\gamma = (1 - \beta^2)^{-1/2}$, the cross section is modified as (see Problem 7.2)

$$
\sigma_{\mathrm{TOT}} = \sigma_{1/2}\,\frac{\beta(3 - \beta^2)}{2} \tag{15.166}
$$

In order to determine the spin of $\tau$, theoretical cross sections for different spins
were considered [370, 371]:

$$
\sigma_{\tau\tau} = \sigma_{1/2}\,F(\beta) \tag{15.167a}
$$

$$
F(\beta) =
\begin{cases}
\beta^3/4 & s = 0 \\
\beta(3 - \beta^2)/2 & s = 1/2 \\
\beta^3(3 + 4\gamma^2)/4 & s = 1,\ \kappa = 0 \\
\beta^3(3 + 2\gamma^2 + 4\gamma^4)/4 & s = 1,\ \kappa = 1 \\
\beta(15\gamma^{-2} + 30\beta^2 + 40\beta^4\gamma^2 + 16\beta^6\gamma^4)/9 & s = 3/2,\ A = 0
\end{cases}
\tag{15.167b}
$$

14) Acoplanar means that one of the pair is not on a plane made by the incoming $e^-e^+$ and the other member of the pair.

where the magnetic moment of the spin 1 particle is defined as $\mu = (1 + \kappa)e/2M$.
Note that a fermion-antifermion pair is produced in S-wave ($\sigma \propto |p|$) and the
cross section rises sharply at the threshold as a function of energy, while that of
boson pairs is in P-wave ($\sigma \propto |p|^3$). This is due to parity conservation in the $e^-e^+$
reaction, where the dominant contribution comes from one-photon exchange, which
has negative parity. The boson pairs have positive intrinsic parity, hence S-wave is
forbidden.

Figure 15.21 compares the measured cross section with theoretical predictions
for various spins [317]. It is clear that $\tau$ has $s = 1/2$. Fitting the cross section with
the theoretical curve immediately above the threshold gave the mass as

$$
m_\tau = 1777.0 \pm 0.3\ \text{MeV} \tag{15.168}
$$

Figure 15.21 Comparison of the total production cross section $ee \to \tau\tau$ with theoretical
predictions for various spins. $R_{\tau\tau} = \sigma(ee \to \tau\tau)/\sigma(ee \to \mu\mu)$ [35, 317].
Coupling Type As the $\tau$ has a large mass, it has many decay modes. Among them,
purely leptonic modes

$$
\tau^- \to e^-\bar{\nu}_e\nu_\tau, \quad \mu^-\bar{\nu}_\mu\nu_\tau
$$

have the same kinematical structure as muon decay. Therefore, investigation of
the energy spectrum of the decay leptons allows the Michel parameters to be
determined (Fig. 15.22a,b, [28, 311]):

$$
\rho = 0.745 \pm 0.008
$$

The value agrees with that of the V-A interaction.



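Universality The leptonic decay rate can be calculated using the identical formula
as for muon decay [Eq. (15.84)] with the mass replaced by $m_\tau$:

$$
\Gamma(\tau^- \to e^-\bar{\nu}_e\nu_\tau) = \frac{G_\tau^2 m_\tau^5}{192\pi^3} \tag{15.171}
$$

Then the lifetime of $\tau$ is given by

$$
\tau_\tau = \frac{\mathrm{BR}(\tau^- \to e^-\bar{\nu}_e\nu_\tau)}{\Gamma(\tau^- \to e^-\bar{\nu}_e\nu_\tau)}
= \tau_\mu \left(\frac{G_\mu}{G_\tau}\right)^2 \left(\frac{m_\mu}{m_\tau}\right)^5 \mathrm{BR}(\tau^- \to e^-\bar{\nu}_e\nu_\tau) \tag{15.172}
$$

Setting $G_\mu = G_\tau$ and inserting the observed values $m_\tau = 1777.0 \pm 0.3$ MeV,
$m_\mu = 105.658389 \pm 0.000034$ MeV, $\tau_\mu = (2.19703 \pm 0.00004) \times 10^{-6}$ s, BR =
$0.1783 \pm 0.0008$, one finds the calculated value agrees with the experimental value
of $\tau_\tau(\mathrm{exp}) = (2.910 \pm 0.015) \times 10^{-13}$ s within the error. Conversely, using the
experimental value to test the universality of the coupling constant gives [164, 169]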



- = 0.999 ± 0.006 



(15.173) 
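The lifetime comparison above can be checked numerically. The following is a minimal sketch, assuming the scaling $\tau_\tau = \tau_\mu (G_\mu/G_\tau)^2 (m_\mu/m_\tau)^5\,\mathrm{BR}$ with final-state lepton masses neglected; the function and constant names are illustrative and only the central values quoted in the text are used.

```python
# Central values quoted in the text (uncertainties ignored in this sketch).
M_TAU_MEV = 1777.0        # tau mass
M_MU_MEV = 105.658389     # muon mass
TAU_MU_S = 2.19703e-6     # muon lifetime in seconds
BR_TAU_TO_E = 0.1783      # branching ratio of tau -> e nu nu


def predicted_tau_lifetime(branching_ratio=BR_TAU_TO_E, g_mu_over_g_tau_sq=1.0):
    """Tau lifetime from muon-decay scaling.

    Assumes the decay rate scales as G^2 m^5 and neglects the masses
    of the final-state leptons (an approximation, not the full formula).
    """
    mass_factor = (M_MU_MEV / M_TAU_MEV) ** 5
    return TAU_MU_S * g_mu_over_g_tau_sq * mass_factor * branching_ratio


tau_pred = predicted_tau_lifetime()
print(f"predicted tau lifetime: {tau_pred:.3e} s")
```

With $G_\mu = G_\tau$ the sketch reproduces the measured lifetime to well within the quoted experimental error, which is the content of the universality statement.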
Figure 15.22 Comparison of the $\tau$ energy spectrum with V±A predictions. (a) $\tau \to e\nu\bar{\nu}$, (b)

If the weak interaction actually occurs through W exchange, then $G_\mu \sim g_e^2 g_\mu^2$ and
$G_\tau \sim g_e^2 g_\tau^2$. So the above universality applies to $g_\mu - g_\tau$, giving $g_\mu/g_\tau = 1.001 \pm
0.003$. By investigating the ratio of $\tau \to \mu\bar{\nu}_\mu\nu_\tau$ and $\tau \to e\bar{\nu}_e\nu_\tau$, we can also test
$g_\mu - g_e$ universality. Using

$$
\frac{\Gamma(\tau^- \to \mu^-\bar{\nu}_\mu\nu_\tau)}{\Gamma(\tau^- \to e^-\bar{\nu}_e\nu_\tau)}
= \frac{\mathrm{BR}(\tau^- \to \mu^-\bar{\nu}_\mu\nu_\tau)}{\mathrm{BR}(\tau^- \to e^-\bar{\nu}_e\nu_\tau)}
= \frac{g_\mu^2 g_\tau^2\, f(m_\mu^2/m_\tau^2)}{g_e^2 g_\tau^2\, f(m_e^2/m_\tau^2)} \tag{15.174}
$$

where $f(y)$ is given in Eq. (15.85) and comparing with the experimental value

$$
\frac{\mathrm{BR}(\tau^- \to \mu^-\bar{\nu}_\mu\nu_\tau)}{\mathrm{BR}(\tau^- \to e^-\bar{\nu}_e\nu_\tau)}
= \frac{(17.36 \pm 0.05)\%}{(17.85 \pm 0.05)\%} = 0.974 \pm 0.004 \tag{15.175}
$$

gives $g_\mu/g_e = 1.001 \pm 0.002$.
We may say the universality is very good.
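The extraction of $g_\mu/g_e$ can be sketched numerically. The block below assumes that the phase-space function of Eq. (15.85) has the standard form $f(y) = 1 - 8y + 8y^3 - y^4 - 12y^2\ln y$ (that equation is not reproduced in this section, so this is an assumption), and the electron mass value is likewise supplied here rather than taken from the text.

```python
import math

M_E_MEV = 0.510999        # electron mass (assumed value, not quoted in the text)
M_MU_MEV = 105.658389     # muon mass, as quoted in the text
M_TAU_MEV = 1777.0        # tau mass, as quoted in the text


def phase_space(y):
    """Standard phase-space correction for a massive charged lepton
    in the final state, with y = (m_lepton / m_parent)^2.
    Assumed to coincide with f(y) of Eq. (15.85)."""
    return 1 - 8 * y + 8 * y**3 - y**4 - 12 * y**2 * math.log(y)


# Ratio Gamma(tau -> mu nu nu) / Gamma(tau -> e nu nu) for g_mu = g_e.
ratio_pred = phase_space((M_MU_MEV / M_TAU_MEV) ** 2) / phase_space(
    (M_E_MEV / M_TAU_MEV) ** 2
)

ratio_exp = 0.974  # central value of Eq. (15.175)
g_mu_over_g_e = math.sqrt(ratio_exp / ratio_pred)

print(f"predicted ratio for equal couplings: {ratio_pred:.4f}")
print(f"g_mu / g_e = {g_mu_over_g_e:.4f}")
```

The muon mass alone accounts for almost all of the deviation of the branching-ratio quotient from unity, which is why the coupling ratio comes out so close to 1.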

Branching Ratio and Quark Counting As the mass of $\tau$ is large, there are many
hadronic decay modes. But as $m_c > m_\tau$, the hadronic decay modes are limited to

$$
\nu_\tau + d + \bar{u} \qquad \text{probability} = \cos^2\theta_C \tag{15.176a}
$$

$$
\nu_\tau + s + \bar{u} \qquad \text{probability} = \sin^2\theta_C \tag{15.176b}
$$

Owing to the Cabibbo rotation, $\tau$ decays to $d\bar{u}$ with probability $\cos^2\theta_C$ and to $s\bar{u}$ with
$\sin^2\theta_C$; their combined decay rate can be considered as one to a single channel $d'$.
At the quark level, the decay rate $W^- \to \bar{u}d'$ is the same as the leptonic decay
($W^- \to e^-\bar{\nu}_e$ or $\mu^-\bar{\nu}_\mu$) provided the mass of the quarks and leptons is neglected.
It is a good approximation since $m_e \sim m_u \sim m_d \sim$ a few MeV, $m_\mu \sim 100$ MeV.
Even the heaviest, $\mu$, is much lighter than $m_\tau$ ($\sim 2$ GeV). Since the quark has an
extra color degree of freedom, the decay rate to the quarks has to be tripled. Therefore
a total of five channels are open for $\tau$ decay. Assuming all the quarks hadronize,
the electronic and muonic decay modes are predicted each to be 1/5 = 20%. This
is very close to the observed value of 17.7%. It demonstrates a practical use of the
quark model, that simple quark counting gives fairly good agreement with
experiment. A more precise estimate considering the QCD correction gives much better
agreement [169].

15.7.4 

The Generation Puzzle 

From what we have analyzed so far, the three leptons $e, \mu, \tau$ have the same
properties except for their mass. Various kinds of reaction rates can be accurately
reproduced by simply changing the mass. Despite their similarity, they have their own
quantum number, referred to as lepton flavor. $(\nu_e, e^-)$ has e-quantum number,
$(\nu_\mu, \mu^-)$ $\mu$-number and $(\nu_\tau, \tau^-)$ $\tau$-number, which are conserved in the weak
interaction, and the three pairs do not mix. The fact that the three neutrinos are also
distinct, each having separate flavor, can be deduced from the following observations.

From the energy spectrum of the pure leptonic decay of the muon ($\mu^- \to e^-\bar{\nu}_e\nu_\mu$)
and the $\tau$ lepton ($\tau^- \to e^-\bar{\nu}_e\nu_\tau$, $\mu^-\bar{\nu}_\mu\nu_\tau$), we know there are at least two kinds of
neutral particles in each decay, which we call neutrinos. That $\nu_e$ and $\nu_\mu$ have
different flavor can be deduced from the absence of $\mu \to e\gamma$, as discussed in Sect. 15.4.4.
Using the same arguments and from the absence of $\tau \to \mu\gamma, e\gamma$, we can deduce
$\nu_\tau \neq \nu_\mu, \nu_e$. Other evidence comes from the fact that $\nu_e$, $\nu_\mu$ cannot produce $\tau$
directly. Precision tests of the leptonic flavors come from the absence of the following
decay modes:

BR($\mu \to e\gamma$) < 4.

BR($\mu \to eee$) < 1.

BR($\tau \to e\gamma$) < 1.

BR($\tau \to \mu\gamma$) < 4.

BR($\tau \to 3l$) < 3.

BR($Z \to l_1 l_2$) ($l_1 \neq l_2$) < $1.7 \times 10^{-6}$ ($l_1 = e$, $l_2 = \mu$)

(15.177)

Today, lepton flavor conservation is accepted and is built into the Standard Model.
The flavor does not mix in the Standard Model. If one adds a small mass$^{15)}$ to
the neutrino, which is a natural extension of, but nonetheless one step beyond,
the Standard Model, the mixing is proportional to $(m_\nu/m_W)^2$ [97], which is
extremely hard, to say the least, to detect in decay reactions. Why do leptons with
the same properties appear repeatedly? This is a question not resolved by the
Standard Model. It is the same old problem of Rabi's "who ordered it?" extended to the
$e$-$\mu$-$\tau$ puzzle. It is also referred to as the family or generation problem, because
the same question exists in the quark sector. Namely, $(c, s)$ and $(t, b)$ seem to be
replicas of $(u, d)$ except for their mass. Combining the quark and lepton doublets,
$(u, d, \nu_e, e^-)$, $(c, s, \nu_\mu, \mu^-)$ and $(t, b, \nu_\tau, \tau^-)$ constitute three generations.

One idea to resolve the puzzle is to consider that they belong to the same
symmetry group, called horizontal (or family) symmetry. Another is to consider that
leptons and quarks are composites of more fundamental particles. In this model,
$\mu$, $\tau$ are excited states of the electron. In modern string theory the family structure
arises from vacuum or space-time structure of extra dimensions. At present, there
are no hints as to how to differentiate between the models.

How Many Generations? So far we have discovered three generations of the
quark/lepton family. The Standard Model assumes there are no more. The
assertion is not supported by compelling evidence; however, there is an indication that
the number of generations is indeed limited to three. It came later, after the
Standard Model was established, but we will jump ahead. It comes from the absence of
an extra neutrino in the decay of $Z^0$. $Z^0$ is produced and decays in the following
reactions:

$$
e^- + e^+ \to Z^0 \to \nu_i\bar{\nu}_i,\ l_i\bar{l}_i,\ u\bar{u},\ d\bar{d},\ s\bar{s},\ c\bar{c},\ b\bar{b} \qquad (i = e, \mu, \tau) \tag{15.178}
$$

15) If there are mass differences between the neutrinos and if they mix, neutrino oscillation occurs
(see Sect. 19.2.1), as has been observed.
The Z boson ($m_Z = 91.2$ GeV) cannot decay to a top quark pair because the top
quark is too heavy ($m_{\mathrm{top}} = 171.2 \pm 1.2$ GeV). Production of quark pairs is observed
as hadrons. The neutrino pairs cannot be observed, but their decay branching ratios
can be accurately estimated from

$$
\sum_i \Gamma(Z \to \nu_i\bar{\nu}_i) = \Gamma_{\mathrm{tot}} - \sum_l \Gamma(Z \to l\bar{l}) - \sum_q \Gamma(Z \to q\bar{q}) \tag{15.179}
$$

where $\Gamma_{\mathrm{tot}}$ is the total decay width of Z, which can be determined from the
cross-section shape (Fig. 15.23). If there is an extra neutrino, the total decay rate $\Gamma_{\mathrm{tot}}$
changes, hence an observable branching ratio, for instance that to hadrons BR =
$\Gamma(Z \to \text{hadrons})/\Gamma_{\mathrm{tot}}$, does, too.

The data agree quite well with the assumption that Z decays to three neutrinos,
three charged leptons and five quarks without top but with three color degrees
of freedom. Figure 15.23 shows hadronic total cross sections. From the data the
number of types of neutrinos is determined to be

$$
N_\nu = 2.92 \pm 0.05 \tag{15.180}
$$
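The sensitivity of the hadronic cross section to the number of neutrino types can be illustrated with a rough lowest-order estimate. The sketch below uses the Breit-Wigner peak formula $\sigma^0_{\mathrm{had}} = (12\pi/m_Z^2)\,\Gamma_{ee}\Gamma_{\mathrm{had}}/\Gamma_{\mathrm{tot}}^2$; the partial widths are approximate Standard Model values supplied here as assumptions (they are not given in this section), and initial-state radiation, which lowers the measured peak, is ignored.

```python
import math

M_Z_GEV = 91.19            # Z mass
GAMMA_NU_GEV = 0.1671      # assumed width per neutrino species
GAMMA_LEPTON_GEV = 0.0840  # assumed width per charged lepton species
GAMMA_HAD_GEV = 1.744      # assumed total hadronic width (u, d, s, c, b)
GEV2_TO_NB = 0.3894e6      # (hbar c)^2 in nb GeV^2


def total_width(n_nu):
    """Total Z width for n_nu light neutrino species."""
    return GAMMA_HAD_GEV + 3 * GAMMA_LEPTON_GEV + n_nu * GAMMA_NU_GEV


def hadronic_branching_ratio(n_nu):
    return GAMMA_HAD_GEV / total_width(n_nu)


def peak_hadronic_xsec_nb(n_nu):
    """Lowest-order peak cross section for e+e- -> Z -> hadrons in nb."""
    sigma_gev = (
        12 * math.pi / M_Z_GEV**2
        * GAMMA_LEPTON_GEV * GAMMA_HAD_GEV / total_width(n_nu) ** 2
    )
    return sigma_gev * GEV2_TO_NB


for n in (2, 3, 4):
    print(n, f"BR(had) = {hadronic_branching_ratio(n):.3f}",
          f"sigma_peak = {peak_hadronic_xsec_nb(n):.1f} nb")
```

Each additional neutrino species lowers the peak by more than ten percent in this estimate, which is why the line shape discriminates so clearly between two, three and four types.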

The above argument does not exclude the possibility that a neutrino with mass
greater than $m_Z/2 \sim 45$ GeV or an exotic neutrino that does not couple to Z
(referred to as a "sterile neutrino") exists. However, the fourth generation, if it exists,
must include a neutrino with large mass (> 45 GeV). Considering all the known
neutrinos have nearly vanishing mass, it is considered an unlikely situation.

Figure 15.23 Hadronic total cross section vs number of neutrino types, $e^-e^+ \to
\sum_i q_i\bar{q}_i$ where $q_i = u, d, s, c, b$. The data agree with the Standard Model and the number of
neutrino types was determined to be $2.92 \pm 0.05$ [262].



15.8 

A Step Toward a Unified Theory 



15.8.1 

Organizing the Weak Phenomena 



In the Standard Model, the weak force is mediated by W bosons, but that was not
necessarily taken for granted from the beginning. The notion of the force carrier
in the weak interaction was seriously considered only when the near universality
of the coupling constant in the muon and the nuclear $\beta$ decay was realized [256].
With the establishment of the V-A interaction, it became the mainstream view. Let us
summarize what we have learned so far and see where the W boson fits in.
The phenomenological weak interaction can be reproduced by the Lagrangian

$$
\mathcal{L}_{WI} = -\frac{4G_F}{\sqrt{2}}\,(J_L^+)^\mu (J_L^-)_\mu \tag{15.181a}
$$

$$
(J_L^+)^\mu = (J_L^-)^{\mu\dagger} = \sum_i \bar{\Psi}_{iL}\gamma^\mu \tau_+ \Psi_{iL} \tag{15.181b}
$$

where $\tau_+$ is the charge-raising operator and not to be confused with the $\tau$ lepton.

$$
\Psi_i = \begin{bmatrix}\nu_e \\ e^-\end{bmatrix},\ \begin{bmatrix}\nu_\mu \\ \mu^-\end{bmatrix},\ \begin{bmatrix}\nu_\tau \\ \tau^-\end{bmatrix},\ \begin{bmatrix}u \\ d'\end{bmatrix},\ \begin{bmatrix}c \\ s'\end{bmatrix},\ \begin{bmatrix}t \\ b'\end{bmatrix} \tag{15.181c}
$$

$$
\psi_L = \frac{1}{2}(1 - \gamma^5)\psi \tag{15.181d}
$$

Namely, all the weak interactions of the elementary particles have the same
common coupling constant $G_F$ and are of V-A type. We can rephrase the term V-A
interaction to mean that only left-handed (or more rigorously negative chirality)
particles carry the weak charge that generates the force. Then it can be simply
stated that the weak force is a universal vector-type interaction. Fermi's intuition was
right after all.

At about the same time as Fermi proposed the weak theory, Yukawa proposed
the $\pi$ meson as the nuclear force carrier. He tried to explain the weak force using
the same $\pi$ but was unsuccessful. However, the existence of a force carrier is a
farsighted concept that is carried over to the modern Standard Model. If one
considers the weak force to be mediated by a vector boson (W) having a large mass, the
Fermi interaction can be reproduced as a second-order process, as we discussed
at the very beginning (Sect. 15.2). The characteristics of the weak force having a
vector force carrier with universal coupling constant is reminiscent of QED. One
naturally wonders whether the weak interaction has the same structure as the
electromagnetic force.

The Fermi coupling constant has dimensions of "mass$^{-2}$". In the old days, the
proton mass was used, making the value of $G_F \sim 10^{-5}/m_p^2$ very small. This is
the origin of the name "weak interaction". However, if $m_W$ ($\sim 80$ GeV) is used,
the strength of the coupling constant is about the same order as the
electromagnetic force. This was one of the first clues to the unified theory. In other words,
in the high-energy region, where the typical energy scale is comparable to or
larger than $m_W$, the effect of the weak force would be just as strong as that of the
electromagnetic force. The weak force is weak simply because the energy scale E
being considered is much smaller than the mass of the force carrier, reducing the
effective strength of the force in proportion to $E^2/m_W^2$.
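The comparison of scales can be made concrete with a few lines of arithmetic. The following is a sketch only: it assumes the conventional relation $G_F/\sqrt{2} = g_W^2/(8m_W^2)$ between the Fermi constant and a dimensionless coupling $g_W$, and the numerical inputs ($G_F$, $m_p$, $m_W$, $\alpha$) are standard values supplied here rather than quoted from the text.

```python
import math

G_F = 1.16638e-5      # Fermi constant in GeV^-2 (assumed standard value)
M_P = 0.93827         # proton mass in GeV
M_W = 80.4            # W boson mass in GeV
ALPHA = 1 / 137.036   # fine structure constant

# Dimensionless strength when the proton mass sets the scale.
gf_mp2 = G_F * M_P**2

# Dimensionless coupling when the W mass sets the scale,
# using the assumed convention G_F / sqrt(2) = g_W^2 / (8 m_W^2).
g_w_sq = 4 * math.sqrt(2) * G_F * M_W**2
alpha_w = g_w_sq / (4 * math.pi)


def effective_strength(energy_gev):
    """Effective dimensionless weak strength G_F * E^2 at energy scale E."""
    return G_F * energy_gev**2


print(f"G_F m_p^2        = {gf_mp2:.3e}")
print(f"g_W^2            = {g_w_sq:.3f}  (e^2 = {4 * math.pi * ALPHA:.3f})")
print(f"alpha_W / alpha  = {alpha_w / ALPHA:.2f}")
print(f"G_F E^2 at 1 GeV = {effective_strength(1.0):.1e}")
```

In this convention the weak coupling $g_W^2/4\pi$ is in fact somewhat larger than $\alpha$; the apparent weakness at low energy comes entirely from the suppression by the heavy W mass.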

Once we accept the existence of Ws, not only can all the weak interaction
dynamics be reproduced in an organized way but also the resemblance to QED is
quite apparent:

$$
\mathcal{L}_W = -\frac{g_W}{\sqrt{2}} \sum_i \left[\bar{\Psi}_{iL}\gamma^\mu \tau_+ \Psi_{iL} W^+_\mu + \text{h.c.}\right] \tag{15.182a}
$$

$$
\frac{g_W^2}{8m_W^2} = \frac{G_F}{\sqrt{2}} \tag{15.182b}
$$

The factor $1/\sqrt{2}$ is conveniently added for later discussions and has no special
meaning. In summary, all the weak reactions can be described in a simple and
beautiful form by taking into account

the quark model 
V-A interaction 
W ± as the force carrier 
the Cabibbo rotation 

Figure 15.24 describes how various weak reactions can be interpreted by the W
mediating processes. The V-A interaction type was determined around 1957. Until
then interpretations of $\beta$ decays were in great confusion, but the discovery of parity
violation prompted a rapid convergence within a year. 1957 was an epoch in the
history of the weak interaction. A big tide of aspiration for the discovery of the W
boson emerged, and from then on many accelerator construction proposals tried to
justify themselves by including the potential capability of W production. However,
the mass being unknown at that time, whether a proposed accelerator could really
produce it remained uncertain. Then finally came the Weinberg-Salam model,
which predicted the mass accurately.

The universal V-A theory could be said to be a summary of the pre-Weinberg-Salam
era, and it is very powerful phenomenologically. Even today, unless one
wants to discuss the direct production of W and related phenomena, it can give
satisfactory explanations for not only decay reactions but also neutrino scatterings.
The phenomenological success was a big factor in causing the Fermi theory to be
taken seriously despite its theoretical shortcomings, which will be described in the
following. Similarity with QED, including the universality and vector-type interaction
and the fact that the weak force shares part of it with the electromagnetic force
as exemplified in weak magnetism, provided crucial clues and a springboard for a
unified theory.

Figure 15.24 Reorganizing hadronic weak interactions in terms of quarks and W. All the
interactions are reproduced by a current made of $(u, d')$ (where $d' = d\cos\theta_C + s\sin\theta_C$),
$(\nu_e, e^-)$ and $(\nu_\mu, \mu^-)$ coupled to $W^\pm$. (a) The muon decay is elementary as it is.
When (b) $\beta$ decay, (c) $K^0 \to \pi^+\pi^-$ and (d) $\nu_\mu n \to \mu^- X$ (hadrons) are rewritten
using the quarks, they are represented by (e), (f) and (g), respectively. The coupling
constants are written as $g_d = g_W\cos\theta_C$, $g_s = g_W\sin\theta_C$ and $g_e = g_\mu = g_W$.

15.8.2 

Limitations of the Fermi Theory 



Although phenomenologically powerful, it is immediately apparent that the
four-Fermi V-A theory is not satisfactory from the theoretical point of view. For instance,
the scattering cross section of a neutrino by an electron ($\nu_e e^- \to \nu_e e^-$) can be
easily calculated using Eq. (15.181) to give

$$
\frac{d\sigma}{d\Omega}(\nu_e e^- \to \nu_e e^-) = \frac{G_F^2 s}{4\pi^2}, \qquad \sigma_{\mathrm{TOT}} = \frac{G_F^2 s}{\pi} \tag{15.183}
$$

The cross section increases in proportion to the square of the center-of-mass
energy, $s$. On the other hand, unitarity restricts the cross section for a given angular
momentum $J$ (unitary limit)

$$
\sigma_{\mathrm{tot}} \leq (2J + 1)\pi\bar{\lambda}^2 = (2J + 1)\pi/p^2 = 4\pi(2J + 1)/s \tag{15.184}
$$

As the four-fermion Lagrangian is a contact interaction, only the S wave
contributes, hence the theory is valid only if

$$
\sigma_{\mathrm{tot}} < \pi\bar{\lambda}^2 \ \to\ s < 2\pi/G_F \ \to\ p < 300\ \text{GeV} \tag{15.185}
$$
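The energy at which the Fermi cross section meets the unitary limit follows from a short calculation. The sketch below assumes the total cross section $\sigma = G_F^2 s/\pi$ and the S-wave bound $4\pi/s$, with $G_F$ supplied as a standard value and $p = \sqrt{s}/2$ taken for massless particles in the center-of-mass frame; all names are illustrative.

```python
import math

G_F = 1.16638e-5  # Fermi constant in GeV^-2 (assumed standard value)


def sigma_fermi(s):
    """Four-fermion total cross section G_F^2 s / pi, in GeV^-2."""
    return G_F**2 * s / math.pi


def sigma_unitarity(s, j=0):
    """Unitary limit 4 pi (2J + 1) / s for angular momentum J, in GeV^-2."""
    return 4 * math.pi * (2 * j + 1) / s


# Setting sigma_fermi = sigma_unitarity for J = 0 gives s = 2 pi / G_F.
s_max = 2 * math.pi / G_F
sqrt_s_max = math.sqrt(s_max)
p_max = sqrt_s_max / 2  # center-of-mass momentum of each massless particle

print(f"sqrt(s) at the bound: {sqrt_s_max:.0f} GeV")
print(f"momentum at the bound: {p_max:.0f} GeV")
```

The crossing lands at a center-of-mass momentum of a few hundred GeV, the same order as the round figure quoted in Eq. (15.185).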

One might think the introduction of W may save the situation. However, in the
calculation of higher order corrections, the W boson contributions to the intermediate
state have to be added or are integrated over the energy. In the limit $p \to \infty$, all the
masses can be neglected and the most divergent term in the correction generally
has the form (see for instance Eq. (8.2))

$$
\sim \int \frac{d^4p}{p^4}\, f(p) \tag{15.186}
$$

In the case of the weak force, the coupling constant $G_F$ has dimension $E^{-2}$, which
makes $f(p) \propto p^2$ and the integral diverges badly. The divergence at $p \to \infty$ is the
inevitable fate of quantum field theory, but there are two kinds of divergences, good
and bad. The four-Fermi interaction is a typical example of bad divergence. Good
divergence means that it can be removed by introducing a finite number of
correction terms (renormalizable theory) and gives correct results if treated with the right
recipe, as was clarified in QED. We saw that the anomalous magnetic moment of
the electron and muon was reproduced to an accuracy of $10^{-10}$ and that the
applicability of QED extends to the energy region as high as a few TeV. This fact provides
a solid foundation for the renormalizable theory that, despite its shortcomings, is
very close to the true theory. This is why QED is always considered as a model in
reaching the right theory. In QED, the coupling constant corresponding to $G_F$ is
dimensionless. If one chooses QED as a model, the coupling constant has to be
dimensionless, too. Therefore introduction of W seems the right step for remedying
the problem.

Introduction of W Let us introduce W and see that $g_W$ in the Lagrangian
Eq. (15.182) is indeed dimensionless. Let us consider $\nu_e + e^- \to \nu_e + e^-$ again. The
interaction Lagrangian has a similar form to the QED Lagrangian $\mathcal{L}_{\mathrm{QED}} = -q j^\mu A_\mu$:



That $g_W$ is dimensionless can be seen as follows. Pick up the mass term in the
Lagrangian of the vector field [see Eqs. (5.20d) and (5.22a)] and consider its physical
dimension:

$$
\left[\int d^3x\, \mathcal{L}_W\right] = \left[\int d^3x\, m\bar{\psi}\psi\right] = \left[\int d^3x\, m_W^2 W_\mu W^\mu\right] = [E] \tag{15.188}
$$

This gives $[\mathcal{L}] = [E^4]$, $[\psi] = [E]^{3/2}$, $[W_\mu] = [E]$, and comparing the dimensions of
the two sides of Eq. (15.187) proves $g_W$ is dimensionless.
Figure 15.25 Scattering diagram. (a) The lowest order, (b) Higher order.

Now we can investigate the effect of W exchange in $\nu_e$ scattering as shown in
Fig. 15.25. The matrix element of Fig. 15.25a is given by



ma.] 

..A (i-y 5 L - 



l flv -q fl q,./m 2 w ) 



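The differential cross section becomes

$$
\frac{d\sigma}{d\Omega} = \frac{G_F^2 s}{4\pi^2}\left(\frac{m_W^2}{m_W^2 - q^2}\right)^2 = \frac{(G_F^2/4\pi^2)\,s}{\left[1 + (s/m_W^2)(1 - \cos\theta)\right]^2} \tag{15.190}
$$

Integration over the solid angle gives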

OTor(v c e- -> r.O = ^ — * (15.191) 

$\sigma_{\mathrm{TOT}} \to$ constant, which is a great improvement compared to linear divergence in
Eq. (15.183).
However, if we calculate the S-wave scattering amplitude $f_0$

G r s f 1 

/„ = ^fl- / d COS e ; — ^- 

4jt J l + (s/m 2 w )(l-cos6) 

V WJ (15.192) 

= ^lnl + - 

we find this diverges logarithmically and the problem is not solved, at least not in 
the lowest order. What about higher order contributions? If we consider a diagram 
that contains two Ws in the intermediate states like the one in Fig. 15.25b, we need 
to add those for all polarization states of W as well as the internal momentum k. 
As the Feynman propagator of the W boson is given by 



Ml)' 



E A e."(A) e *(l)* _ 



r-q f, q"/rr, 



(15.193) 



(s\^ 
the contribution of Fig. 15.25b contains an integral of the form

$$
\sim g_W^4 \int d^4k\, \Delta_{F\rho\sigma}(k - q)\, \Delta_{F\mu\nu}(k)
\times \left[\bar{u}(p_3)\gamma^\rho S_F(p_1 - k)\gamma^\mu u(p_1)\right]\left[\bar{u}(p_4)\gamma^\sigma S_F(p_2 + k)\gamma^\nu u(p_2)\right] \tag{15.194}
$$

$$
S_F(p) = \frac{1}{(2\pi)^4}\,\frac{\gamma^\mu p_\mu + m}{p^2 - m^2 + i\varepsilon}
$$

If we pick up the dominant contribution at high energy ($k \to \infty$), it becomes
4 f j4h k p k"/m 2 w k"k"/m 2 w EJlEJt 

-)ifi^ 0(k2) (15 - 195) 

where $E^2$ arises from the normalization of the wave function and $(g_W^2 E^2/m_W^2)$ can
be taken away from the divergence consideration. The last expression corresponds
to $f(p) \sim p^2$ in Eq. (15.186). One notices that the divergent term is generated by
$q^\mu q^\nu$ in the numerator of the W propagator.

Origin of Divergences The origin of the divergent term is easy to understand. A
massive vector boson has three degrees of freedom in the spin components. The
polarization vector $\varepsilon^\mu(\lambda)$, $(\lambda = 1, \ldots, 3)$ in its rest frame can be taken as

$$
\varepsilon^\mu(1) = (0, 1, 0, 0) \tag{15.196a}
$$

$$
\varepsilon^\mu(2) = (0, 0, 1, 0) \tag{15.196b}
$$

$$
\varepsilon^\mu(3) = (0, 0, 0, 1) \tag{15.196c}
$$

Setting the z-axis along the momentum direction of the W, and Lorentz boosting
the particle to where W has $p^\mu = (\omega, 0, 0, q)$, we find the transverse polarizations
do not change but the longitudinal polarization is transformed to

$$
\varepsilon^\mu(3) = (q/m_W, 0, 0, \omega/m_W) \tag{15.197}
$$

We realize that the second term in the numerator of the propagator (15.193) comes
from the longitudinal polarization. This is equivalent to introducing the dimension
$[E]^{-2}$ in the coupling constant and induces a divergent integral. When the mass of
the force carrier is zero, it has only transverse polarization and the divergent term
is not generated.
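The boost of the longitudinal polarization vector can be verified directly. The following minimal sketch applies a Lorentz boost along the z-axis to the rest-frame vector $(0, 0, 0, 1)$ and checks that the result equals $(q/m_W, 0, 0, \omega/m_W)$, is orthogonal to $p^\mu$, and keeps its normalization; the function names and the value of $m_W$ are illustrative assumptions.

```python
import math

M_W = 80.4  # W mass in GeV (illustrative value)


def boost_z(vec, gamma, beta):
    """Lorentz boost of a four-vector (t, x, y, z) along +z."""
    t, x, y, z = vec
    return (gamma * (t + beta * z), x, y, gamma * (z + beta * t))


def minkowski_dot(a, b):
    """Scalar product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def longitudinal_polarization(q):
    """Boost the rest-frame longitudinal polarization to momentum q along z.

    Returns the boosted polarization vector and the W four-momentum.
    """
    omega = math.hypot(q, M_W)          # energy of the W
    gamma, beta = omega / M_W, q / omega
    eps_rest = (0.0, 0.0, 0.0, 1.0)
    return boost_z(eps_rest, gamma, beta), (omega, 0.0, 0.0, q)


eps, p = longitudinal_polarization(1000.0)
print("eps(3)      =", eps)
print("p / m_W     =", tuple(c / M_W for c in p))
print("eps . p     =", minkowski_dot(eps, p))
print("eps . eps   =", minkowski_dot(eps, eps))
```

At high momentum the components of $\varepsilon^\mu(3)$ approach $p^\mu/m_W$, which is how the factor $q^\mu q^\nu/m_W^2$ in the propagator, and with it the growth with energy, arises.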

Furthermore, gauge invariance, which forces the mass of the force carrier to 
vanish, also guarantees the reduction of the divergent terms [see Eq. (8.51) and 
associated arguments]. It was QED's lesson that a gauge-invariant theory has, after 
all, at most logarithmic divergences and they can be subtracted by renormalizing 
the coupling constant and the mass. 
15.8.3

Introduction of SU(2) 



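If one wants to apply gauge theory to Ws, one has to deal with the difference
between the W boson and the photon. This is twofold. First, the W boson is charged,
and second, it has a finite and large mass. In order to solve the problems one at
a time, we forget the finite mass for the moment and see if gauge theory can be
applied to the charged force carrier. Let us consider what kind of symmetry it can
have. Defining a column vector

$$
\Psi_L = \begin{pmatrix}\nu_e \\ e^-\end{pmatrix}_L \tag{15.198}
$$

$$
\tau_\pm = \frac{1}{2}(\tau_1 \pm i\tau_2) \tag{15.199}
$$

where $\tau_i$, $i = 1, \ldots, 3$ are the Pauli matrices, we can reformulate the Lagrangian
(15.187) as

$$
\mathcal{L}_W = -\frac{g_W}{\sqrt{2}}\bar{\Psi}_L\gamma^\mu(\tau_+ W^+_\mu + \tau_- W^-_\mu)\Psi_L
= -g_W\bar{\Psi}_L\gamma^\mu\left(\frac{\tau_1}{2}W_{1\mu} + \frac{\tau_2}{2}W_{2\mu}\right)\Psi_L, \qquad
W^\pm_\mu = \frac{W_{1\mu} \mp iW_{2\mu}}{\sqrt{2}} \tag{15.200}
$$

The form reminds one of the isospin-independent nuclear force:

$$
\mathcal{L}_N = -ig\bar{\Psi}_N\gamma^5(\boldsymbol{\tau}\cdot\boldsymbol{\pi})\Psi_N, \qquad
\Psi_N = \begin{pmatrix}p \\ n\end{pmatrix}, \quad \boldsymbol{\pi} = (\pi_1, \pi_2, \pi_3) \tag{15.201}
$$

Apart from the space-time structure (not vector but pseudoscalar), it has almost
identical structure to Eq. (15.200) if we replace

$$
\Psi_N \to \Psi_L, \qquad \boldsymbol{\pi} \to \boldsymbol{W} \tag{15.202}
$$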

Here, we need to introduce a neutral $W^0$ to make them symmetrical and add
$(\tau_3/2)W_{3\mu}$ to terms inside the parentheses on the second line of Eq. (15.200). The
nuclear force was isospin independent or invariant under exchange of p and n. This
is SU(2) symmetry, to which the gauge theory of QED was generalized by Yang and
Mills in 1954 [396]. Therefore, requiring the weak interaction to be invariant under
exchange of ($\nu_e \leftrightarrow e^-$) or the "weak isospin independence", we postulate the weak
Lagrangian has the form

$$
\mathcal{L}_W = -g_W\bar{\Psi}_L\gamma^\mu\,\frac{\boldsymbol{\tau}}{2}\cdot\boldsymbol{W}_\mu\,\Psi_L \tag{15.203}
$$

The SU(2) symmetry assumption is to require the existence of the third member,
a neutral $W^0$ boson. For comparison's sake let us write down the electromagnetic
Lagrangian for the $(\nu_e, e^-)$ pair:

$$
\mathcal{L}_{EM} = e(\bar{e}\gamma^\mu e)A_\mu = \frac{e}{2}\bar{\Psi}\gamma^\mu(1 - \tau_3)\Psi A_\mu, \qquad \Psi = \begin{pmatrix}\nu_e \\ e^-\end{pmatrix} \tag{15.204}
$$

Comparing Eq. (15.204) with the weak current we see immediately that the isospin
current

$$
j_i^\mu \sim \bar{\Psi}\gamma^\mu\tau_i\Psi \tag{15.205}
$$

appears in both the weak and the electromagnetic current. In fact, we have already
seen that the vector part of the weak current and the electromagnetic current is
identical apart from the coupling constant. It is a consequence of the CVC
hypothesis and has been experimentally confirmed. Thus, the weak current not only
resembles the electromagnetic current but also owns a part of it jointly. This strongly
suggests that the weak interaction is also governed by a gauge theory and is a part
of a larger symmetry group containing both the weak force and electromagnetism.
Conjecture had reached this point when the V-A interaction was determined in
1957. Glashow even conceived a model to unify the weak and electromagnetic
interactions [173]. However, the finite mass of W, which apparently violates gauge
invariance, was the obstacle to going further. To reach the electroweak unified model,
deep insight into the theory of gauge invariance is necessary. We shall discuss gauge
theory and the electroweak theory in Chap. 18.
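The isospin algebra used in this section can be checked with explicit $2 \times 2$ matrices. The sketch below assumes the convention $W^\pm = (W_1 \mp iW_2)/\sqrt{2}$ and verifies two identities: $\tau_+W^+ + \tau_-W^- = (\tau_1W_1 + \tau_2W_2)/\sqrt{2}$, and that $(1 - \tau_3)/2$ projects onto the lower (electron) component of the doublet. The numerical field values are arbitrary placeholders and the helper names are illustrative.

```python
import math


def mat_add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]


def mat_scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


# Pauli matrices acting on the doublet (nu_e, e^-).
TAU1 = [[0, 1], [1, 0]]
TAU2 = [[0, -1j], [1j, 0]]
TAU3 = [[1, 0], [0, -1]]
IDENTITY = [[1, 0], [0, 1]]

TAU_PLUS = mat_scale(0.5, mat_add(TAU1, mat_scale(1j, TAU2)))
TAU_MINUS = mat_scale(0.5, mat_add(TAU1, mat_scale(-1j, TAU2)))

# Arbitrary real field values at one space-time point (placeholders).
W1, W2 = 0.3, -1.7
W_PLUS = (W1 - 1j * W2) / math.sqrt(2)
W_MINUS = (W1 + 1j * W2) / math.sqrt(2)

charged_form = mat_add(mat_scale(W_PLUS, TAU_PLUS), mat_scale(W_MINUS, TAU_MINUS))
cartesian_form = mat_scale(
    1 / math.sqrt(2), mat_add(mat_scale(W1, TAU1), mat_scale(W2, TAU2))
)

# (1 - tau_3)/2 should pick out the electron component only.
CHARGE_PROJECTOR = mat_scale(0.5, mat_add(IDENTITY, mat_scale(-1, TAU3)))

print("difference between the two forms:", max_abs_diff(charged_form, cartesian_form))
print("electron projector:", CHARGE_PROJECTOR)
```

The same matrices $\tau_i$ appear in both the charged weak current and, through $\tau_3$, the electromagnetic current, which is the algebraic content of the statement that the two currents share a part.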



16 

Neutral Kaons and CP Violation



Symmetry is a main theme of modern particle physics. The discovery of parity 
violation was an epoch-making event in its history and we have learned in the pre- 
vious chapter that it stems from the different action of the weak force on left- and 
right-handed particles. Pursuit of its origin has led us to chiral gauge symmetry, 
which is the foundation of modern particle theory. The discovery of CP violation in 
1964 was also unexpected, and despite 50 years' effort there has been little progress 
in understanding its true nature. Nevertheless, we have ample reason to believe 
that elucidating its origin would establish yet another paradigm of particle physics, 
because recently it was confirmed that the framework in which CP violation is at 
work requires (at least) three generations of quarks and leptons, whose existence 
remains a mystery referred to as the flavor problem (or family or generation prob- 
lem). It is also known that CP violation played a crucial role in the early universe in 
developing from the hot big bang chaos to the present form of a matter-dominant 
universe. There is a good chance that pursuit of CP violation will lead to true un- 
derstanding of the origin of the universe and uncover the mystery of flavors, about 
which the Standard Model has little to offer. 

CP violation is built into the Standard Model within the framework of the 
Kobayashi-Maskawa (KM) matrix, but its dynamical origin remains to be clarified. 
It is also suggested that although CP violation as observed experimentally can 
be fully explained by the KM framework, it is not enough to explain the matter- 
antimatter asymmetry of the universe. In this sense "CP violation" can be consid- 
ered as an exceptional topic in the Standard Model. Needless to say, concentrating 
on a small anomaly has often provided a chance to open a door to a new view, and, 
in the author's personal opinion, understanding the vacuum issues (also known 
as the Higgs mechanism and dark energy) and CP violation are the two key factors 
to a new physics beyond the Standard Model. In this chapter, we discuss neutral 
kaon processes, in which the CP violation was discovered and which are the most 
investigated. Although we do not touch phenomena involving B decays, the neutral 
kaon provides sensitive tests for other fundamental questions, including T, CPT 
invariance and quantum gravity, and deserves a special chapter of its own. 
16.1.1

Strangeness Eigenstates and CP Eigenstates

There are two kinds of neutral kaons, $K^0$ and $\bar{K}^0$, which have strangeness $S = +1$
and $S = -1$, respectively. $S$ is a conserved quantum number in the strong
interaction, as can be seen in the reactions with particles that have known strangeness
($S_\pi = S_p = S_n = 0$, $S_\Lambda = -1$):

(a) $\pi^-(d\bar{u}) + p(uud) \to K^0(d\bar{s}) + \Lambda(uds)$, with $S$: $0, 0 \to +1, -1$

(b) $\pi^-(d\bar{u}) + p(uud) \to K^0(d\bar{s}) + \bar{K}^0(s\bar{d}) + n(udd)$, with $S$: $0, 0 \to +1, -1, 0$

(16.1a)

(c) $K^0(d\bar{s}) + p(uud) \to K^+(u\bar{s}) + n(udd)$, with $S$: $+1, 0 \to +1, 0$

(d) $\bar{K}^0(s\bar{d}) + n(udd) \to K^-(s\bar{u}) + p(uud)$, with $S$: $-1, 0 \to -1, 0$

(e) $\bar{K}^0(s\bar{d}) + p(uud) \to \pi^+(u\bar{d}) + \Lambda(uds)$, with $S$: $-1, 0 \to 0, -1$

(f) $\bar{K}^0(s\bar{d}) + n(udd) \to \pi^-(d\bar{u}) + \Sigma^+(uus)$, with $S$: $-1, 0 \to 0, -1$

(16.1b)

$K^0$ ($d\bar{s}$, $S = +1$) pairs with $K^+(u\bar{s})$ and $\bar{K}^0$ ($s\bar{d}$, $S = -1$) pairs with $K^-$ ($s\bar{u}$) to
make isospin doublets that are antiparticles of each other:

$$
(K^+, K^0) \quad \text{and} \quad (\bar{K}^0, K^-)
$$

The above strangeness-conserving reactions and their doublet structure are
obvious if one analyzes them in terms of the quark model, taking note of the fact that
the strong interaction does not change a quark's flavor.
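The quark-model bookkeeping can be automated in a few lines. The sketch below encodes each hadron by its quark content, with lower case for quarks and upper case for antiquarks (an encoding chosen here for illustration), and checks that the net number of each flavor, and hence strangeness, is the same on both sides of reactions (a) to (f).

```python
# Quark content: lower case = quark, upper case = antiquark (illustrative encoding).
HADRONS = {
    "pi-": "dU", "pi+": "uD", "p": "uud", "n": "udd",
    "K0": "dS", "K0bar": "sD", "K+": "uS", "K-": "sU",
    "Lambda": "uds", "Sigma+": "uus",
}

REACTIONS = {
    "a": (["pi-", "p"], ["K0", "Lambda"]),
    "b": (["pi-", "p"], ["K0", "K0bar", "n"]),
    "c": (["K0", "p"], ["K+", "n"]),
    "d": (["K0bar", "n"], ["K-", "p"]),
    "e": (["K0bar", "p"], ["pi+", "Lambda"]),
    "f": (["K0bar", "n"], ["pi-", "Sigma+"]),
}


def net_flavor(names):
    """Net quark number per flavor (quarks minus antiquarks), zeros dropped."""
    totals = {}
    for name in names:
        for quark in HADRONS[name]:
            sign = 1 if quark.islower() else -1
            totals[quark.lower()] = totals.get(quark.lower(), 0) + sign
    return {flavor: count for flavor, count in totals.items() if count != 0}


def strangeness(names):
    """S = -(number of s quarks) + (number of anti-s quarks)."""
    return -net_flavor(names).get("s", 0)


conserved = {
    label: net_flavor(initial) == net_flavor(final)
    for label, (initial, final) in REACTIONS.items()
}
for label, (initial, final) in REACTIONS.items():
    print(label, "S:", strangeness(initial), "->", strangeness(final),
          "flavor conserved:", conserved[label])
```

Because the strong interaction only rearranges quarks or creates quark-antiquark pairs, every flavor count balances, and strangeness conservation is a special case of that.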

As to decay processes, they have common decay modes including $2\pi$, $3\pi$ and
$\pi l \nu$ and cannot be distinguished by strangeness. This reflects the simple fact
that it is not a good quantum number in the weak interaction. On the other hand,
since the discovery of P as well as C violation, the combined operation CP has
been considered to be conserved in the weak interaction. We shall come back to
CP violation later, but let us discuss processes first assuming it is conserved. We
define a CP operator by

$$
CP|K^0\rangle = e^{i\xi_{CP}}|\bar{K}^0\rangle = \eta_{CP}|\bar{K}^0\rangle, \qquad
CP|\bar{K}^0\rangle = e^{-i\xi_{CP}}|K^0\rangle = \eta^*_{CP}|K^0\rangle \tag{16.3}
$$

It is conventional to set the phase $\xi_{CP} = 0$ and define CP eigenstates by

$$
|K_1\rangle = (|K^0\rangle + |\bar{K}^0\rangle)/\sqrt{2} \tag{16.4a}
$$

$$
|K_2\rangle = (|K^0\rangle - |\bar{K}^0\rangle)/\sqrt{2} \tag{16.4b}
$$

Then $CP|K_1\rangle = |K_1\rangle$, $CP|K_2\rangle = -|K_2\rangle$ and $K_1$, $K_2$ can have different decay modes.
The $2\pi$ and $3\pi$ systems to which the K meson can decay have CP values $+1$ and
$-1$, respectively, hence the decay modes are separated as $K_1 \to 2\pi$, $K_2 \to 3\pi$.

(1) $2\pi$: Remembering $P\pi = -\pi$, $P(2\pi) = (-1)^l$ and the C operation exchanges
$\pi^+ \leftrightarrow \pi^-$, we find Bose statistics dictates that C and P operations give the same
sign. Therefore $CP(\pi^+\pi^-) = (-1)^{2l} = +1$, where $l$ is the orbital angular
momentum. As $CP\pi^0 = -\pi^0$, Bose statistics forces $l$ of the $2\pi^0$ system to be even and
$CP(\pi^0\pi^0) = +1$.

(2) $3\pi$: For $\pi^+\pi^-\pi^0$, denoting the orbital angular momentum of $\pi^+\pi^-$ as $l$ and
that of $\pi^0$ relative to the center of mass of $\pi^+\pi^-$ as $L$, we have $CP(\pi^+\pi^-\pi^0) =
(-1)^{2l+L+3}$. As the spin of the K meson is 0, $l = L$. Because of the angular
momentum barrier (see Sect. 13.5.3), the dominant component is $CP = -1$.$^{1)}$ For
$3\pi^0$, the same argument as for $2\pi^0$ makes $l$ = even and $CP(3\pi^0) = -1$.

The above argument shows that $K_1$ decays to $2\pi$ and $K_2$ to $3\pi$. As the Q-value
($m_K - \sum m_\pi$) of $K_1 \to 2\pi$ is much larger than that of $K_2 \to 3\pi$, the $K_1 \to 2\pi$
decay rate is much larger than that of $K_2 \to 3\pi$. Namely, $K_1$ is short lived compared
to $K_2$.
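Both statements above, that $K_1$ and $K_2$ are CP eigenstates and that the $2\pi$ channel has much more available energy, can be checked with a short calculation. The sketch below represents CP in the basis $(K^0, \bar{K}^0)$ with the phase convention $\xi_{CP} = 0$; the meson masses are approximate values supplied here as assumptions, not numbers quoted in the text.

```python
import math

# CP in the basis (K0, K0bar) with the phase xi_CP set to zero.
CP = [[0.0, 1.0], [1.0, 0.0]]


def apply(matrix, vec):
    """Apply a 2x2 matrix to a two-component state."""
    return [
        matrix[0][0] * vec[0] + matrix[0][1] * vec[1],
        matrix[1][0] * vec[0] + matrix[1][1] * vec[1],
    ]


INV_SQRT2 = 1 / math.sqrt(2)
K1 = [INV_SQRT2, INV_SQRT2]    # (|K0> + |K0bar>) / sqrt(2)
K2 = [INV_SQRT2, -INV_SQRT2]   # (|K0> - |K0bar>) / sqrt(2)

cp_k1 = apply(CP, K1)  # expected to equal +K1
cp_k2 = apply(CP, K2)  # expected to equal -K2

# Approximate masses in MeV (assumed values).
M_K0, M_PI_CHARGED, M_PI_NEUTRAL = 497.6, 139.57, 134.98

q_two_pi = M_K0 - 2 * M_PI_CHARGED                    # K1 -> pi+ pi-
q_three_pi = M_K0 - 2 * M_PI_CHARGED - M_PI_NEUTRAL   # K2 -> pi+ pi- pi0

print("CP K1 =", cp_k1, " CP K2 =", cp_k2)
print(f"Q(2 pi) = {q_two_pi:.1f} MeV, Q(3 pi) = {q_three_pi:.1f} MeV")
```

The $3\pi$ channel has well under half the energy release of the $2\pi$ channel and a three-body phase space, which is the kinematic reason for the long $K_2$ lifetime.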

16.1.2 

Schrödinger Equation for $K^0$-$\bar{K}^0$ States

Both $K^0$ and $\bar{K}^0$ share common decay modes $2\pi$ and $3\pi$. This means a transition
$K^0 \leftrightarrow (2\pi, 3\pi) \leftrightarrow \bar{K}^0$ or mixing is possible and $\langle\bar{K}^0|H|K^0\rangle \neq 0$. The existence of
the mixing means that $K^0$ can change to $\bar{K}^0$ in vacuum. Time development of the
wave function of the $K^0$, $\bar{K}^0$ system can be written as

$$
i\frac{\partial}{\partial t}\psi(t) = H\psi(t) \tag{16.5a}
$$

$$
\psi(t) = a(t)|K^0\rangle + b(t)|\bar{K}^0\rangle + \sum_m c_m(t)|m\rangle \tag{16.5b}
$$

Here, $|m\rangle$ is any other state than $K^0$ and $\bar{K}^0$ into which $K^0$, $\bar{K}^0$ can decay. The
Hamiltonian $H$ is hermitian and contains the strong interaction as well as the weak
interaction, which induces mixing. However, it is very difficult to solve the equation
in general. It is convenient to consider only two states $K^0$, $\bar{K}^0$ and introduce a mass
operator $\Lambda$ such that the two-component wave function obeys the Schrödinger-type
equation

$$
i\frac{d\phi}{dt} = \Lambda\phi(t), \qquad \phi(t) = \alpha(t)|K^0\rangle + \beta(t)|\bar{K}^0\rangle \tag{16.6}
$$

1) Because of this $K_1 \to 3\pi$ can occur (albeit rarely), but $K_2 \not\to 2\pi$.
Any transition to other states ($m \neq K^0, \bar K^0$) is considered as $K^0$ or $\bar K^0$ lost from
this two-component system. Then the probability amplitude in the kaon rest frame
must decrease with time. The conservation of probability (or unitarity) means

$\frac{d\rho}{dt} = \frac{\partial(\psi^\dagger\psi)}{\partial t} = -i\,\psi^\dagger(\Lambda - \Lambda^\dagger)\psi = -\psi^\dagger(t)\,\Gamma\,\psi(t)$ (16.7)



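This means $\Lambda$ can be expressed in terms of two hermitian operators $M$ and $\Gamma$:

$\Lambda = \frac{\Lambda + \Lambda^\dagger}{2} + \frac{\Lambda - \Lambda^\dagger}{2} \equiv M - \frac{i}{2}\Gamma$, $M^\dagger = M$, $\Gamma^\dagger = \Gamma$ (16.8)

Then the time evolution of an eigenstate of $\Lambda$ can be expressed as

$e^{-i\Lambda t}|K\rangle \sim \exp\left[-i m_K t - \frac{\Gamma}{2}t\right]|K\rangle$ (16.9)

Multiplying by $\langle K^0|$, $\langle \bar K^0|$ from the left and noting that $|K^0\rangle$ and $|\bar K^0\rangle$ are orthogonal, Eq.
(16.6) can be cast in matrix form

$i\frac{d}{dt}\begin{bmatrix}\alpha(t)\\ \beta(t)\end{bmatrix} = \begin{bmatrix}\langle K^0|\Lambda|K^0\rangle & \langle K^0|\Lambda|\bar K^0\rangle\\ \langle \bar K^0|\Lambda|K^0\rangle & \langle \bar K^0|\Lambda|\bar K^0\rangle\end{bmatrix}\begin{bmatrix}\alpha(t)\\ \beta(t)\end{bmatrix} \equiv \begin{bmatrix}\Lambda_{11} & \Lambda_{12}\\ \Lambda_{21} & \Lambda_{22}\end{bmatrix}\begin{bmatrix}\alpha(t)\\ \beta(t)\end{bmatrix}$ (16.10)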

Note that the mass matrix $\Lambda$ has complex eigenvalues and is not a hermitian operator.
The mass matrix can be expressed as (see Appendix H)

$\Lambda_{ij} = M_{ij} - \frac{i}{2}\Gamma_{ij}$ (16.11a)

$M_{ij} = m_K\delta_{ij} + \langle i|H_W|j\rangle + \sum_n P\,\frac{\langle i|H_W|n\rangle\langle n|H_W|j\rangle}{m_K - E_n}$ (16.11b)

$\Gamma_{ij} = 2\pi\sum_n \delta(m_K - E_n)\,\langle i|H_W|n\rangle\langle n|H_W|j\rangle$ (16.11c)

where $i$, $j$ denote $K^0$ or $\bar K^0$ and $P$ means the principal value. The intermediate state
$n$ is any state to which $K^0$ or $\bar K^0$ can make a transition through the weak interaction. The above
formula is referred to as the Weisskopf-Wigner approximation. In this approximation
weak interactions between final states are neglected, and it is valid for times $t$
large compared with the strong interaction scale $t_{st} \sim 1/m_K \sim 10^{-23}$ to $10^{-24}$ s. Equations
(16.11), except for the first term in (b), look like the second-order transition
matrix Eq. (6.37) introduced in Chap. 6. However, there are some differences. Here,
the sum over intermediate states is taken only over continuum states to which
$K^0$, $\bar K^0$ can decay and does not include $K^0$, $\bar K^0$ themselves. In the transition matrix
Eq. (6.37), the intermediate states constitute a complete set and the formula is valid
for time $t$ much greater than $t_{st}$.
Diagonalization of Eq. (16.10) changes $(\alpha, \beta)$ to $(\alpha', \beta')$, which is a representation
in the basis $|K_1\rangle$, $|K_2\rangle$:

$\psi = \alpha'(t)|K_1\rangle + \beta'(t)|K_2\rangle = \alpha(t)|K^0\rangle + \beta(t)|\bar K^0\rangle$ (16.12a)

$\lambda_S = m_S - \frac{i}{2}\Gamma_S$, $\lambda_L = m_L - \frac{i}{2}\Gamma_L$ (16.12c)

We use $K_{1,2}$ to denote the CP $= \pm 1$ eigenstates, but have attached subscripts S, L to
denote the eigenvalues. The nomenclature $K_{S,L}$ is saved for the weak decay eigenstates
including CP violation effects that appear later. Note that $\lambda$ is a complex number
but $m_{S,L}$, $\Gamma_{S,L}$ are real numbers. Representations in the $K^0$, $\bar K^0$ and $K_1$, $K_2$ bases
are related by

$M = (\lambda_L + \lambda_S)/2$, $\Delta\lambda = \lambda_L - \lambda_S$ (16.13c)

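The eigenvalues are expressed in terms of the original mass matrix elements as

$\lambda_{S,L} = \frac{\Lambda_{11} + \Lambda_{22}}{2} \pm \sqrt{\left(\frac{\Lambda_{11} - \Lambda_{22}}{2}\right)^2 + \Lambda_{12}\Lambda_{21}} \equiv M \mp \Delta\lambda/2$ (16.14a)

$\mathrm{Tr}[\Lambda] = \lambda_S + \lambda_L = \Lambda_{11} + \Lambda_{22}$ (16.14b)

$\det[\Lambda] = \lambda_S\lambda_L = \Lambda_{11}\Lambda_{22} - \Lambda_{12}\Lambda_{21}$ (16.14c)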
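As a quick numerical cross-check of Eqs. (16.14), the sketch below evaluates the closed-form eigenvalues for a sample matrix and verifies the trace and determinant relations. The matrix entries are hypothetical illustrative numbers in arbitrary units, and the function name is an assumption of this example. Because the principal branch of the complex square root need not coincide with the sign convention of Eq. (16.14a), the two eigenvalues are labeled by their widths instead.

```python
import cmath


def mass_matrix_eigenvalues(l11, l12, l21, l22):
    """Both eigenvalues of [[l11, l12], [l21, l22]] from the closed form of Eq. (16.14a)."""
    mean = (l11 + l22) / 2
    root = cmath.sqrt(((l11 - l22) / 2) ** 2 + l12 * l21)
    return mean + root, mean - root


# Hypothetical CPT- and CP-symmetric example (arbitrary units).
L11 = L22 = 1.0 - 0.25j
L12 = L21 = -0.24 - 0.12j

ev_a, ev_b = mass_matrix_eigenvalues(L11, L12, L21, L22)

# Width Gamma = -2 Im(lambda): the short-lived state has the more negative imaginary part.
lam_s, lam_l = sorted((ev_a, ev_b), key=lambda ev: ev.imag)

trace_residual = abs((lam_s + lam_l) - (L11 + L22))
det_residual = abs(lam_s * lam_l - (L11 * L22 - L12 * L21))
delta_lambda = lam_l - lam_s

print("lambda_S =", lam_s, " lambda_L =", lam_l)
print("trace residual:", trace_residual, " det residual:", det_residual)
```

For this particular matrix, $\Delta\lambda = -2\Lambda_{12}$, in line with the CPT relation $\Delta\lambda = -2\sqrt{\Lambda_{12}\Lambda_{21}}$ once the branch of the root is fixed by the phase convention.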

The symmetry requirements constrain the matrix elements, which will be discussed
in detail in Sect. 16.2.1:

$\Lambda_{11} = \Lambda_{22} = M$, $\Delta\lambda = -2\sqrt{\Lambda_{12}\Lambda_{21}}$   CPT (16.15a)

$|\Lambda_{12}| = |\Lambda_{21}|$   T (16.15b)

$\Lambda_{11} = \Lambda_{22} = M$ and $|\Lambda_{12}| = |\Lambda_{21}|$   CP (16.15c)

Note that for the off-diagonal elements ($\Lambda_{12}$, $\Lambda_{21}$), only the absolute magnitude can be
constrained. This is because we are free to change their phase, a freedom
which originates from strangeness conservation in the strong interaction. Or
rather, the freedom to change the phase is the origin of the strangeness conservation
(Noether's theorem).2) By rephasing the states $|\bar K^0\rangle \to e^{i\xi}|\bar K^0\rangle$,
$|K^0\rangle \to e^{-i\xi}|K^0\rangle$,

$\Lambda_{12} \to \Lambda_{12}e^{2i\xi}$, $\Lambda_{21} \to \Lambda_{21}e^{-2i\xi}$ (16.17)

Observable quantities do not depend on the phase convention. Note, however, that 
once one has fixed it one has to stick to it, otherwise contradictions will ensue. 
Referring to Eq. (16.11), we see that Eq. (16.15a) means the mass and lifetime of a 
particle and its antiparticle are the same. 

Problem 16.1 

Derive Eq. (16.13) and Eq. (16.14). 

16.1.3 

Strangeness Oscillation 

Strangeness oscillation is a phenomenon in which the $K^0$ and $\bar K^0$ states transform back and
forth into each other. Suppose we have a pure $|K^0\rangle$ beam at $t = 0$. It can be made in several ways:

1. By using the difference of the energy threshold between reactions (a) ($E_{th} =
0.91$ GeV) and (b) ($E_{th} = 1.5$ GeV) in Eq. (16.1a) and setting the $\pi$ energy between
0.91 and 1.5 GeV.

2. By stopping a high-flux low-energy antiproton beam in a pressurized hydrogen
gas target and selecting the following reactions:

$\bar p + p \to K^-\pi^+K^0$ or $K^+\pi^-\bar K^0$ (16.18)

The CPLEAR experiment at CERN [109] used an antiproton beam with momentum set at
$p = 200$ MeV/c and obtained pure $K^0$ ($\bar K^0$) by tagging $K^-\pi^+$ ($K^+\pi^-$).

3. By using a $\phi$ factory. An electron-positron collider with total center-of-mass
energy $\sqrt{s} = m_\phi = 1019.4$ MeV called DAΦNE of INFN (Istituto Nazionale
di Fisica Nucleare) at Frascati, Italy [237] can produce a coherent $K^0$-$\bar K^0$ state
with $J^{PC} = 1^{--}$ by

$e^- + e^+ \to \phi$, $\phi \to K^0\bar K^0$ (16.19)

If both CPT and CP invariance hold, $K_1$, $K_2$ are the mass eigenstates. Setting
$|K^0(t = 0)\rangle = |K^0\rangle$, its time development can be described as

$|K^0(t)\rangle = [|K_1(t)\rangle + |K_2(t)\rangle]/\sqrt{2} = [|K_1\rangle e^{-i\lambda_S t} + |K_2\rangle e^{-i\lambda_L t}]/\sqrt{2}$
$\qquad = g_+(t)|K^0\rangle - g_-(t)|\bar K^0\rangle$ (16.20a)

$g_\pm(t) = \frac{1}{2}\left[e^{-i\lambda_L t} \pm e^{-i\lambda_S t}\right]$ (16.20b)

2) Rephasing would change the CP phase $\eta_{CP} \to \eta_{CP}e^{2i\xi}$ defined in Eq. (16.3) for consistency, but it
would not change any conclusion related to the CP phase.
which shows that a $\bar K^0$ component appears in the originally pure $K^0$ beam as time
passes. The probability of detecting $K^0$ (or $\bar K^0$) at time $t$ is given by

$P(K^0 \to K^0) = |\langle K^0|K^0(t)\rangle|^2 = |g_+(t)|^2$ (16.21a)

$P(K^0 \to \bar K^0) = |\langle \bar K^0|K^0(t)\rangle|^2 = |g_-(t)|^2$ (16.21b)

$|g_\pm(t)|^2 = \frac{e^{-\bar\Gamma t}}{2}\left[\cosh\frac{\Delta\Gamma\,t}{2} \pm \cos\Delta m\,t\right]$ (16.21c)

$\bar\Gamma = (\Gamma_S + \Gamma_L)/2$, $\Delta\Gamma = \Gamma_S - \Gamma_L$, $\Delta m = m_L - m_S$ (16.21d)

Note, if CP invariance is assumed, $P(K^0 \to K^0) = P(\bar K^0 \to \bar K^0)$ and $P(K^0 \to
\bar K^0) = P(\bar K^0 \to K^0)$. Equations (16.21) mean that the components of $K^0$ and $\bar K^0$ oscillate
as a function of time. If $\Delta m \gg \bar\Gamma \sim \Gamma_S$, the oscillation is too fast to detect $K^0$
and $\bar K^0$ separately. However, if $\Delta m \sim \Gamma_S$, the oscillation can be observed. Observations
show $\Delta m \sim \Gamma_S/2$. In Fig. 16.1, however, the value of $\Delta m$ has been chosen to
emphasize the oscillation feature.
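A minimal sketch of Eqs. (16.21) in code may help to see the size of the effect. Time is measured in units of $\tau_S$, so $\Gamma_S = 1$; the ratio $\Gamma_L/\Gamma_S \approx 1/571$ and $\Delta m/\Gamma_S \approx 0.474$ are assumed approximate inputs, and the function names are illustrative only.

```python
import math

GAMMA_S = 1.0            # decay rate of the short-lived state (time in units of tau_S)
GAMMA_L = 1.0 / 571.0    # assumed approximate ratio Gamma_L / Gamma_S
DELTA_M = 0.474          # assumed approximate Delta m in units of Gamma_S


def probabilities(t, gamma_s=GAMMA_S, gamma_l=GAMMA_L, delta_m=DELTA_M):
    """Return (P(K0 -> K0), P(K0 -> K0bar)) at proper time t, following Eq. (16.21c)."""
    gamma_bar = (gamma_s + gamma_l) / 2
    delta_gamma = gamma_s - gamma_l
    envelope = math.exp(-gamma_bar * t) / 2
    hyperbolic = math.cosh(delta_gamma * t / 2)
    oscillation = math.cos(delta_m * t)
    return envelope * (hyperbolic + oscillation), envelope * (hyperbolic - oscillation)


def asymmetry(t, gamma_s=GAMMA_S, gamma_l=GAMMA_L, delta_m=DELTA_M):
    """(P_same - P_changed) / (P_same + P_changed) = cos(dm t) / cosh(dGamma t / 2)."""
    return math.cos(delta_m * t) / math.cosh((gamma_s - gamma_l) * t / 2)


for t in (0.0, 2.0, 5.0, 10.0):
    p_same, p_changed = probabilities(t)
    print(f"t = {t:4.1f} tau_S: P(K0)= {p_same:.4f}  P(K0bar)= {p_changed:.4f}  A = {asymmetry(t):+.4f}")
```

The sum of the two probabilities is the plain two-exponential decay law $(e^{-\Gamma_S t} + e^{-\Gamma_L t})/2$; only the difference carries the $\cos\Delta m\,t$ oscillation.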

The right-hand panel of Fig. 16.1 shows asymmetry data of CPLEAR [107] defined as

$A = \frac{\{P(K^0 \to K^0) + P(\bar K^0 \to \bar K^0)\} - \{P(K^0 \to \bar K^0) + P(\bar K^0 \to K^0)\}}{\{P(K^0 \to K^0) + P(\bar K^0 \to \bar K^0)\} + \{P(K^0 \to \bar K^0) + P(\bar K^0 \to K^0)\}} = \frac{|g_+(t)|^2 - |g_-(t)|^2}{|g_+(t)|^2 + |g_-(t)|^2} = \frac{\cos\Delta m\,t}{\cosh(\Delta\Gamma\,t/2)}$ (16.22)

where $t$ is the proper time of the kaon. From the observed curve, $\Delta m$ can be determined
and is given by

$\Delta m = m_L - m_S = (3.485 \pm 0.01) \times 10^{-6}$ eV $= (0.4738 \pm 0.0013)\,\Gamma_S$ (16.23)

The sign of $\Delta m$ was determined using the regeneration process described in the
next section.

In order to observe the strangeness oscillation, $K^0$ and $\bar K^0$ have to be identified as
a function of time. Since they are eigenstates of the strong interaction, the observation
must in principle be made not through their decay modes but through their collisions
in the strong interaction. Notice, however, that the subsequent decay mode has to
be identified to confirm what reaction has occurred. For example, in a hydrogen
bubble chamber, the production of a $K^0$ by the reaction $\pi^- p \to K^0\Lambda$ at $t = 0$ can
be confirmed by detecting the subsequent decay $\Lambda \to \pi^- p$ ($\tau \simeq 2.63 \times 10^{-10}$ s).
A $\bar K^0$ at time $t$ can be detected, for example, by observing the reaction $\bar K^0 p \to \pi^+\Lambda$,
which leaves a visible $\pi^+$ track, but must be confirmed by its associated $\Lambda$ decay.
As the momentum and direction of the $\bar K^0$ can be reconstructed from the $\pi^+$ and $\Lambda$, the
flight distance of the $\bar K^0$ can be converted to its proper time. Other usable reactions are

$K^0 + p \to K^+ + n$, $\bar K^0 + n \to K^- + p$,
$\bar K^0 + n \to \pi^0 + \Lambda\ (\Lambda \to \pi^- + p)$ (16.24)

3) Their measurement is sufficiently accurate that CPT- and T-violating terms have to be considered.
Then $P(K^0 \to K^0) \neq P(\bar K^0 \to \bar K^0)$, $P(K^0 \to \bar K^0) \neq P(\bar K^0 \to K^0)$. However, by taking the sum,
both CPT- and T-violating terms are cancelled and give the same expressions as Eqs. (16.21).

Figure 16.1 (a) Illustration of the strangeness oscillation. The intensity of a pure $K^0$ beam at
$t = 0$ decreases as time passes (dashed line) and that of $\bar K^0$, which did not exist at $t = 0$,
appears. If $\Delta m \neq 0$, the two curves oscillate. Here, $\Delta m$ is set equal to $\Gamma$ to emphasize the
oscillation behavior. (b) Asymmetry $A_{\Delta m}$ data of CPLEAR of strangeness-conserving
and strangeness-changing components, plotted against the neutral-kaon decay time in units of $\tau_S$. The
strangeness was tagged by the sign of the decay leptons [107].

Figure 16.2 $K^0$-$\bar K^0$ mixing through intermediate states such as $2\pi$.

Having said that, we mention another and more practical method, which is a special
feature of the $K^0$-$\bar K^0$ system, that is, to use the $\Delta S = \Delta Q$ selection rule. It is a rule
that the hadronic component in the decay process has to respect because at the quark
level it is $s \to u + W^-$ (see Sect. 15.6). As $m_{K^\pm} \simeq m_{K^0}$, there is essentially no phase space for the strangeness-conserving
decay mode $d \to u e^-\bar\nu_e$ ($\bar d \to \bar u e^+\nu_e$); only the strangeness-changing mode
$s \to u e^-\bar\nu_e$ ($\bar s \to \bar u e^+\nu_e$) can occur, and the sign of the decay lepton can tell us
which of the parents it comes from. The strangeness oscillation curve of CPLEAR
in Fig. 16.1 was tagged in this way. Note, the $\Delta S = \Delta Q$ rule is obvious in the
quark model, but experimentally one has to be careful about the possible existence
of $\Delta S \neq \Delta Q$ processes.




Figure 16.3 Box diagram. (a,b) $K^0$-$\bar K^0$ mixing in the Standard Model. (c,d) Description of how
$2\pi$ states appear in the intermediate states by cutting the mass matrix diagram across the dash-dotted
lines. $3\pi$ or $\pi l\nu$ states can be constructed similarly.
The mass difference $\Delta m$ arises because of mixing between $K^0$ and $\bar K^0$ through
intermediate states such as those depicted in Fig. 16.2. Namely, it is an effect that
arises from the nondiagonal elements of the mass matrix. In the language of the
Standard Model, the intermediate states are given by $W^\pm$ and/or up-type quarks
($u$, $c$, $t$) shown in Fig. 16.3a,b.

Figure 16.3c,d shows how to associate the modern interpretation in terms of
quarks with the old picture ($2\pi$, $\pi l\nu$, etc.). The mixing is a $\Delta S = 2$ effect and
second-order in the weak interaction. Therefore $\Delta m$ should be proportional to the
square of the coupling constant $G_F\sin\theta_c$. From dimensional analysis

$\Delta m \sim G_F^2\sin^2\theta_c\,m_K^5 \sim 10^{-4}$ eV (16.25)

which is not too far from the observed value.
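The dimensional estimate of Eq. (16.25) is easy to reproduce numerically. The sketch below uses natural units (GeV); the values of $G_F$, $\sin\theta_c$ and $m_K$ are approximate standard numbers assumed for this example rather than quoted from the text.

```python
G_F = 1.166e-5        # Fermi constant in GeV^-2 (assumed approximate value)
SIN_THETA_C = 0.22    # sine of the Cabibbo angle (assumed approximate value)
M_K = 0.4976          # neutral kaon mass in GeV (assumed approximate value)
OBSERVED_DM_EV = 3.485e-6  # observed Delta m in eV, Eq. (16.23)

# Dimensional analysis: [G_F^2] = GeV^-4, so G_F^2 * m_K^5 carries the dimension of energy.
delta_m_gev = G_F ** 2 * SIN_THETA_C ** 2 * M_K ** 5
delta_m_ev = delta_m_gev * 1.0e9

print(f"estimated Delta m ~ {delta_m_ev:.1e} eV")
print(f"estimate / observed ~ {delta_m_ev / OBSERVED_DM_EV:.0f}")
```

The estimate lands at the $10^{-4}$ eV scale, one to two orders of magnitude above the measured value; a pure dimensional argument cannot fix numerical factors, so this level of agreement is about as much as it can deliver.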

16.1.4 
Regeneration of $K_1$

A pure beam, whether $K^0$ or $\bar K^0$ at $t = 0$, becomes a pure $K_2$ beam after a long
enough time ($t \gg \tau_S = 1/\Gamma_S$), where $\tau_S$ is the mean lifetime of $K_1$. This can be
confirmed by the fact that two-prong decay modes (a topology that splits into two branches)
appear in the neighborhood of production ($l \lesssim l_S = \beta\gamma\tau_S c$), where
$\gamma = E_K/m_K$ is the Lorentz factor, but at distances $l \gg l_S$ only three-prong decay
modes are observed (Fig. 16.4).
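To get a feeling for the distances involved, the following sketch evaluates $l_S = \beta\gamma\,c\tau_S$ for a given kaon momentum, using $\beta\gamma = p/m_K$. The value $c\tau_S \approx 2.68$ cm and the kaon mass are assumed approximate inputs, and the function names are hypothetical.

```python
import math

C_TAU_S_CM = 2.684   # c * tau_S in cm (assumed approximate value)
M_K_GEV = 0.4976     # neutral kaon mass in GeV (assumed approximate value)


def decay_length_cm(momentum_gev):
    """Mean decay length l_S = beta*gamma*c*tau_S of the short-lived kaon, with beta*gamma = p/m."""
    return (momentum_gev / M_K_GEV) * C_TAU_S_CM


def surviving_fraction(distance_cm, momentum_gev):
    """Fraction of the short-lived component left after flying distance_cm."""
    return math.exp(-distance_cm / decay_length_cm(momentum_gev))


for p in (1.1, 10.0):
    l_s = decay_length_cm(p)
    print(f"p = {p:5.1f} GeV/c: l_S = {l_s:6.1f} cm, "
          f"fraction left after 20 l_S = {surviving_fraction(20 * l_s, p):.1e}")
```

After some twenty decay lengths the short-lived component is suppressed by roughly nine orders of magnitude, which is why a beam far from the target can be treated as pure $K_2$.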

Let us consider making this beam pass through matter. $K^0$, having $S = +1$, is
elastically ($K^0 N \to K^0 N$) or quasi-elastically ($K^0 p \to K^+ n$) scattered at most, but
$\bar K^0$, having $S = -1$, can also produce hyperons [Eqs. (16.1b) (e)(f)] and thus has a
larger absorption rate. If we write the states after matter passage as

$|K^0\rangle \to f|K^0\rangle$, $|\bar K^0\rangle \to \bar f|\bar K^0\rangle$ (16.26)

then

$|K_2\rangle = (|K^0\rangle - |\bar K^0\rangle)/\sqrt{2} \to (f|K^0\rangle - \bar f|\bar K^0\rangle)/\sqrt{2} = \frac{1}{2}(f + \bar f)|K_2\rangle + \frac{1}{2}(f - \bar f)|K_1\rangle$ (16.27)

Figure 16.4 Regeneration of $K_1$. Two-prong decay modes ($K_1 \to 2\pi$) are limited to the neighborhood
($l \lesssim l_S$) of the production and regeneration targets. Three-prong decay modes ($K_2 \to 3\pi$)
are observed at long distances ($l \gg l_S$) as well as at short distances.
Since $f \neq \bar f$, the $K_1$ component, which did not exist before, appears after passage
through the matter, which is referred to as regeneration.4) The regeneration occurs
from the difference in the reaction rates and does not depend on which reaction is involved.
However, depending on the target size, it can be classified as inelastic scattering,
in which the kaon reacts with individual nucleons in the nucleus; diffraction scattering,
in which it is scattered elastically by the nucleus as a whole; and coherent
scattering, in which it reacts coherently with many atoms. Among them, coherent
scattering plays a special role in the study of CP violation and deserves a more detailed
explanation. As there is a small difference between the masses of $K_1$ and $K_2$,
the momentum $k_2$ of the parent $K_2$ and $k_1$ of the daughter $K_1$ are related by

$\sqrt{k_1^2 + m_S^2} = \sqrt{k_2^2 + m_L^2}$, $\therefore\ \Delta k = k_1 - k_2 = \frac{m}{k}(m_L - m_S) = \frac{m\,\Delta m}{k}$,
$m = (m_L + m_S)/2$, $k = (k_1 + k_2)/2$ (16.28)



Because of the Heisenberg uncertainty principle, scattering from targets within
the range $d \lesssim 1/\Delta k = k/(m\,\Delta m)$ can interfere coherently. If we take $k = 10$ GeV,
insertion of $m = m_K \simeq 0.5$ GeV, $\Delta m \simeq 3.5 \times 10^{-6}$ eV gives $d \sim 100$ cm. The
scattering angle by coherent reaction becomes

$d(1 - \cos\theta) \lesssim \lambda \sim 1/k$
$\theta^2 \lesssim 2\Delta k/k \sim 10^{-17}$ (16.29)

The momentum transfer to the regenerated $K_1$ is $p_T \lesssim 100$ eV, which is much smaller
than that in diffraction scattering ($p_T \lesssim 10$ MeV) and inelastic scattering ($p_T \gtrsim
100$ MeV), and it is almost a perfect forward scattering. Figure 16.5 shows an example
of the $p_T^2$ distribution after regeneration [244].
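The numbers quoted above follow from Eqs. (16.28) and (16.29) with a conversion from natural units to centimeters. The sketch below repeats the arithmetic; the value of $\hbar c$ is a standard constant supplied for this example, and the variable names are illustrative.

```python
import math

HBARC_GEV_CM = 1.9733e-14   # hbar*c in GeV*cm, used to convert 1/GeV to cm

k_gev = 10.0                # kaon momentum used in the text
m_gev = 0.5                 # kaon mass, rounded as in the text
delta_m_gev = 3.5e-6 * 1e-9  # Delta m = 3.5e-6 eV expressed in GeV

# Eq. (16.28): momentum difference between parent K2 and regenerated K1.
delta_k_gev = m_gev * delta_m_gev / k_gev

# Coherence length d ~ 1/Delta k, converted to cm.
coherence_length_cm = HBARC_GEV_CM / delta_k_gev

# Eq. (16.29): theta^2 <~ 2 Delta k / k, and the corresponding transverse momentum.
theta_squared = 2.0 * delta_k_gev / k_gev
p_t_ev = k_gev * math.sqrt(theta_squared) * 1e9

print(f"Delta k          = {delta_k_gev:.2e} GeV")
print(f"coherence length = {coherence_length_cm:.0f} cm")
print(f"theta^2          = {theta_squared:.1e}")
print(f"p_T              = {p_t_ev:.0f} eV")
```

The coherence length comes out at the meter scale and the transverse momentum at tens of eV, consistent with the order-of-magnitude statements in the text.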
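Writing the regenerated state after passage through matter of thickness $L$ as

$|\psi\rangle = |K_2\rangle + \rho(L)|K_1\rangle$ (16.30)

$\rho(L)$ is given by [181, 257]

$\rho(L) = \frac{i\pi N}{k}\{f(0) - \bar f(0)\}\,l_S\,\frac{1 - \exp[\{i(\Delta m/\Gamma_S) - 1/2\}L/l_S]}{1/2 - i(\Delta m/\Gamma_S)}$ (16.31)

where

$N$: Number density of the scattering centers
$k$: Momentum of the kaon
$f(0)$, $\bar f(0)$: Forward scattering amplitudes of $K^0$, $\bar K^0$
$l_S = \beta\gamma\tau_S$: Mean flight distance of $K_1$
$\gamma$: Lorentz factor
$\Gamma_S$: Total decay rate of $K_1$
$\Delta m = m_L - m_S$: Mass difference of $K_2$ and $K_1$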

4) This is like c 
phenomenor 

siiiiciposiiiol 
Figure 16.5 Data $p_T^2$ of $K \to \pi^+\pi^-$ of the regenerator beam. The Monte Carlo predictions
for the background components are overlaid. The arrow indicates the cut to select samples [244].
Notice that the width of the forward peak is determined not by the intrinsic scattering
width but by the experimental resolution.

Proof: As is well known from optical theory, a combined wave of incoming and
forward-scattered waves added coherently over all scattering centers is well described
by a plane wave with wave number $nk$, where $n$ is the index of refraction [176].
The expression for $n$ is

$n = 1 + \frac{2\pi N}{k^2}f(0)$, $|n - 1| \ll 1$ (16.32)

The eigenmass $M$ of the wave is given by

$k^2 + m_K^2 = (nk)^2 + M^2 \ \to\ M = \sqrt{m_K^2 - (n^2 - 1)k^2} \simeq m_K - \frac{2\pi N f(0)}{m_K}$ (16.33)

Since $K^0$ and $\bar K^0$ give different indices of refraction, the Schrödinger equation
Eq. (16.10) is modified to

$i\frac{d}{dt}\begin{bmatrix}\alpha\\ \beta\end{bmatrix} = \left\{\begin{bmatrix}\Lambda_{11} & \Lambda_{12}\\ \Lambda_{21} & \Lambda_{22}\end{bmatrix} - \frac{2\pi N}{m}\begin{bmatrix}f & 0\\ 0 & \bar f\end{bmatrix}\right\}\begin{bmatrix}\alpha\\ \beta\end{bmatrix}$ (16.34)

To derive the solution, we first note that the mass matrix can be expressed in an
analogous form to that given in Eq. (16.13b):

$\begin{bmatrix} M' - \delta\,\Delta\lambda & -\Delta\lambda/2\\ -\Delta\lambda/2 & M' + \delta\,\Delta\lambda\end{bmatrix}$ (16.35)

$M' = \frac{1}{2}(\lambda_L + \lambda_S) - \frac{\pi N}{m}(f + \bar f)$, $\delta\,\Delta\lambda = \frac{\pi N}{m}(f - \bar f)$
As $\delta$ is a small number, the solution of Eq. (16.35) is a small perturbation to
$|K_1\rangle$, $|K_2\rangle$ and is expressed as

$|K_{SM}\rangle \simeq |K_1\rangle + \delta|K_2\rangle$
$|K_{LM}\rangle \simeq |K_2\rangle - \delta|K_1\rangle$ (16.36)

where only terms up to $O(\delta)$ were considered. The eigenmass is given by the formula

$\lambda_{SM,LM} = \lambda_{S,L} - \frac{\pi N}{m}(f + \bar f)$ (16.37)
Problem 16.2 

Show that Eqs. (16.36) are the solutions to Eq. (16.34). 

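Setting the initial condition as the beam being pure $K_2$ before it enters the matter,
we find the amplitude on exiting the regenerator of thickness $L$ is given by

$|K_2(t)\rangle \simeq |K_{LM}(t)\rangle + \delta|K_{SM}(t)\rangle \simeq |K_{LM}\rangle e^{-i\lambda_{LM}t} + \delta|K_{SM}\rangle e^{-i\lambda_{SM}t}$
$\qquad \simeq \{|K_2\rangle - \delta|K_1\rangle\}e^{-i\lambda_{LM}t} + \delta|K_1\rangle e^{-i\lambda_{SM}t}$
$\qquad = e^{-i\lambda_{LM}t}\left[|K_2\rangle + (e^{i\Delta\lambda t} - 1)\,\delta\,|K_1\rangle\right]$ (16.38)

Inserting the time $t = L/\beta\gamma$ needed to travel the distance $L$ and comparing with Eq. (16.30), we
obtain

$\rho(L) = (e^{i\Delta\lambda L/\beta\gamma} - 1)\,\delta = -\frac{\pi N}{m}\,\frac{f - \bar f}{\Delta\lambda}\left(1 - e^{i\Delta\lambda L/\beta\gamma}\right)$
$\qquad = \frac{i\pi N}{k}(f - \bar f)\,l_S\,\frac{1 - \exp[\{i(\Delta m/\Gamma_S) - 1/2\}L/l_S]}{1/2 - i(\Delta m/\Gamma_S)}$ (16.39)

$|\rho(L)|^2 = \pi^2N^2\left|\frac{f - \bar f}{m}\right|^2\frac{1 + e^{-L/l_S} - 2e^{-L/(2l_S)}\cos[(\Delta m/\Gamma_S)L/l_S]}{|\Delta m|^2 + \Gamma_S^2/4}$

In deriving the above expression, we have neglected $\Gamma_L$ compared to $\Gamma_S$ as $\Gamma_S \gg \Gamma_L$.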
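The thickness dependence of the regeneration amplitude can be checked numerically. The sketch below compares the modulus squared of the complex factor in Eq. (16.39) with the closed-form expression for $|\rho(L)|^2$, dropping the common prefactor. Here $x = L/l_S$ and $r = \Delta m/\Gamma_S$; the value $r = 0.4738$ is taken from Eq. (16.23), and the function names are illustrative.

```python
import cmath
import math

R = 0.4738  # Delta m / Gamma_S from Eq. (16.23)


def shape_amplitude(x, r=R):
    """Thickness-dependent complex factor of rho(L), with x = L / l_S."""
    return (1 - cmath.exp((1j * r - 0.5) * x)) / (0.5 - 1j * r)


def shape_intensity(x, r=R):
    """Closed form of |shape_amplitude|^2 in units where Gamma_S = 1."""
    numerator = 1 + math.exp(-x) - 2 * math.exp(-x / 2) * math.cos(r * x)
    return numerator / (r * r + 0.25)


for x in (0.1, 1.0, 3.0, 10.0):
    direct = abs(shape_amplitude(x)) ** 2
    closed = shape_intensity(x)
    print(f"L/l_S = {x:5.1f}: |amplitude|^2 = {direct:.6f}, closed form = {closed:.6f}")
```

For thick regenerators the factor saturates at $1/(r^2 + 1/4)$, because $K_1$ regenerated more than a few $l_S$ upstream of the exit has already decayed.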

Sign of $\Delta m$  The sign of $\Delta m$ can be determined if there is another $K_1$ component
whose intensity is comparable to that of the regenerated $K_1$.

1. One method is to use a pure $K^0$ ($\bar K^0$) beam at $t = 0$. Suppose the regenerator
of thickness $L = \beta\gamma t_L$ is set at distance $D = \beta\gamma t_D$ from the position of
beam generation. The amplitude of the beam on exiting the regenerator can be
described as

$\psi(t_D + t_L) \simeq |K_1\rangle e^{-i\lambda_S(t_D + t_L)} + e^{-i\lambda_L(t_D + t_L)}\{|K_2\rangle + \rho(L)|K_1\rangle\}$
$\qquad = e^{-i\lambda_L(t_D + t_L)}\left[|K_2\rangle + \{e^{i\Delta\lambda(t_D + t_L)} + \rho(L)\}|K_1\rangle\right]$ (16.40)

The intensity of the $K_1$ component is proportional to

$e^{-(D + L)/l_S} + |\rho(L)|^2 + 2e^{-(D + L)/2l_S}|\rho(L)|\cos\left\{\arg[\rho(L)] - \frac{\Delta m}{\Gamma_S}\frac{D + L}{l_S}\right\}$ (16.41)

where $l_S = \beta\gamma/\Gamma_S$ is the mean attenuation length of $K_1$. By changing $L$, the
phase of the cosine term can be measured as a function of $L$ and hence the sign of
$\Delta m$ can be determined. Using this method, the sign of $\Delta m = m_L - m_S$ was
determined to be positive [272].

2. The second method is to use a CP-violating $K_1$ component (to be described
soon) in the pure $K_L$ beam. In this case, the $K_1$ component of the beam after
passing the regenerator is given by

$\langle K_1|\psi(t)\rangle \simeq \eta\,e^{-i\lambda_L t} + \rho(L)\,e^{-i\lambda_S t}$ (16.42)

where $\eta$ is the CP-violating amplitude ($K_1$ in $K_L$). The $K_1$ intensity is proportional to

$|\eta|^2e^{-\Gamma_L t} + |\rho|^2e^{-\Gamma_S t} + 2|\eta||\rho|\,e^{-\bar\Gamma t}\cos[\Delta m\,t + \arg(\rho) - \arg(\eta)]$ (16.43)

Then by varying the length of the regenerator and measuring the ensuing modulation,
the sign and magnitude of $\Delta m$ can be determined. Note, however, this
can be done only after the CP-violating amplitude has been determined without
the regenerator.

16.1.5 

Discovery of CP Violation 

CP violation was discovered in 1964, seven years after the discovery of parity violation.
As the long-lived $K_2$ has $CP = -1$, it was thought that it can decay only to $3\pi$ and
$\pi^\pm l^\mp\nu_l$, and not to $2\pi$. Fitch and Cronin [99] observed for the first time that $K_2$
decays to $2\pi$. Figure 16.6 shows their apparatus.

The experiment was carried out using the 30 GeV AGS (Alternating Gradient
Synchrotron) proton accelerator at Brookhaven National Laboratory (BNL). Neutral
kaons were extracted from a secondary beam, created by directing the primary
proton beam onto a Be target, through a collimator at 30°. Charged particles
were eliminated by a sweeping magnet. The beam had an average momentum
of $\sim 1100$ MeV/c, and a helium bag5) was placed in the decay region at $l \sim 20\,l_S$
($l_S$ being the decay length of $K_1$) to minimize spurious interactions. To measure the decay points
and momenta of the decay $\pi^\pm$, a pair of spectrometers, each consisting of a magnet, scintillation
counters and spark chambers, was placed in a symmetrical configuration.

5) This is a poor man's vacuum chamber. One would also have used proportional tubes or drift 
chambers with much less material and fast response instead of spark chambers if modern 
techniques had been available. 
Figure 16.6 Discovery of CP violation. Layout of apparatus. The $K_2$ beam comes in from the
left and decays in a helium bag. The $2\pi$ are detected by a pair of telescopes, which consist
of a magnet, scintillation counters and spark chambers, and determine the pions' momenta.
The helium bag contains regenerators to produce $2\pi$ for calibration [99].

Furthermore, to calibrate the apparatus, a total of 43 g of tungsten was placed at
every 28 cm in the decay region. The regenerated $K_1$ has the same momentum as its
parent $K_2$; it is an ideal calibrator for $K_2 \to 2\pi$ events. Denoting the momenta
of the decay pions as $\mathbf{p}_1$, $\mathbf{p}_2$ and assuming the observed particles are pions, one can
reconstruct an effective mass $m^*$ of the parent particle from the equation

$m^{*2} = (E_1 + E_2)^2 - (\mathbf{p}_1 + \mathbf{p}_2)^2 \simeq 2m_\pi^2 + 2p_1p_2(1 - \cos\theta)$ (16.44)
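The reconstruction in Eq. (16.44) amounts to computing the invariant mass of the two tracks under a pion mass hypothesis. The following sketch implements the exact expression $(E_1 + E_2)^2 - (\mathbf{p}_1 + \mathbf{p}_2)^2$; the function name, the pion mass value and the sample momenta are assumptions of this example and are not taken from the experiment.

```python
import math

M_PI_GEV = 0.13957   # charged pion mass in GeV (assumed approximate value)
M_K_GEV = 0.4976     # neutral kaon mass in GeV (assumed approximate value)


def effective_mass(p1, p2, m1=M_PI_GEV, m2=M_PI_GEV):
    """Invariant mass of two tracks with 3-momenta p1, p2 (GeV/c) and assumed masses m1, m2."""
    e1 = math.sqrt(m1 ** 2 + sum(c * c for c in p1))
    e2 = math.sqrt(m2 ** 2 + sum(c * c for c in p2))
    total = [a + b for a, b in zip(p1, p2)]
    mass_squared = (e1 + e2) ** 2 - sum(c * c for c in total)
    return math.sqrt(max(mass_squared, 0.0))


# Hypothetical K -> pi+ pi- event: pions emitted transversely in the kaon rest frame,
# then boosted along z by a kaon momentum of 1.1 GeV/c (each pion carries half of it).
p_star = math.sqrt(M_K_GEV ** 2 / 4 - M_PI_GEV ** 2)
pion_plus = (p_star, 0.0, 0.55)
pion_minus = (-p_star, 0.0, 0.55)

m_star = effective_mass(pion_plus, pion_minus)
print(f"m* = {1000 * m_star:.1f} MeV")
```

For a genuine two-pion decay the reconstructed $m^*$ reproduces the kaon mass and the summed momentum points along the beam, whereas three-body decays with a missing particle give a lower $m^*$ and a summed momentum off the beam axis.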



Events that give $m^* \simeq m_K$ are identified as decay products from kaons. Backgrounds
from other kaon decays etc. are eliminated by requiring that the reconstructed
kaon momentum $\mathbf{p}_K = \mathbf{p}_1 + \mathbf{p}_2$ is directed along the beam direction. Figure 16.7
shows events with $m^* \simeq m_K$ and their momentum in the same direction
as the beam. A clear signal above the background can be seen in the middle graph
on the right-hand side. From the data

$\mathrm{BR}(K_2 \to \pi^+\pi^-) = \frac{\Gamma(K_2 \to \pi^+\pi^-)}{\Gamma(K_2 \to \text{all})} = (2.0 \pm 0.4) \times 10^{-3}$ (16.45)

was obtained. In 1967, $K_2 \to \pi^0\pi^0$ was observed, too [110, 153].

Since the observed kaons with short and long lives were proved not to be eigenstates
of CP, we rename them $K_S$ and $K_L$ instead of $K_1$ and $K_2$.
Figure 16.7 (a) Reconstructed mass distribution (horizontal axis: $m^*$ from 300 to 600 MeV) [99].
(b) Angular distributions (horizontal axis: $\cos\theta$ from 0.9996 to 0.9999). Only data in the region
$m^* \simeq m_K$ ($484 < m^* < 494$) show a peak at $\theta = 0$ that is above the background. This is
evidence for $K_2 \to 2\pi$.

16.2 

Formalism of CP and CPT Violation 

16.2.1 

CP, T, CPT Transformation Properties 

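We have to define CP violation parameters to describe the observed phenomena.
As CP, T and CPT are related, we first list the transformation properties of the CP, T and CPT
operations on particle states (see Appendix F). Denoting phase factors $\eta_i = e^{i\xi_i}$,

$CP|q, \mathbf{p}, s_\sigma : {}^{\rm in}_{\rm out}\rangle = \eta_{CP}|\bar q, -\mathbf{p}, s_\sigma : {}^{\rm in}_{\rm out}\rangle$ (16.46a)

$T|q, \mathbf{p}, s_\sigma : {}^{\rm in}_{\rm out}\rangle = \eta_T|q, -\mathbf{p}, -s_\sigma : {}^{\rm out}_{\rm in}\rangle$ (16.46b)

$CPT|q, \mathbf{p}, s_\sigma : {}^{\rm in}_{\rm out}\rangle = \eta_{CPT}|\bar q, \mathbf{p}, -s_\sigma : {}^{\rm out}_{\rm in}\rangle$ (16.46c)

$\eta_{CPT} = \eta_{CP}\eta_T$ (16.46d)

where $q$ ($\bar q$) denotes a particle (antiparticle), $\mathbf{p}$ its momentum and $s_\sigma$ its spin component.
Note that T and CPT are antiunitary operators and complex conjugation of c-numbers
is to be included in their operation (see Sect. 9.2.2). Defining

$O_i|\bar q\rangle = \bar\eta_i|O_i\bar q\rangle$, $O_i = CP, T, CPT$

and using $O_i^2 = 1$, we have

$\bar\eta_i = \eta_i^*$ (16.47)

By requiring $CPT = TCP$, we have

$CPT|q, \mathbf{p}, s_\sigma : {\rm in}\rangle = CP\,\eta_T|q, -\mathbf{p}, -s_\sigma : {\rm out}\rangle = \eta_T\eta_{CP}|\bar q, \mathbf{p}, -s_\sigma : {\rm out}\rangle$
$TCP|q, \mathbf{p}, s_\sigma : {\rm in}\rangle = T\,\eta_{CP}|\bar q, -\mathbf{p}, s_\sigma : {\rm in}\rangle = \bar\eta_T\eta_{CP}^*|\bar q, \mathbf{p}, -s_\sigma : {\rm out}\rangle$ (16.48)

where we have taken the complex conjugate of c-numbers. This means

$\eta_{CPT} = \bar\eta_{CPT} = \eta_{CPT}^* = \eta_{CP}\eta_T = \pm 1$
$\bar\eta_{CP} = \eta_{CP}^*$ (16.49)
$\eta_T = \bar\eta_T^* = \eta_{CPT}\eta_{CP}^* = \bar\eta_T\bar\eta_{CP}^2$, $\bar\eta_T = \eta_{CPT}\eta_{CP} = \eta_T\eta_{CP}^2$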

Mass Matrix  We use $|A\rangle$ to denote a one-particle state and $|B\rangle$ for continuum
states to which $A$ can decay weakly. As there is no distinction between "in" and
"out" for a one-particle state, $|A : {\rm in}\rangle = |A : {\rm out}\rangle = |A\rangle$. For spinless kaons in the
rest frame, indices other than particle species are irrelevant. Applying T invariance
to the amplitude,

$\langle i|H_W|j\rangle = \langle i|T^{-1}TH_WT^{-1}T|j\rangle = (\eta_i^*\eta_j\langle i|H_W|j\rangle)^* = \eta_i\eta_j^*\langle j|H_W|i\rangle$
$\therefore\ \langle n|H_W|K^0\rangle = \eta_T^*\eta_n\langle K^0|H_W|n\rangle$ (16.50)
$\quad\ \langle n|H_W|\bar K^0\rangle = \bar\eta_T^*\eta_n\langle \bar K^0|H_W|n\rangle$

Applying Eq. (16.50) to the decay matrix in Eq. (16.11),

$\Gamma_{21} = 2\pi\sum_n\delta(E_0 - E_n)\langle 2|T^{-1}TH_WT^{-1}T|n\rangle\langle n|T^{-1}TH_WT^{-1}T|1\rangle$
$\quad = 2\pi\sum_{Tn}\delta(E_0 - E_n)(\bar\eta_T^*\eta_n)^*\langle Tn|H_W|T2\rangle(\eta_T^*\eta_n)\langle T1|H_W|Tn\rangle$
$\quad = (\bar\eta_T\eta_T^*)\,2\pi\sum_n\delta(E_0 - E_n)\langle 1|H_W|n\rangle\langle n|H_W|2\rangle = \eta_{CP}^2\Gamma_{12}$ (16.51)

where $|Tn\rangle$ denotes the T-transformed state defined by Eq. (16.46). As T changes the
particle's momentum as well as spin orientation, $|Tn\rangle \neq |n\rangle$, but summation over
all the continuum states makes the difference disappear. Similar arguments apply
to the mass part in Eq. (16.11), and

$M_{21} = \eta_{CP}^2M_{12}$, $\Gamma_{21} = \eta_{CP}^2\Gamma_{12}$ (16.52a)

This means

$\Lambda_{21} = \eta_{CP}^2\Lambda_{12}$, or $|\Lambda_{12}| = |\Lambda_{21}|$   T invariance (16.52b)

$\frac{\Gamma_{12}^*}{\Gamma_{12}} = \frac{M_{12}^*}{M_{12}}$, or ${\rm Im}(M_{12}^*\Gamma_{12}) = 0$ (16.52c)

No constraints are obtained for $\Lambda_{11}$ and $\Lambda_{22}$.
Requiring CPT invariance gives

$\Lambda_{11} = \Lambda_{22}$   CPT invariance (16.53)

No constraints are made for $\Lambda_{12}$, $\Lambda_{21}$.

$\Lambda_{11} = \Lambda_{22}$ and $|\Lambda_{12}| = |\Lambda_{21}|$   CP invariance (16.54)

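Symmetry Properties of Decay Amplitudes  Denoting the CPT operation as $\Theta$, the
application of the requirement of its invariance to a weak decay amplitude for a
process $i \to f$ can be written as

$\langle f : {\rm out}|H_W|i\rangle = \langle f : {\rm out}|\Theta^{-1}\Theta H_W\Theta^{-1}\Theta|i\rangle = \langle\Theta f : {\rm in}|H_W|\Theta i\rangle^*$
$\quad = \sum\left(\langle\Theta f : {\rm in}|\Theta f : {\rm out}\rangle\langle\Theta f : {\rm out}|H_W|\Theta i\rangle\right)^*$
$\quad = \langle\Theta f : {\rm out}|H_W|\Theta i\rangle^*e^{2i\delta_f}$ (16.55)

where we have used [see Eq. (6.12)]

$\langle f : {\rm out}|f : {\rm in}\rangle = \langle f : {\rm out}|S|f : {\rm out}\rangle = S_{ff} = e^{2i\delta_f}$ (16.56)

$\delta_f$ is the scattering phase shift of the $f$ state as well as of $\bar f$, because the strong
interaction is invariant under the C transformation. $|\Theta f\rangle$ is a CPT-transformed state as
defined in Eq. (16.46). Then Eq. (16.55) means that a decay amplitude and its CPT
conjugate are related by

$\langle f : {\rm out}|H_W|i\rangle = Ae^{i\delta_f}$
$\langle\Theta f : {\rm out}|H_W|\Theta i\rangle = A^*e^{i\delta_f}$ (16.57)

where $A$ is a complex number. Similarly, T and CP invariance can be expressed as

$\langle f : {\rm out}|H_W|i\rangle = \langle f : {\rm out}|T^{-1}TH_WT^{-1}T|i\rangle = \langle Tf : {\rm in}|H_W|Ti\rangle^*$
$\quad = \langle Tf : {\rm out}|H_W|Ti\rangle^*e^{2i\delta_f}$ (16.58)

$\langle f : {\rm out}|H_W|i\rangle = \langle CPf : {\rm out}|H_W|CPi\rangle$ (16.59)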
Neutral K Meson Decays  As the K meson is a scalar,

$|\Theta K^0\rangle = |CP\,K^0\rangle = |\bar K^0\rangle$, $|TK^0\rangle = |K^0\rangle$ (16.60)

Since a momentum-inverted final state can be reproduced by rotation from the
original state and there is no specific orientation in the total angular momentum
zero state, we have

$|\Theta f : {\rm out}\rangle = |CPf : {\rm out}\rangle = |\bar f : {\rm out}\rangle$, $|Tf : {\rm out}\rangle = |f : {\rm out}\rangle$ (16.61)

Separating the strong phase and defining weak amplitudes $A$ and $\bar A$ by

$\langle f : {\rm out}|H_W|K^0\rangle = A_fe^{i\delta_f}$, $\langle\bar f : {\rm out}|H_W|\bar K^0\rangle = \bar A_{\bar f}e^{i\delta_f}$ (16.62)

Equations (16.57), (16.58) and (16.59) relate one to the other:

CPT  $\bar A(\bar i \to \bar f) \equiv \bar A_{\bar f} = A^*(i \to f)$ (16.63a)

T  $A(i \to f) \equiv A_f = A^*(i \to f)$ (16.63b)

CP  $\bar A(\bar i \to \bar f) = A(i \to f)$ (16.63c)

16.2.2 

Definition of CP Parameters 

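Now we are ready to apply the formalism we have developed so far to the neutral K mesons.

Mixing Parameters  Since we have learned that the observed neutral kaons are not eigenstates
of CP, we can write

$|K_S\rangle = \frac{1}{\sqrt{1 + |\epsilon_S|^2}}\left(|K_1\rangle + \epsilon_S|K_2\rangle\right) = \frac{1}{\sqrt{2(1 + |\epsilon_S|^2)}}\left\{(1 + \epsilon_S)|K^0\rangle + (1 - \epsilon_S)|\bar K^0\rangle\right\}$

$|K_L\rangle = \frac{1}{\sqrt{1 + |\epsilon_L|^2}}\left(|K_2\rangle + \epsilon_L|K_1\rangle\right) = \frac{1}{\sqrt{2(1 + |\epsilon_L|^2)}}\left\{(1 + \epsilon_L)|K^0\rangle - (1 - \epsilon_L)|\bar K^0\rangle\right\}$ (16.64a)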

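$\epsilon_{S,L} \neq 0$ means CP violation. They need not be equal. We decompose them as

$\epsilon_S = \epsilon + \delta$, $\epsilon_L = \epsilon - \delta$ (16.64b)

We shall show soon that CPT invariance means $\delta = 0$ and T invariance means
$\epsilon = 0$. For the sake of later discussions, we introduce an alternative form for expressing
$K_S$ and $K_L$:

$|K_S\rangle = p\sqrt{1 - z}\,|K^0\rangle + q\sqrt{1 + z}\,|\bar K^0\rangle$ (16.65a)

$|K_L\rangle = p\sqrt{1 + z}\,|K^0\rangle - q\sqrt{1 - z}\,|\bar K^0\rangle$ (16.65b)

$|p|^2 + |q|^2 = 1$ (16.65c)

Then it is easy to show that the mass matrix is expressed as

$\Lambda = \begin{bmatrix}\Lambda_{11} & \Lambda_{12}\\ \Lambda_{21} & \Lambda_{22}\end{bmatrix} = \begin{bmatrix} M + \frac{z}{2}\Delta\lambda & -\frac{p}{2q}\sqrt{1 - z^2}\,\Delta\lambda\\ -\frac{q}{2p}\sqrt{1 - z^2}\,\Delta\lambda & M - \frac{z}{2}\Delta\lambda\end{bmatrix}$, $M = \frac{1}{2}(\lambda_L + \lambda_S)$ (16.66)

Problem 16.3

Derive Eq. (16.66).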

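CPT and T invariance mean $\Lambda_{11} = \Lambda_{22}$ and $|\Lambda_{21}| = |\Lambda_{12}|$, which, in turn,
mean $z = 0$ (CPT) and $|q/p| = 1$ (T), respectively. In terms of the mass matrix
elements, $\delta$, $\epsilon$ can also be defined as

$\delta = -\frac{\Lambda_{11} - \Lambda_{22}}{2\Delta\lambda}$, $\epsilon = -\frac{\Lambda_{12} - \Lambda_{21}}{2\Delta\lambda}$ (16.67)

The two definitions Eqs. (16.67) and (16.64) are equivalent in first order in $\epsilon$ or $\delta$.
Since the second-order term never appears in this book, either definition is valid.
The former is convenient for theoretical and the latter for experimental evaluations.
To first order, $z$, $p$, $q$ are related to $\delta$, $\epsilon$ by

$z = -2\delta$ (16.68a)

$\frac{p}{q} = \frac{1 + \epsilon}{1 - \epsilon}$, $\epsilon = \frac{p - q}{p + q}$ (16.68b)

For theoretical discussions, $\epsilon$ is expressed as

$\epsilon = -\frac{{\rm Im}(M_{12}) - i\,{\rm Im}(\Gamma_{12})/2}{\Delta\Gamma/2 - i\Delta m} = \frac{\sin\phi_{SW}\,e^{i\phi_{SW}}}{\Delta m}\left[-{\rm Im}(M_{12}) + \frac{i}{2}{\rm Im}(\Gamma_{12})\right]$ (16.69a)

$\phi_{SW} = \tan^{-1}\frac{2\Delta m}{\Delta\Gamma} = 43.49^\circ \pm 0.0\ldots$ (16.69b)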
where $\phi_{SW}$ is referred to as the superweak phase for the reason discussed around
Eq. (16.184).

To summarize, CP is violated if $\epsilon_S$ or $\epsilon_L$ is nonzero. CPT invariance means $\delta = 0$
which, in turn, means $\epsilon_S = \epsilon_L = \epsilon$. In this case, CP violation is equivalent to T
violation and ${\rm Re}(\epsilon) \neq 0$. Note that $\epsilon$ itself is a phase-dependent variable, as we will
discuss in more detail in Sect. 16.3.2.

Problem 16.4

Prove that the solution of Eq. (16.66), where $\delta$, $\epsilon$ are defined by Eq. (16.67), is given
by Eq. (16.64) in first order in $\delta$ and $\epsilon$.



Bell-Steinberger Relation  Taking the inner product of $|K_S\rangle$ and $|K_L\rangle$, we see

$\langle K_S|K_L\rangle = 2[{\rm Re}(\epsilon) - i\,{\rm Im}(\delta)]$ (16.70)

As this equation shows, the observed states $K_S$ and $K_L$ are not orthogonal and mix
together. The amount of overlap is a measure of CP and CPT violation. Its value
can be estimated approximately as follows. Using the relation

$\Lambda|K_S\rangle = \lambda_S|K_S\rangle$, $\Lambda|K_L\rangle = \lambda_L|K_L\rangle$ (16.71)

and multiplying by $\langle K_L|$ and $\langle K_S|$ from the left, respectively, we obtain

$\langle K_L|\Lambda|K_S\rangle = \lambda_S\langle K_L|K_S\rangle$ (16.72a)

$\langle K_S|\Lambda|K_L\rangle = \lambda_L\langle K_S|K_L\rangle$ (16.72b)

Subtracting the complex conjugate of Eq. (16.72a) from Eq. (16.72b), we have

$(\lambda_L - \lambda_S^*)\langle K_S|K_L\rangle = \langle K_S|\Lambda - \Lambda^\dagger|K_L\rangle = -i\langle K_S|\Gamma|K_L\rangle$ (16.73)

Insertion of $\langle K_S|K_L\rangle = 2[{\rm Re}(\epsilon) - i\,{\rm Im}(\delta)]$ and Eq. (16.11c) gives

$2(\bar\Gamma + i\Delta m)[{\rm Re}(\epsilon) - i\,{\rm Im}(\delta)] = \sum_f A_L(f)A_S^*(f)\rho(f)$ (16.74)

$A_L(f) = \langle f|H_W|K_L\rangle$, $A_S^*(f) = \langle K_S|H_W|f\rangle$

where $\rho(f)$ denotes the phase space volume of the state $f$. The expression is
independent of the phase convention. The rhs is on shell and the terms are observables, hence
can be determined by experiments. Equation (16.74) is called the Bell-Steinberger
relation or unitarity condition, as it is derived from the unitarity relation Eq. (16.7).
Denoting the partial decay width as $\Gamma(K_{S,L} \to f) = |A_{S,L}(f)|^2\rho(f)$ and applying
the Schwarz inequality to Eq. (16.74), we have

$2|\bar\Gamma + i\Delta m|\,|{\rm Re}(\epsilon) - i\,{\rm Im}(\delta)| \le \sum_f\sqrt{\Gamma_S(f)\Gamma_L(f)}$

$\therefore\ |{\rm Re}(\epsilon) - i\,{\rm Im}(\delta)| \le \frac{\sum_f\sqrt{\Gamma_S(f)\Gamma_L(f)}}{2\sqrt{\bar\Gamma^2 + \Delta m^2}}$ (16.75)

This is the interesting result that a bound can be placed on a CP- and CPT-violating
quantity from knowledge of CP-conserving quantities.
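A rough numerical illustration of such a bound can be sketched as follows. It relies on an extra, deliberately coarse step that is not taken from the text: the Cauchy-Schwarz inequality $\sum_f\sqrt{\Gamma_S(f)\Gamma_L(f)} \le \sqrt{\Gamma_S\Gamma_L}$, which replaces the sum over modes by the total widths. The lifetimes are assumed approximate values, and everything is expressed in units of $\Gamma_S$.

```python
import math

TAU_S = 0.8954e-10     # K_S lifetime in s (assumed approximate value)
TAU_L = 5.116e-8       # K_L lifetime in s (assumed approximate value)
DM_OVER_GAMMA_S = 0.4738  # Delta m / Gamma_S from Eq. (16.23)

gamma_s = 1.0
gamma_l = TAU_S / TAU_L            # Gamma_L in units of Gamma_S
gamma_bar = (gamma_s + gamma_l) / 2

# Coarse upper limit on the sum over modes: sqrt(Gamma_S * Gamma_L).
sum_upper_limit = math.sqrt(gamma_s * gamma_l)

# Upper limit on |Re(eps) - i Im(delta)| from the inequality, using the coarse sum.
overlap_bound = sum_upper_limit / (2 * math.hypot(gamma_bar, DM_OVER_GAMMA_S))

print(f"Gamma_L / Gamma_S = {gamma_l:.3e}")
print(f"|Re(eps) - i Im(delta)| <~ {overlap_bound:.3f}")
```

Even this crude version limits the overlap to a few percent using only lifetimes and the mass difference; a mode-by-mode evaluation of the sum would tighten it considerably, since most channels are dominated by either $K_S$ or $K_L$ but not both.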



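Time Development of $K^0$ and $\bar K^0$  The time development of a wave function is described
by $|\psi(t)\rangle = e^{-i\Lambda t}|\psi(0)\rangle$. To express it in a more tangible form, we write
the inverse of Eq. (16.65):

$|K^0\rangle = \frac{1}{2p}\left(\sqrt{1 - z}\,|K_S\rangle + \sqrt{1 + z}\,|K_L\rangle\right)$
$|\bar K^0\rangle = \frac{1}{2q}\left(\sqrt{1 + z}\,|K_S\rangle - \sqrt{1 - z}\,|K_L\rangle\right)$ (16.76)

Then the time development of a pure $K^0$ at $t = 0$ is given by

$|K^0(t)\rangle = e^{-i\Lambda t}|K^0\rangle = \frac{1}{2p}\left[\sqrt{1 - z}\,|K_S\rangle e^{-i\lambda_St} + \sqrt{1 + z}\,|K_L\rangle e^{-i\lambda_Lt}\right]$
$\quad = [g_+(t) + zg_-(t)]|K^0\rangle - \sqrt{1 - z^2}\,\frac{q}{p}\,g_-(t)|\bar K^0\rangle$

$g_\pm(t) = \frac{1}{2}\left(e^{-i\lambda_Lt} \pm e^{-i\lambda_St}\right)$

Similarly

$|\bar K^0(t)\rangle = [g_+(t) - zg_-(t)]|\bar K^0\rangle - \sqrt{1 - z^2}\,\frac{p}{q}\,g_-(t)|K^0\rangle$ (16.77)

Combining the two equations, we have

$e^{-i\Lambda t} = \begin{bmatrix} g_+(t) + zg_-(t) & -\frac{p}{q}\sqrt{1 - z^2}\,g_-(t)\\ -\frac{q}{p}\sqrt{1 - z^2}\,g_-(t) & g_+(t) - zg_-(t)\end{bmatrix}$ (16.78)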
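One way to gain confidence in the evolution matrix is to check numerically that it composes correctly in time, $U(t_1)U(t_2) = U(t_1 + t_2)$, as any operator of the form $e^{-i\Lambda t}$ must. The sketch below does this for hypothetical parameter values (the eigenvalues, $z$ and $q/p$ are made up for illustration, in units where $\Gamma_S = 1$); the function names are assumptions of this example.

```python
import cmath


def g_plus_minus(t, lam_s, lam_l):
    """g_pm(t) = (exp(-i lam_L t) +/- exp(-i lam_S t)) / 2."""
    e_l = cmath.exp(-1j * lam_l * t)
    e_s = cmath.exp(-1j * lam_s * t)
    return (e_l + e_s) / 2, (e_l - e_s) / 2


def evolution_matrix(t, lam_s, lam_l, z, q_over_p):
    """2x2 evolution matrix in the (K0, K0bar) basis; rows are final and columns initial states."""
    g_p, g_m = g_plus_minus(t, lam_s, lam_l)
    root = cmath.sqrt(1 - z * z)
    return [
        [g_p + z * g_m, -root * g_m / q_over_p],
        [-root * q_over_p * g_m, g_p - z * g_m],
    ]


def matmul2(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)] for i in range(2)]


# Hypothetical parameters (units of Gamma_S); z and q/p are deliberately exaggerated.
LAM_S = 0.0 - 0.5j
LAM_L = 0.4738 - 0.000875j
Z = 0.01 + 0.02j
Q_OVER_P = 0.997 * cmath.exp(0.1j)

u1 = evolution_matrix(1.3, LAM_S, LAM_L, Z, Q_OVER_P)
u2 = evolution_matrix(2.1, LAM_S, LAM_L, Z, Q_OVER_P)
u12 = evolution_matrix(3.4, LAM_S, LAM_L, Z, Q_OVER_P)
product = matmul2(u1, u2)

max_deviation = max(abs(product[i][j] - u12[i][j]) for i in range(2) for j in range(2))
print("largest deviation from the composition law:", max_deviation)
```

The composition law holds for any $z$ and $q/p$ because the matrix multiplying $g_-(t)$ squares to the unit matrix, which is the algebraic content of the statement that the formula is exact.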
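This is an exact formula, but in most cases approximations to first order in $\delta$, $\epsilon$ are
more convenient. We list several equivalent formulae for later reference. Substituting
$z = -2\delta$, $p/q \simeq (1 + \epsilon)/(1 - \epsilon)$, we have

$|K^0(t)\rangle = |K^0\rangle\langle K^0|e^{-i\Lambda t}|K^0\rangle + |\bar K^0\rangle\langle\bar K^0|e^{-i\Lambda t}|K^0\rangle$ (16.79a)

$\quad = |K^0\rangle[g_+(t) - 2\delta g_-(t)] + |\bar K^0\rangle[2(\Lambda_{21}/\Delta\lambda)g_-(t)]$ (16.79b)

$\quad = |K^0\rangle[g_+(t) - 2\delta g_-(t)] - |\bar K^0\rangle[(1 - 2\epsilon)g_-(t)] + O(\epsilon^2, \delta^2)$ (16.79c)

$\quad = \frac{1}{\sqrt{2}}\left[(1 - \epsilon + \delta)e^{-i\lambda_St}|K_S\rangle + (1 - \epsilon - \delta)e^{-i\lambda_Lt}|K_L\rangle\right] + O(\epsilon^2, \delta^2)$ (16.79d)

$|\bar K^0(t)\rangle = |\bar K^0\rangle\langle\bar K^0|e^{-i\Lambda t}|\bar K^0\rangle + |K^0\rangle\langle K^0|e^{-i\Lambda t}|\bar K^0\rangle$ (16.80a)

$\quad = |\bar K^0\rangle[g_+(t) + 2\delta g_-(t)] + |K^0\rangle[2(\Lambda_{12}/\Delta\lambda)g_-(t)]$ (16.80b)

$\quad = |\bar K^0\rangle[g_+(t) + 2\delta g_-(t)] - |K^0\rangle[(1 + 2\epsilon)g_-(t)] + O(\epsilon^2, \delta^2)$ (16.80c)

$\quad = \frac{1}{\sqrt{2}}\left[(1 + \epsilon - \delta)e^{-i\lambda_St}|K_S\rangle - (1 + \epsilon + \delta)e^{-i\lambda_Lt}|K_L\rangle\right] + O(\epsilon^2, \delta^2)$ (16.80d)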

Formulae with CPT Invariance Assumption As d ~ experimentally, and since 
many expressions can be simplified, we assume CPT invariance from now on until 
we test it later in Sect. 16.4. Setting z = 0, we have from Eqs. (16.14) and (16.15) 

A U = A 22 = M, l s l = A u ± ^A 12 A 2 i = A n ± -A 12 

P 

= A n ± -A 21 (16.81a) 

q 

Au = _^AZ, An = _*L± t ^ + ^ 12 = _ Aa + 0(e 2 ) 

2 q 2 p 

(16.81b) 



M* 2 -ir*/2 
I m 12 - ir 12 /2 



A A = -2 V^li2^2i = -(^21 + ^12) + 0(e 2 ) ~ -2 I ReM 12 + -Rer 12 J 

(16.81d) 

The above equations are obtained simply by setting $z = 0$ in Eq. (16.66). Note that, by rephasing [Eq. (16.17)], $\Lambda_{12} \to e^{2i\xi}\Lambda_{12}$, $\Lambda_{21} \to e^{-2i\xi}\Lambda_{21}$, but also, by Eq. (16.81c),
$q/p \to (q/p)e^{-2i\xi}$; therefore $(q/p)\Lambda_{12}$ and $(p/q)\Lambda_{21}$ are phase-independent
variables. They should be, as $\Delta\lambda$ is an observable. Then our choice of phase,
Eq. (16.64a), or equivalently Eq. (16.68b), constrains the phase of $\Lambda_{12}$ and $\Lambda_{21}$, too.
Actually, by virtue of Eq. (16.81b), $\Lambda_{12}, \Lambda_{21} = -\Delta\lambda/2 + O(\epsilon)$. Therefore both $M_{12}$
and $\Gamma_{12}$ are very close to real numbers and their phase is small [$\sim O(\epsilon)$],

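\begin{align}
\therefore\quad \Delta m &= m_L - m_S = -2\,\mathrm{Re}\sqrt{\Lambda_{12}\Lambda_{21}} \simeq -2\,\mathrm{Re}M_{12} \tag{16.82a}\\
\Delta\Gamma &= \Gamma_S - \Gamma_L = -4\,\mathrm{Im}\sqrt{\Lambda_{12}\Lambda_{21}} \simeq 2\,\mathrm{Re}\Gamma_{12}\ {}^{6)} \tag{16.82b}\\
\Delta m\,\Delta\Gamma &= -4\,\mathrm{Re}(M_{12}\Gamma_{12}^*) \tag{16.82c}\\
\mathrm{Im}M_{12} &\ll \mathrm{Re}M_{12}, \qquad \mathrm{Im}\Gamma_{12} \ll \mathrm{Re}\Gamma_{12} \tag{16.82d}
\end{align}

Here $\simeq$ means a small deviation due to CP violation. If CP invariance is assumed,
the relations are exact.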

Problem 16.5 
Derive (16.82). 



16.3 

CP Violation Parameters 

16.3.1 

Observed Parameters 

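2π Decay Parameters $\eta_{+-}$, $\eta_{00}$  The 2π CP-violating parameters are defined by

\begin{align}
\eta_{+-} &\equiv \frac{\langle\pi^+\pi^-|H_W|K_L\rangle}{\langle\pi^+\pi^-|H_W|K_S\rangle} = |\eta_{+-}|e^{i\phi_{+-}} \tag{16.83a}\\
\eta_{00} &\equiv \frac{\langle\pi^0\pi^0|H_W|K_L\rangle}{\langle\pi^0\pi^0|H_W|K_S\rangle} = |\eta_{00}|e^{i\phi_{00}} \tag{16.83b}
\end{align}

$|\eta_{+-}|$ and $|\eta_{00}|$ can be determined from measurement of the 2π decay ratio. The phase angles $\phi_{+-}$, $\phi_{00}$ as well as the sign of $\Delta m$ can be determined from the interference effect between $K_S \to 2\pi$ and $K_L \to 2\pi$. Suppose there is a beam expressed by

\[
|\psi(t)\rangle = |K_L(t)\rangle + \rho|K_S(t)\rangle = |K_L(0)\rangle e^{-i\lambda_L t} + \rho|K_S(0)\rangle e^{-i\lambda_S t}
\]

The 2π decay amplitudes are

\[
\langle 2\pi|\psi(t)\rangle = \left[\eta e^{-i\lambda_L t} + \rho e^{-i\lambda_S t}\right]\langle 2\pi|K_S\rangle
\]

As the number of 2π decays is proportional to $|\langle 2\pi|\psi(t)\rangle|^2$,

\begin{align}
N_{2\pi} &= N_K\left|\rho e^{(i\Delta m - \Gamma_S/2)t} + \eta e^{-(\Gamma_L/2)t}\right|^2 \\
&= N_K\left\{|\eta|^2 e^{-\Gamma_L t} + |\rho|^2 e^{-\Gamma_S t} + 2|\eta||\rho|e^{-\bar\Gamma t}\cos(\Delta m t + \phi_\rho - \phi_\eta)\right\} \tag{16.86}\\
\phi_\rho &= \arg(\rho), \qquad \phi_\eta = \arg(\eta_{+-}\ \text{or}\ \eta_{00}), \qquad \bar\Gamma = (\Gamma_S + \Gamma_L)/2
\end{align}

6) Note our definition of $\Delta m$ and $\Delta\Gamma$ is to make them both positive. This is possible for the neutral kaons because they were [...] use $\Delta m = m_L - m_S$, $\Delta\Gamma = \Gamma_L - \Gamma_S$ or vice versa ($S \leftrightarrow L$) for a general treatment including neutral D and B mesons. With our choice of phase convention and in the absence of CP violation, $\Lambda_{12} = M_{12} - i\Gamma_{12}/2 = -\Delta\lambda/2 = -(\Delta m + i\Delta\Gamma/2)/2$. The physical quantity does not depend on [...] variables may change, so one must be careful about the sign convention when reading the literature.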

One can prepare $\rho$ in two ways.

(1) In the first method, a pure $K^0$ ($\bar K^0$) beam is produced at $t = 0$, in which case
$\rho = \pm 1$, $\phi_\rho = 0$. If $N_{K^0} = N$, $N_{\bar K^0} = \bar N$ at $t = 0$, one substitutes $|\rho|^2 \to 1$ in the
second term and $|\rho| \to (N - \bar N)/(N + \bar N)$ in the third term of Eq. (16.86). In order
to observe the interference effect, one needs

\[
|\rho|\exp(-\bar\Gamma t) \sim |\eta| \sim 2\times10^{-3} \tag{16.87}
\]

This means that the detector is to be placed in the neighborhood of $l = \beta\gamma t \sim 12\,l_S$,
where $l_S = \beta\gamma/\Gamma_S$ is the mean decay length. From the time dependence of the
interference effect, one can determine both $\Delta m$ and $\phi_\eta$. Figure 16.8 shows the
decay curves. When decay products from both pure $K^0$ and $\bar K^0$ beams at $t = 0$ are
added together, there is no interference, as is shown in Fig. 16.8a, but separately
they show the interference effect. Figure 16.9 gives an asymmetry curve defined by

\[
A_{+-} \equiv \frac{N_{2\pi}(\rho=-1) - N_{2\pi}(\rho=1)}{N_{2\pi}(\rho=-1) + N_{2\pi}(\rho=1)}
\simeq -2|\eta_{+-}|\,e^{(\Gamma_S-\Gamma_L)t/2}\cos(\Delta m t - \phi_{+-})
\]

Figure 16.8 Decay curves of pure $K^0$ and $\bar K^0$ [...] interference, (b) $K^0$ and $\bar K^0$ separately [107].

Figure 16.9 Asymmetry curve of neutral $K \to$ [...] initially pure $K^0$ and $\bar K^0$ beams [107].

The world average values of the decay parameters are given by [311] 7)

\begin{align}
|\eta_{+-}| &= (2.233 \pm 0.012)\times10^{-3} \\
|\eta_{00}| &= (2.222 \pm 0.012)\times10^{-3} \\
\phi_{+-} &= 43.51 \pm 0.05^\circ \\
\phi_{00} &= 43.52 \pm 0.05^\circ
\tag{16.89}
\end{align}

Historically, however, both parameters were determined by the second method,
using regeneration.

(2) The second method uses a regenerator to produce a $K_S$ beam. Given the
regenerator thickness $L$, we have $\rho = \rho(L)$ given by (16.31). The measured phase
is $\Delta m t$ and $\phi_\rho - \phi_\eta$. Since both $t$ and $L$ (which resides in $\phi_\rho$) can be varied, $\phi_\eta$ as
well as the sign of $\Delta m$ can be determined. If one observes the semileptonic decay
mode, one can determine $\phi_\rho$ because $\phi_\eta = 0$. Usually, the regenerator method is
accompanied by systematic errors associated with $\phi_\rho$, and it is not quite an ideal
method. However, if one measures both the $\pi^+\pi^-$ and $\pi^0\pi^0$ modes and takes the
difference, the systematic error associated with $\rho$ almost cancels and a very precise
value of $\phi_{00} - \phi_{+-}$ can be obtained. The most recent experimental data give [244]

\[
\Delta\phi = \phi_{00} - \phi_{+-} = 0.39 \pm 0.45^\circ \tag{16.90}
\]

which is consistent with $\phi_{00} = \phi_{+-}$. A global analysis by the Particle Data Group
gives a much better value [311]:

\[
\Delta\phi = 0.006 \pm 0.014^\circ \tag{16.91}
\]

$\Delta\phi$ gives a measure of CPT violation [Eq. (16.175)], which will be discussed later.

7) These values were determined using all the methods discussed in the following. The values

Asymmetry $A_L$  Since the three-body leptonic decay modes are not CP eigenstates,
the CP violation effect appears as the difference between the decay mode $e^+\nu_e\pi^-$
and its CP conjugate $e^-\bar\nu_e\pi^+$, that is, as the asymmetry $A_L$ of the kaon with longer lifetime.



The asymmetry also appears in the muonic decay mode. $K_S$ can also decay to them
in principle but, the 2π decay rate being much larger, the branching ratio is small:

\[
\frac{\Gamma(K_S \to e\nu\pi)}{\Gamma_S} \simeq \frac{\Gamma(K_L \to e\nu\pi)}{\Gamma_S} = \mathrm{BR}(K_L \to e\nu\pi)\,\frac{\Gamma_L}{\Gamma_S} \sim 10^{-3} \tag{16.93}
\]

The CP violation is expected to be even smaller by another factor of $10^{-3}$ and is
more difficult to measure.
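The order-of-magnitude estimate in Eq. (16.93) is a one-line calculation. The sketch below assumes $\mathrm{BR}(K_L \to \pi e\nu) \approx 0.4055$ (a PDG-like value that is not quoted in this section) together with the ratio $\Gamma_L/\Gamma_S \approx 1.75\times10^{-3}$ used later in the text; the function name is hypothetical.

```python
# Assumed inputs: BR(K_L -> pi e nu) is a PDG-like value, not taken from this section.
BR_KL_E3 = 0.4055
GAMMA_L_OVER_GAMMA_S = 1.75e-3


def br_ks_e3_estimate():
    """Estimate BR(K_S -> pi e nu) assuming equal semileptonic partial widths for K_S and K_L."""
    return BR_KL_E3 * GAMMA_L_OVER_GAMMA_S
```

The result, about $7\times10^{-4}$, can be compared with the measured $K_S$ semileptonic branching ratio listed in Eq. (16.112).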

The asymmetry can be determined as follows. The transition amplitudes for $K_{l3}$
are defined by

\begin{align}
A_+ &= \langle e^+\nu\pi^-|T|K^0\rangle, \quad \bar A_- = \langle e^-\bar\nu\pi^+|T|\bar K^0\rangle : \quad \Delta S = \Delta Q \tag{16.94a}\\
A_- &= \langle e^-\bar\nu\pi^+|T|K^0\rangle, \quad \bar A_+ = \langle e^+\nu\pi^-|T|\bar K^0\rangle : \quad \Delta S = -\Delta Q \tag{16.94b}
\end{align}

We assume here that there are no $\Delta S = -\Delta Q$ processes and set $A_- = \bar A_+ = 0$.
CPT invariance constrains $\bar A_- = (A_+)^*$ [Eq. (16.63a)]. The asymmetry of $K_S$, $K_L$
decays is calculated to give

\begin{align}
A_{S,L} &\equiv \frac{\Gamma(K_{S,L} \to e^+\nu_e\pi^-) - \Gamma(K_{S,L} \to e^-\bar\nu_e\pi^+)}{\Gamma(K_{S,L} \to e^+\nu_e\pi^-) + \Gamma(K_{S,L} \to e^-\bar\nu_e\pi^+)} \\
&= \frac{|(1+\epsilon_{S,L})A_+|^2 - |(1-\epsilon_{S,L})\bar A_-|^2}{|(1+\epsilon_{S,L})A_+|^2 + |(1-\epsilon_{S,L})\bar A_-|^2} \\
&= 2\mathrm{Re}(\epsilon_{S,L}) = 2\mathrm{Re}(\epsilon) \pm 2\mathrm{Re}(\delta) \tag{16.95}
\end{align}

where Eq. (16.64a) was used to derive the second equality. We retain the nomenclature
$\epsilon_{S,L} = \epsilon \pm \delta$ to remind us that $A_S$ and $A_L$ are different, although $\delta = 0$ with
CPT invariance. It is a constant independent of time if a pure $K_{S,L}$ beam is used.
Under normal circumstances, the neutral kaon is produced as a mixture of $K^0$ and
$\bar K^0$. Since observing the charge of a decay electron is equivalent to selecting the
flavor with the $\Delta S = \Delta Q$ rule, the asymmetry shows the same time dependence
as that of the strangeness oscillation. But for long times $t \gg 10/\Gamma_S$ after production,
the identity of the parent does not matter and the residual asymmetry agrees
with $A_L$ given by Eq. (16.95). Figure 16.10a,b shows asymmetries of $e$ and $\mu$ as
a function of time. One can recognize interference and residual asymmetry. The
asymmetry is given by

\begin{align}
A_L(e) &= (3.34 \pm 0.07)\times10^{-3} \\
A_L(\mu) &= (3.04 \pm 0.25)\times10^{-3} \tag{16.96}\\
A_L(e+\mu) &= (3.32 \pm 0.06)\times10^{-3}
\end{align}

Note the oscillation is not a CP violation effect. It is induced by the mixing and 
small mass difference of the kaons. 

The decay asymmetry $A_S$ is important for deriving information on $\delta$, but it is
difficult to determine as it has to be measured immediately after production of $K^0$
or $\bar K^0$. Recently it was measured for the first time by the KLOE group [235], though
not so accurately that it can be used for other analyses:

\[
A_S(e) = (1.5 \pm 9.6_{\rm stat} \pm 2.9_{\rm syst})\times10^{-3} \tag{16.97}
\]



Show that by observing the flavor-tagged asymmetry in the same-sign semileptonic
mode one can extract $\epsilon_S$ from the asymmetry value, even after a long time has
passed:



r\T(t = ())->■ !±v!7rF] + r[K°(t = o> -> i± 

^^ 2Re(6 S ) 
This is another method of measuring $\epsilon_S = \epsilon + \delta$.



16.3.2 



The two most important CP-violating parameters that play a central role in the
theoretical interpretation of the neutral kaon decays are $\epsilon$ and $\epsilon'$. $\epsilon$ is a CP-violating
mixing parameter while $\epsilon'$, defined in Eq. (16.103c), expresses a CP violation effect
in the decay. $\epsilon$ is often referred to as the indirect and $\epsilon'$ as the direct CP violation
effect. 8)

Phase Convention

We investigate the interplay of the CP violation parameters $\epsilon$, $\epsilon'$, $\eta_{+-}$, and $\eta_{00}$ and
clarify their roles. Since the neutral kaons decay predominantly to 2π channels, we
first analyze the 2π decay mode. The 2π amplitude can be decomposed into isospin
states 9)

\begin{align}
A(K^0 \to \pi^+\pi^-) &= \frac{1}{\sqrt3}\left(\sqrt2 A_0 + A_2\right) \tag{16.100a}\\
A(K^0 \to \pi^0\pi^0) &= \frac{1}{\sqrt3}\left(-A_0 + \sqrt2 A_2\right) \tag{16.100b}\\
A(K^+ \to \pi^+\pi^0) &= \sqrt{\frac{3}{2}}\,A_2 \tag{16.100c}
\end{align}

The $I = 1$ state is antisymmetric and is not allowed by Bose statistics. The decay
amplitudes, taking into account the final-state interactions, can be expressed as [see
Eq. (16.62)]

\begin{align}
T_I &= \langle 2\pi, I|H_W|K^0\rangle = A_I e^{i\delta_I} \tag{16.101a}\\
\bar T_I &= \langle 2\pi, I|H_W|\bar K^0\rangle = \bar A_I e^{i\delta_I} \tag{16.101b}
\end{align}

where $\delta_I$ are scattering phase shifts of isospin $I$. CPT invariance means $\bar A_I = A_I^*$
and CP violation means $A_I \ne A_I^*$.

8) There is yet another type, referred to as the "mixed interference" CP violation parameter, defined by [...] (16.99). It appears in the interference between a decay without mixing, $K^0 \to f$, and a decay with mixing, $K^0 \to \bar K^0 \to f$. Such an effect occurs only in decays to final states that are common to $K^0$ and $\bar K^0$. The CP-violating effect is defined by $\mathrm{Im}(\lambda_f) \ne 0$. In the kaon decays, it is of little use but it plays a [...]

9) We deal with the symmetrized state $\pi^+\pi^- = (\pi^+\pi^- + \pi^-\pi^+)/\sqrt2$. Normal [...]

Next we define

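\begin{align}
a &\equiv \frac{\langle I=0|H_W|K_2\rangle}{\langle I=0|H_W|K_1\rangle}
= \frac{\langle 0|H_W|K^0\rangle - \langle 0|H_W|\bar K^0\rangle}{\langle 0|H_W|K^0\rangle + \langle 0|H_W|\bar K^0\rangle}
= \frac{A_0 - \bar A_0}{A_0 + \bar A_0} = i\,\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} \tag{16.102}
\end{align}

Here we have used $\langle 0|$, $\langle 2|$ to denote the 2π, $I = 0, 2$ eigenstates. We also define
CP-violating decay parameters for $I = 0, 2$:

\begin{align}
\epsilon_0 &\equiv \frac{\langle I=0|H_W|K_L\rangle}{\langle I=0|H_W|K_S\rangle}
= \frac{\langle 0|H_W|K_2\rangle + \epsilon\langle 0|H_W|K_1\rangle}{\langle 0|H_W|K_1\rangle + \epsilon\langle 0|H_W|K_2\rangle}
\simeq \epsilon + a = \epsilon + i\,\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} \tag{16.103a}\\
\epsilon_2 &\equiv \frac{1}{\sqrt2}\frac{\langle I=2|H_W|K_L\rangle}{\langle I=0|H_W|K_S\rangle}
= \frac{1}{\sqrt2}\frac{\langle 2|H_W|K_2\rangle + \epsilon\langle 2|H_W|K_1\rangle}{\langle 0|H_W|K_1\rangle + \epsilon\langle 0|H_W|K_2\rangle}
\simeq \epsilon' + \epsilon_0\,\omega \tag{16.103b}\\
\epsilon' &\equiv \frac{1}{\sqrt2}\frac{\langle 2|H_W|K_2\rangle}{\langle 0|H_W|K_1\rangle} - a\,\omega
= i\omega\left[\frac{\mathrm{Im}A_2}{\mathrm{Re}A_2} - \frac{\mathrm{Im}A_0}{\mathrm{Re}A_0}\right] \tag{16.103c}\\
\omega &\equiv \frac{1}{\sqrt2}\frac{\langle 2|H_W|K_S\rangle}{\langle 0|H_W|K_S\rangle}
\simeq \frac{1}{\sqrt2}\frac{\mathrm{Re}A_2}{\mathrm{Re}A_0}\,e^{i(\delta_2-\delta_0)} \tag{16.103d}
\end{align}

where $\delta_0$, $\delta_2$ are the 2π scattering phase shifts in the $I = 0, 2$ states. Note that $\epsilon_0$, $\epsilon_2$ and $\omega$
(hence $\epsilon'$ also) are observables and are independent of the phase convention.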

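Proof:

\begin{align}
\eta_f &\equiv \frac{\langle f|H_W|K_L\rangle}{\langle f|H_W|K_S\rangle}
= \frac{\langle f|H_W|\{p|K^0\rangle - q|\bar K^0\rangle\}}{\langle f|H_W|\{p|K^0\rangle + q|\bar K^0\rangle\}}
= \frac{1-\lambda_f}{1+\lambda_f} \\
\lambda_f &\equiv \left(\frac{q}{p}\right)\frac{\langle f|H_W|\bar K^0\rangle}{\langle f|H_W|K^0\rangle} = \left(\frac{q}{p}\right)\frac{\bar A_f}{A_f} \tag{16.104}
\end{align}

By rephasing $|K^0\rangle \to e^{-i\xi}|K^0\rangle$, $|\bar K^0\rangle \to e^{i\xi}|\bar K^0\rangle$, we have $\bar A_f/A_f \to e^{2i\xi}\bar A_f/A_f$
and $q/p \to e^{-2i\xi}(q/p)$ [see discussions following Eqs. (16.81)], therefore $\lambda_f$ is
phase independent. □

The relative amplitude $\omega$ violates the $\Delta I = 1/2$ rule. Its absolute value and phase
can be obtained from the experimental data. If $\rho$ indicates the phase space [...]

\[
|\omega|^2 \simeq \frac{2}{3}\,\mathrm{BR}(K^+ \to \pi^+\pi^0)\,\frac{\tau_S}{\tau_{K^+}} = (3.19\times10^{-2})^2 \tag{16.105a}
\]

From the above relations we obtain $\arg(\omega)|_{\rm observed} = \pm55.6^\circ$, which is
[...] with that given by Eq. (16.103d), namely [120, 157, 344]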



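Since the phase of $|K^0\rangle$, $|\bar K^0\rangle$ can be chosen arbitrarily [Eq. (16.17)], we are free to
change $\epsilon$. This can be shown as follows. By rephasing

\begin{align}
\epsilon = \frac{\Lambda_{21} - \Lambda_{12}}{2\Delta\lambda}
&\to \frac{\Lambda_{21}e^{-2i\xi} - \Lambda_{12}e^{2i\xi}}{2\Delta\lambda} \\
&\simeq \frac{\Lambda_{21}(1-2i\xi) - \Lambda_{12}(1+2i\xi)}{2\Delta\lambda} \simeq \epsilon + i\xi
\end{align}

where we have used the fact that CP violation is a small effect [i.e. $O(\epsilon\Delta\lambda)$] and
only needs a small phase shift to redefine $\epsilon$. Using this freedom, we can change $\epsilon$
to a more convenient form. Choosing $\xi = \mathrm{Im}A_0/\mathrm{Re}A_0$, we can make $\epsilon$ equal to $\epsilon_0$.
Namely, the CP-violating decay parameter and the mixing parameter are the same,
a convention we adopt from now on. 10) In this case one has to bear in mind that
$\bar K^0$ is defined by

\[
CP|K^0\rangle = e^{2i\xi}|\bar K^0\rangle \tag{16.108}
\]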

Note that in many discussions outside the realm of neutral kaon decays,
including those about the Kobayashi-Maskawa matrix, $\xi = 0$ is assumed. So one must
be careful about the phase convention. Since we chose to set e = e , which is a 
phase-convention-independent parameter, we shall use e when we need the phase- 
dependent mixing parameter. Note, many authors use e to denote both the phase- 
independent 7=0 decay amplitude and the phase-dependent mixing parameter 
interchangeably. 
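Then $\epsilon$ defined in Eq. (16.69) is modified to

\[
\epsilon = \frac{\Lambda_{21} - \Lambda_{12}}{2\Delta\lambda} + i\,\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0}
\]

10) Historically, Wu-Yang's phase choice [3 [...] eliminate $i\,\mathrm{Im}A_0/\mathrm{Re}A_0$ was adopted, [...] this phase choice $\epsilon = \epsilon_0$. We use the [...] but $\mathrm{Im}A_0/\mathrm{Re}A_0$ is retained explicitly.

Since $\Delta\lambda = \Delta m + i\Delta\Gamma/2$ and

\[
\left(-i\sin\phi_{SW}\,e^{i\phi_{SW}}\right)^{-1} = 1 + i\cos\phi_{SW}/\sin\phi_{SW} = 1 + i\Delta\Gamma/2\Delta m \tag{16.110}
\]

$\epsilon$ can be rewritten as

\begin{align}
\epsilon &= e^{i\phi_{SW}}\sin\phi_{SW}\left[-\frac{\mathrm{Im}(M_{12})}{\Delta m} - \frac{\mathrm{Im}(\Gamma_{12})}{\Delta\Gamma}\right] + i\left(\frac{\mathrm{Im}(\Gamma_{12})}{\Delta\Gamma} + \frac{\mathrm{Im}A_0}{\mathrm{Re}A_0}\right) \\
&\equiv \epsilon_T + i\,\delta\phi \tag{16.111a}\\
\delta\phi &= \frac{\mathrm{Im}(\Gamma_{12})}{\Delta\Gamma} + \frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} \tag{16.111b}
\end{align}

We shall show that $\delta\phi$ is a very small number.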
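The superweak phase and the split of $\epsilon$ into a part along $e^{i\phi_{SW}}$ plus a purely imaginary remainder can be verified numerically. The sketch below is an illustration under stated assumptions: it takes $\tan\phi_{SW} = 2\Delta m/\Delta\Gamma$, uses PDG-like values for $\Delta m$, $\tau_S$ and $\tau_L$ that are not quoted in this section, and assumes the mixing parameter has the form $e^{i\phi_{SW}}\sin\phi_{SW}\,[-\mathrm{Im}M_{12}/\Delta m + i\,\mathrm{Im}\Gamma_{12}/(2\Delta m)] + i\,\mathrm{Im}A_0/\mathrm{Re}A_0$. All function and variable names are hypothetical.

```python
import cmath
import math

# Assumed inputs (PDG-like values, not quoted in this section).
DELTA_M = 0.5292e10   # m_L - m_S in units of hbar / s
TAU_S = 0.8954e-10    # K_S lifetime in s
TAU_L = 5.116e-8      # K_L lifetime in s


def phi_sw():
    """Superweak phase from tan(phi_SW) = 2 * Delta m / Delta Gamma."""
    delta_gamma = 1.0 / TAU_S - 1.0 / TAU_L
    return math.atan2(2.0 * DELTA_M, delta_gamma)


def epsilon_direct(x_m, x_g, xi):
    """epsilon from x_m = Im M12 / Delta m, x_g = Im Gamma12 / Delta Gamma, xi = Im A0 / Re A0."""
    phi = phi_sw()
    prefactor = cmath.exp(1j * phi) * math.sin(phi)
    # i * Im Gamma12 / (2 Delta m) = i * x_g * cot(phi_SW)
    return prefactor * (-x_m + 1j * x_g / math.tan(phi)) + 1j * xi


def epsilon_split(x_m, x_g, xi):
    """Return (eps_T, delta_phi) such that epsilon = eps_T + i * delta_phi."""
    phi = phi_sw()
    eps_t = -cmath.exp(1j * phi) * math.sin(phi) * (x_m + x_g)
    return eps_t, x_g + xi
```

With these inputs `math.degrees(phi_sw())` evaluates to about 43.5 degrees, and `epsilon_direct` agrees with `eps_T + 1j * delta_phi` for arbitrary small real inputs, which shows the split is an algebraic identity rather than an approximation.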
Experimental data show 

BR(JC S -^2jt + (y)) = 99.89%, 

BR(JC S -+ 3jt) = 3.5 x 10" 7 , 

BR(Ks ->• e ± v e 7t :f ) =7.04xlCr 4 , (16.112) 

BR(JC S ->• (i^v^n*) = 4.69 x 1CT 4 

All other decay modes < 10 -5 

One notices that 2jt channel including photon emissions is overwhelmingly the 
dominant decay mode. Since 7"s 3> /!> A — r 2jr , it is a good approximation to 
use only the 2jt intermediate state in the evaluation of /\ 2 . Note that the leptonic 
mode ( K — >■ I v j jr) does not contribute to 7"i 2 if the A S = A Q rule is exact. Setting 
the phase space volume as p and using Eqs. (16.11) and (16.101), we have 

Im(r 12 ) ~ \lm( (K°\H W \I = 0)(I = 0\H w \K°)) + (i =2 term)! p 
= Im(A* A + A*A 2 ) p =^L Im[(A*) 2 + (A*) 2 ] p 
= -2(ReA ImAo + ReA 2 ImA 2 ) p (16.113) 

A ~ [|<J = 0\H w \K°+ K°)/V2)| 2 + (7 = 2 term)] p 

~ 2 [(ReA ) 2 + (ReA 2 ) 2 ] p (16.114) 

\begin{align}
\frac{\mathrm{Im}(\Gamma_{12})}{\Delta\Gamma} \simeq \frac{\mathrm{Im}(\Gamma_{12})}{\Gamma_S}
&\simeq -\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} + 2|\omega|^2\left[\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} - \frac{\mathrm{Im}A_2}{\mathrm{Re}A_2}\right] \\
&= -\frac{\mathrm{Im}A_0}{\mathrm{Re}A_0} \pm 2|\omega||\epsilon'| \tag{16.115}
\end{align}

Using experimental values [$\omega \sim 0.03$, $|\epsilon'/\epsilon| \sim 10^{-3}$; see Eq. (16.183)], we see that
the second term is at most $\sim 10^{-5}$ of the first term and can be neglected. This
proves our promised statement.

We see that $\epsilon_T$ is completely aligned along the superweak phase. Using $\Delta m \simeq -2\mathrm{Re}(M_{12})$,
$\Delta\Gamma \simeq 2\mathrm{Re}(\Gamma_{12})$ [see Eq. (16.82)], $\epsilon_T$ can further be rewritten as

\[
\epsilon_T \simeq e^{i\phi_{SW}}\,\frac{\sin\phi_{SW}}{2}\left[\pi - \arg(M_{12}^*\Gamma_{12})\right]
= e^{i\phi_{SW}}\sin\phi_{SW}\left[\frac{2\,\mathrm{Im}(M_{12}^*\Gamma_{12})}{\Delta m\,\Delta\Gamma}\right] \tag{16.116}
\]

where $\Delta m\,\Delta\Gamma = -4\mathrm{Re}(M_{12}^*\Gamma_{12})$ has been used in obtaining the last equality. Note,
T invariance means $\mathrm{Im}(M_{12}^*\Gamma_{12}) = 0$ [see Eq. (16.52c)]. Of course it should be, as
T and CP are equivalent under the assumption of CPT invariance. But it is good
to remember that $\epsilon_T$, if defined exactly by the second equality of Eq. (16.116), is
intrinsically a T violation parameter and will reappear later in Eq. (16.148) in the
discussion of T invariance.

Conventionally, throughout most of the literature, $\epsilon$ defined by Eq. (16.111) is used.
However, as $\delta\phi$ is small, the following discussions are valid irrespective of whether
$\epsilon$ or $\epsilon_T$ is used, except in trying to determine the value of $\delta\phi$ itself. Note also that
$\epsilon$ itself is a phase-dependent variable, but its real part $\mathrm{Re}(\epsilon)$ is an observable and
is phase independent, as can be seen in the expression of the decay asymmetry
parameter $A_L$ [see Eq. (16.98)].

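Evaluation of $\epsilon$

To determine $\epsilon$ and $\epsilon'$, we express them in terms of observables. We relate the 2π
decay amplitudes $\eta_{+-}$, $\eta_{00}$ to the isospin parameters $\epsilon$ and $\epsilon'$.

Using the inverse relation of Eq. (16.100) we have

\begin{align}
\langle I=0|H_W|K_L\rangle &= \sqrt{\tfrac23}\,\langle{+-}|H_W|K_L\rangle - \sqrt{\tfrac13}\,\langle 00|H_W|K_L\rangle \\
&= \sqrt{\tfrac23}\,\eta_{+-}\left[\sqrt{\tfrac23}\,\langle I=0|H_W|K_S\rangle + \sqrt{\tfrac13}\,\langle I=2|H_W|K_S\rangle\right] \tag{16.117}\\
&\quad - \sqrt{\tfrac13}\,\eta_{00}\left[-\sqrt{\tfrac13}\,\langle I=0|H_W|K_S\rangle + \sqrt{\tfrac23}\,\langle I=2|H_W|K_S\rangle\right]
\end{align}

and dividing by $\langle I=0|H_W|K_S\rangle$ we obtain

\[
\epsilon_0 = \left(\tfrac23\eta_{+-} + \tfrac13\eta_{00}\right) + \tfrac23(\eta_{+-} - \eta_{00})\,\omega \tag{16.118}
\]

Similarly

\[
\epsilon_2 = \left(\tfrac13\eta_{+-} - \tfrac13\eta_{00}\right) + \tfrac13(\eta_{+-} + 2\eta_{00})\,\omega \tag{16.119}
\]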
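The relations between the isospin parameters and the measured $\eta_{+-}$, $\eta_{00}$ can be exercised with a short round-trip check. This is a sketch under assumptions: the inverse map used below, $\eta_{+-} = \epsilon_0 + \epsilon'/(1+\omega)$ and $\eta_{00} = \epsilon_0 - 2\epsilon'/(1-2\omega)$, is obtained here by inverting Eqs. (16.118) and (16.119) with $\epsilon_2 = \epsilon' + \epsilon_0\omega$ taken as exact; the function names and the sample inputs are illustrative.

```python
def eps_isospin(eta_pm, eta_00, omega):
    """Return (eps0, eps2) from eta_{+-}, eta_{00} and omega, Eqs. (16.118)-(16.119)."""
    eps0 = (2.0 * eta_pm + eta_00) / 3.0 + (2.0 / 3.0) * (eta_pm - eta_00) * omega
    eps2 = (eta_pm - eta_00) / 3.0 + (eta_pm + 2.0 * eta_00) * omega / 3.0
    return eps0, eps2


def etas_from_isospin(eps0, eps_prime, omega):
    """Inverse map (derived here, assuming eps2 = eps' + eps0 * omega exactly)."""
    eta_pm = eps0 + eps_prime / (1.0 + omega)
    eta_00 = eps0 - 2.0 * eps_prime / (1.0 - 2.0 * omega)
    return eta_pm, eta_00
```

For small $\omega$ the inverse map reduces to $\eta_{+-} \approx \epsilon_0 + \epsilon'$ and $\eta_{00} \approx \epsilon_0 - 2\epsilon'$, so the difference of the two measured parameters isolates $\epsilon'$ while their weighted average isolates $\epsilon_0$.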

Conversely



As $\epsilon' \ll \epsilon$ experimentally [see Eq. (16.183)],



is a good first approximation. Figure 16.11 shows a sketch of Eqs. (16.120).
Experimentally, using Eq. (16.89),

\[
|\eta_{+-}| \simeq |\eta_{00}|, \qquad \phi_{+-} \simeq \phi_{00} = 43.51 \pm 0.05^\circ \simeq \phi_{SW} \tag{16.123}
\]



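At this stage, it is convenient to define components parallel and perpendicular to
$e^{i\phi_{SW}}$ in the complex plane. Then

\begin{align}
\epsilon &= \epsilon_\parallel\,\mathbf{e}_\parallel + \epsilon_\perp\,\mathbf{e}_\perp \tag{16.124a}\\
\mathbf{e}_\parallel &= e^{i\phi_{SW}} = \cos\phi_{SW} + i\sin\phi_{SW} \tag{16.124b}\\
\mathbf{e}_\perp &= i\,e^{i\phi_{SW}} = -\sin\phi_{SW} + i\cos\phi_{SW} \tag{16.124c}
\end{align}

We will learn later that the components of $\epsilon$, $\epsilon'$ perpendicular to $\mathbf{e}_\parallel$ are related to
CPT violation. The experimental observations Eq. (16.123) mean



and that $\epsilon$ can be determined very accurately by the relations

\begin{align}
|\epsilon| &= (2|\eta_{+-}| + |\eta_{00}|)/3 \\
\phi_\epsilon &= (2\phi_{+-} + \phi_{00})/3
\end{align}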





Figure16.11 Sketch of Eq. (16.120). Not tc 
it is phase dependent, but e = eo is not. 



i. Since we redefined e as the mixing parameter,

\begin{align}
|\epsilon| &= (2.229 \pm 0.012)\times10^{-3} \\
\phi_\epsilon &= 43.51 \pm 0.05^\circ \tag{16.126}
\end{align}
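As a quick arithmetic check, the weighted averages $|\epsilon| = (2|\eta_{+-}| + |\eta_{00}|)/3$ and $\phi_\epsilon = (2\phi_{+-} + \phi_{00})/3$ can be evaluated from the world averages of Eq. (16.89). This is a minimal sketch of central values only; error propagation is deliberately not attempted and the function name is hypothetical.

```python
def epsilon_from_eta(abs_pm, abs_00, phi_pm_deg, phi_00_deg):
    """Central values of |epsilon| and phi_epsilon (degrees) from the eta parameters."""
    abs_eps = (2.0 * abs_pm + abs_00) / 3.0
    phi_eps = (2.0 * phi_pm_deg + phi_00_deg) / 3.0
    return abs_eps, phi_eps


# Inputs are the world averages quoted in Eq. (16.89).
ABS_EPS, PHI_EPS = epsilon_from_eta(2.233e-3, 2.222e-3, 43.51, 43.52)
```

The central values reproduce the numbers quoted in Eq. (16.126) to the precision given there.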

Direct CP Violation

While $\epsilon$ represents an indirect CP violation in mixing, $\epsilon'$ is a direct CP violation in
decay amplitudes to the $I = 0, 2$ states. To see that $\epsilon'$ represents a CP violation in the
decay, let us find a necessary condition for the two CP conjugate decay rates to be
different. Note, the total decay rate is guaranteed to be the same by CPT invariance,
but the partial decay rate to a particular final state $f$ can be different if CP is violated.
The partial decay amplitude is expressed as [see Eq. (16.57)]

\begin{align}
T(B \to f) &= \sum_j A_j e^{i\delta_j} \tag{16.127a}\\
T(\bar B \to \bar f) &= \sum_j \bar A_j e^{i\delta_j} \tag{16.127b}
\end{align}

where $j$ labels different intermediate states and $\delta_j$ is the scattering phase shift. CPT
invariance constrains $\bar A_j = A_j^*$ and T invariance means $A_j = A_j^*$. Therefore,
under the assumption of CPT invariance, CP violation appears as a phase of the
amplitude. Let us write

\[
A_j = \bar A_j^* = |A_j|e^{i\phi_{Wj}} \equiv a_j e^{i\phi_{Wj}}, \qquad \bar A_j = a_j e^{-i\phi_{Wj}} \tag{16.128}
\]

For an experimental signature of CP violation in the decay,

\[
\delta\Gamma = \Gamma(B \to f) - \Gamma(\bar B \to \bar f) \ne 0 \tag{16.129}
\]

the interference effect is necessary, which requires at least two intermediate states.
Writing

\begin{align}
T(B \to f) &= a_1 e^{i\delta_1}e^{i\phi_{W1}} + a_2 e^{i\delta_2}e^{i\phi_{W2}} \\
T(\bar B \to \bar f) &= a_1 e^{i\delta_1}e^{-i\phi_{W1}} + a_2 e^{i\delta_2}e^{-i\phi_{W2}} \tag{16.130}
\end{align}

we have

\[
\Gamma(B \to f) - \Gamma(\bar B \to \bar f) \propto a_1 a_2 \sin(\phi_{W1} - \phi_{W2})\sin(\delta_1 - \delta_2) \tag{16.131}
\]

The above equation means that the two states must have both different strong
phases and different weak phases.
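The two-amplitude interference condition can be checked directly. The sketch below, with hypothetical function names, builds $T$ and $\bar T$ from two amplitudes with strong phases $\delta_j$ and weak phases $\phi_{Wj}$ and compares $|T|^2 - |\bar T|^2$ with the closed form $-4a_1a_2\sin(\phi_{W1}-\phi_{W2})\sin(\delta_1-\delta_2)$; the overall factor $-4$ follows from expanding the squares and is absorbed in the proportionality sign of the rate difference.

```python
import cmath
import math


def rate_difference(a1, a2, d1, d2, w1, w2):
    """|T(B -> f)|^2 - |T(Bbar -> fbar)|^2 for two interfering amplitudes."""
    t = a1 * cmath.exp(1j * (d1 + w1)) + a2 * cmath.exp(1j * (d2 + w2))
    t_bar = a1 * cmath.exp(1j * (d1 - w1)) + a2 * cmath.exp(1j * (d2 - w2))
    return abs(t) ** 2 - abs(t_bar) ** 2


def rate_difference_closed_form(a1, a2, d1, d2, w1, w2):
    """Closed form: -4 a1 a2 sin(w1 - w2) sin(d1 - d2)."""
    return -4.0 * a1 * a2 * math.sin(w1 - w2) * math.sin(d1 - d2)
```

Setting either `d1 == d2` or `w1 == w2` makes the difference vanish, which is the statement that both the strong and the weak phases must differ between the two routes.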

Next we want to express the decay rate difference in terms of the CP violation
parameter $\eta_f$ that we defined previously [Eq. (16.83)].

Problem 16.7

Show the decay asymmetry at time t is given by

\begin{align}
A_f \equiv \frac{\Gamma(\bar K^0 \to f) - \Gamma(K^0 \to f)}{\Gamma(\bar K^0 \to f) + \Gamma(K^0 \to f)} &= 2\mathrm{Re}(\epsilon - \eta_f) \tag{16.132}\\
\eta_f = \frac{\langle f|H_W|K_L\rangle}{\langle f|H_W|K_S\rangle} &= \epsilon + i\,\frac{\mathrm{Im}A_f}{\mathrm{Re}A_f}
\end{align}

which is a restatement that if the final state is reached via a single route, the
asymmetry does not appear. Show also

\[
A_f(\pi^0\pi^0) = 2\mathrm{Re}(\epsilon - \eta_{00}) = 4\mathrm{Re}\,\epsilon', \qquad A_f(\pi^+\pi^-) = 2\mathrm{Re}(\epsilon - \eta_{+-}) = -2\mathrm{Re}\,\epsilon' \tag{16.133}
\]

This explicitly shows that $\epsilon'$ represents the difference of two CP conjugate decay
rates.

Note, the CP violation parameter $\epsilon'$ defined in Eq. (16.103c) can be expressed in
the form

\[
\epsilon' = i\omega\left[\frac{\mathrm{Im}A_2}{\mathrm{Re}A_2} - \frac{\mathrm{Im}A_0}{\mathrm{Re}A_0}\right]
\]

which is of the form Eq. (16.131) and shows that the $I = 0, 2$ states provide the two
different routes for the decays $K \to \pi^+\pi^-$ and $K \to \pi^0\pi^0$.

Problem 16.8 

Show that e' can also be defined as 



I, i/Ia^oI \a„ + „-\\ 



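Evaluation of $\epsilon'$

From Eq. (16.120)

\[
\epsilon' \simeq \tfrac13(\eta_{+-} - \eta_{00})
\]

It is convenient to use the decomposition defined in Eq. (16.124):

[...] $= \mathrm{Im}(\epsilon'/\epsilon)$

\[
\arg(\epsilon') = \arg(i\omega) = \frac{\pi}{2} + \delta_2 - \delta_0 = 42.3 \pm 1.5^\circ \simeq \phi_{SW}
\]

Using $\eta_{+-} \simeq \eta_{00}$, $\phi_{+-} \simeq \phi_{00} \simeq \phi_{SW}$

[...]

When both $|\eta_{+-}|$ and $|\eta_{00}|$ are measured it is possible to determine $\epsilon'_\parallel$ in principle,
but with poor accuracy. Experimentally $\mathrm{Re}(\epsilon'/\epsilon)$ can be determined directly [51,
244]. Using Eq. (16.120)

\[
\mathrm{Re}\left(\frac{\epsilon'}{\epsilon}\right) \simeq \frac16\left(\left|\frac{\eta_{+-}}{\eta_{00}}\right|^2 - 1\right) \simeq \frac16\left(1 - \left|\frac{\eta_{00}}{\eta_{+-}}\right|^2\right) = (1.65 \pm 0.26)\times10^{-3}
\]

where we have used $|\eta_{+-}/\eta_{00}|^2 \simeq 1$ to derive the third equality. $\Delta\phi$ can also be
determined accurately using the value of $\mathrm{Re}(\epsilon'/\epsilon)$. Since $|\eta_{00}/\eta_{+-}| \simeq 1$, Eq. (16.120) gives

\begin{align}
\Delta\phi = \phi_{00} - \phi_{+-} &\simeq -3\,\mathrm{Im}(\epsilon'/\epsilon) \simeq -3\,\mathrm{Re}\left(\frac{\epsilon'}{\epsilon}\right)\tan(\phi_{\epsilon'} - \phi_{SW}) \tag{16.142}\\
&\simeq 0.006 \pm 0.008^\circ \quad [311]
\end{align}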
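The size of $\Delta\phi$ follows from simple arithmetic once $\mathrm{Re}(\epsilon'/\epsilon)$ and the two phases are known. The sketch below uses only central values quoted in the text ($\mathrm{Re}(\epsilon'/\epsilon) = 1.65\times10^{-3}$, $\phi_{\epsilon'} = 42.3^\circ$, $\phi_{SW} \approx 43.51^\circ$); the function names are hypothetical, and the second function is purely illustrative, since experiments extract $\mathrm{Re}(\epsilon'/\epsilon)$ from a double ratio of decay rates rather than from the separately averaged moduli.

```python
import math


def delta_phi_deg(re_ratio, phi_eps_prime_deg, phi_sw_deg):
    """Delta phi = phi_00 - phi_+- in degrees, from -3 Re(eps'/eps) tan(phi_eps' - phi_SW)."""
    im_ratio = re_ratio * math.tan(math.radians(phi_eps_prime_deg - phi_sw_deg))
    return math.degrees(-3.0 * im_ratio)


def re_eps_ratio(abs_eta_pm, abs_eta_00):
    """Re(eps'/eps) ~ (|eta_+-/eta_00|^2 - 1) / 6."""
    return ((abs_eta_pm / abs_eta_00) ** 2 - 1.0) / 6.0


DELTA_PHI = delta_phi_deg(1.65e-3, 42.3, 43.51)
```

With these inputs `DELTA_PHI` is about 0.006 degrees, consistent with the value quoted in Eq. (16.142).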



16.4 

Test of T and CPT Invariance 11)



We have observed CP violation, but if CPT invariance holds this is equivalent to 
T violation. The foundation on which CPT invariance is established is very solid, 
as stated in Chap. 9. It can be proved with very few axioms, such as the principles 
of quantum mechanics and Lorentz invariance. Namely, to deny CPT invariance 
is equivalent to denying quantum field theory. One wonders if the observed CP 
violation is T violation with CPT invariance or genuine CP violation, which holds 
regardless of CPT invariance. 

The neutral kaon system provides test benches for the most rigorous determination
of T and CPT symmetry and a variety of other fundamental principles. This is
because the mass difference ($\Delta m \approx 3.5\times10^{-6}$ eV) is so small that its effect appears
as macroscopic interference phenomena and enables CP and CPT parameters to
be determined with an accuracy no other means can provide. In this section, we
start with no assumption about CP or CPT invariance.

11) See [48, 68, 121, 269, 299].

Definition of T- and CPT-Violating Amplitudes

The starting point is Eq. (16.55). We first write the decay amplitudes for $K \to 2\pi$
and $K_{l3}$ as a sum of two amplitudes:

\begin{align}
\langle 2\pi, I|H_W|K^0\rangle &= (A_I + B_I)e^{i\delta_I} \tag{16.143a}\\
\langle 2\pi, I|H_W|\bar K^0\rangle &= (\bar A_I + \bar B_I)e^{i\delta_I} \tag{16.143b}\\
\langle e^+\nu_e\pi^-|H_W|K^0\rangle &= a + b \tag{16.143c}\\
\langle e^-\bar\nu_e\pi^+|H_W|\bar K^0\rangle &= \bar a + \bar b \tag{16.143d}
\end{align}

If CPT invariance holds, $\bar A + \bar B = (A + B)^*$ [Eq. (16.63a)]. If we rewrite $\bar A_I = A_I^*$,
$\bar a = a^*$, $\bar B_I = -B_I^*$, $\bar b = -b^*$, then $A_I$, $a$ represent CPT-conserving and $B_I$, $b$ represent
CPT-violating terms. Furthermore, if we write

\begin{align}
A_I &= \mathrm{Re}A_I + i\,\mathrm{Im}A_I, \qquad B_I = \mathrm{Re}B_I + i\,\mathrm{Im}B_I \\
a &= \mathrm{Re}\,a + i\,\mathrm{Im}\,a, \qquad b = \mathrm{Re}\,b + i\,\mathrm{Im}\,b
\end{align}

T invariance means $A_I^* = A_I$, $B_I^* = B_I$ and CP invariance means $\bar A_I = A_I$,
$\bar B_I = B_I$. Consequently, what each term represents is

Term        CPT         T           CP
Re A_I      conserved   conserved   conserved
Im A_I      conserved   violated    violated
Re B_I      violated    conserved   violated
Im B_I      violated    violated    conserved

If we can demonstrate the smallness of both direct and indirect CPT violation parameters,
i.e., $|B_I| \ll |A_I|$, $\delta \ll \epsilon$, the assumption of CPT invariance is justified.
Similar logic can be applied to $a$, $b$.

16.4.2 

T Violation 

If observables such as transverse polarization ($\boldsymbol\sigma\cdot\mathbf p_1\times\mathbf p_2$) in the 3-body decay
$K^0 \to \pi^\pm(p_1)\mu^\mp(p_2)\nu_\mu$ or the electric dipole moment (EDM) are detected, they
are a sure sign of T violation [see Sect. 9.2.2] independent of CP or CPT violation.
Unfortunately, they have not been observed and there is no other evidence that
T invariance is violated. Therefore we investigate whether we can show it in the
neutral kaon system. T invariance means [19]

\[
T\left[\langle\Psi^{\rm out}(t_f)|\Phi^{\rm in}(t_i)\rangle\right] = \langle\Phi^{\rm out}(t_f)|\Psi^{\rm in}(t_i)\rangle \tag{16.145}
\]

Setting $\Phi = K^0$, $\Psi = \bar K^0$, we have

\[
|\langle\bar K^0_{\rm out}(t_f)|K^0_{\rm in}(t_i)\rangle|^2 = |\langle K^0_{\rm out}(t_f)|\bar K^0_{\rm in}(t_i)\rangle|^2 \tag{16.146}
\]

Therefore, time reversal violation can manifest itself in an asymmetry parameter
known as the Kabir asymmetry [221]

\begin{align}
A_T &\equiv \frac{P[\bar K^0(t=0) \to K^0(t)] - P[K^0(t=0) \to \bar K^0(t)]}{P[\bar K^0(t=0) \to K^0(t)] + P[K^0(t=0) \to \bar K^0(t)]} \\
&= \frac{|\langle K^0|e^{-i\Lambda t}|\bar K^0\rangle|^2 - |\langle\bar K^0|e^{-i\Lambda t}|K^0\rangle|^2}{|\langle K^0|e^{-i\Lambda t}|\bar K^0\rangle|^2 + |\langle\bar K^0|e^{-i\Lambda t}|K^0\rangle|^2} \\
&= \frac{|2(\Lambda_{12}/\Delta\lambda)g_-(t)A_f|^2 - |2(\Lambda_{21}/\Delta\lambda)g_-(t)\bar A_{\bar f}|^2}{|2(\Lambda_{12}/\Delta\lambda)g_-(t)A_f|^2 + |2(\Lambda_{21}/\Delta\lambda)g_-(t)\bar A_{\bar f}|^2} \\
&= \frac{|\Lambda_{12}|^2\{1 - 2\mathrm{Re}(y)\} - |\Lambda_{21}|^2\{1 + 2\mathrm{Re}(y)\}}{|\Lambda_{12}|^2\{1 - 2\mathrm{Re}(y)\} + |\Lambda_{21}|^2\{1 + 2\mathrm{Re}(y)\}} \\
&\simeq \frac{|\Lambda_{12}|^2 - |\Lambda_{21}|^2}{|\Lambda_{12}|^2 + |\Lambda_{21}|^2} - 2\mathrm{Re}(y) \tag{16.147a}\\
A_f &= \langle f|H_W|K^0\rangle = a + b = a(1 - y) \\
\bar A_{\bar f} &= \langle\bar f|H_W|\bar K^0\rangle = a^* - b^* = a^*(1 + y^*), \qquad y = -b/a \tag{16.147b}
\end{align}

where $f$ and $\bar f$ are decay final states to tag the flavor. We have used Eqs. (16.79b)
and (16.80b) in deriving the third line. No assumption on CPT invariance is made
here. Apart from the CPT-violating parameter $\mathrm{Re}(y)$, which can be tested separately, T
violation means $|\Lambda_{12}| \ne |\Lambda_{21}|$, which confirms the statement in Eq. (16.52b).

\[
\epsilon_T \equiv e^{i\phi_{SW}}\sin\phi_{SW}\,\frac{2\,\mathrm{Im}(M_{12}^*\Gamma_{12})}{\Delta m\,\Delta\Gamma}
\simeq e^{i\phi_{SW}}\sin\phi_{SW}\,\frac{|\Gamma_{12}|}{\Delta\Gamma}\sin\{\arg(\Gamma_{12}) - \arg(M_{12})\} \tag{16.148}
\]

where $\phi_{SW}$, $\Delta m$ and $\Delta\Gamma$ are defined in Eqs. (16.81). $\epsilon_T$ is related to $\epsilon$ by $\epsilon =
\epsilon_T + i\,\delta\phi$ [see Eq. (16.111) 12)]. Using this $\epsilon_T$, the asymmetry $A_T$ is expressed as

\[
A_T = 4\mathrm{Re}(\epsilon_T) - 2\mathrm{Re}(y) \tag{16.149}
\]

The value of $\mathrm{Re}(y)$ can be evaluated from the asymmetry of semileptonic decays
[see Eq. (16.155)] and is known to be consistent with zero. In the analysis here, it
was assumed to vanish.

12) Note, if $\epsilon_T$ is defined by Eq. (16.148), $\delta\phi$ will be redefined by this relation.

Figure 16.12 Direct T-violation effect observed by CPLEAR [107,1 



$\epsilon_T \ne 0$ means T violation whether or not CPT invariance is violated. Note $A_T$
is constant and does not depend on time. $K^0 \to \bar K^0$, $\bar K^0 \to K^0$ processes can
be identified using lepton charge tagging, i.e. $K^0$ by $e^+\nu_e\pi^-$ and $\bar K^0$ by $e^-\bar\nu_e\pi^+$,
assuming the strict $\Delta S = \Delta Q$ rule. 13) Figure 16.12 shows the CPLEAR measurement
[107, 108] of the asymmetry $A_T$, which is clearly nonzero. 14)

This is the first, and so far only, directly observed T-violation effect. The value is 
determined as 



4Re(e T ) = 6.2 ± 1 



It ± l-0 Sya 



< up 



(16.150) 



16.4.3 

CPT violation 



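CPT invariance requires $\Lambda_{11} = \Lambda_{22}$ [see Eq. (16.15a)], or equivalently $\delta = 0$.
Therefore an observable similar to $A_T$ that can test CPT violation is given by

\begin{align}
A_{CPT} &\equiv \frac{P[\bar K^0(t=0) \to \bar K^0(t)] - P[K^0(t=0) \to K^0(t)]}{P[\bar K^0(t=0) \to \bar K^0(t)] + P[K^0(t=0) \to K^0(t)]} \\
&= \frac{|\langle\bar K^0|e^{-i\Lambda t}|\bar K^0\rangle|^2 - |\langle K^0|e^{-i\Lambda t}|K^0\rangle|^2}{|\langle\bar K^0|e^{-i\Lambda t}|\bar K^0\rangle|^2 + |\langle K^0|e^{-i\Lambda t}|K^0\rangle|^2}
\end{align}

If the final state is tagged by charge detection of the decay lepton, the leptonic decay
amplitudes in Eq. (16.94) are accompanied by the direct CPT-violating amplitude
$b = -ya$ and are expressed as

\begin{align}
\langle\bar K^0|e^{-i\Lambda t}|\bar K^0\rangle &= a^*(1 + y^*)[g_+(t) + 2\delta g_-(t)] \tag{16.152a}\\
\langle K^0|e^{-i\Lambda t}|K^0\rangle &= a(1 - y)[g_+(t) - 2\delta g_-(t)] \tag{16.152b}
\end{align}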


13) Experimental analyses are usually carried out without assuming the $\Delta S = \Delta Q$ rule, but so far no
effect has been observed.

14) Because of different normalization, what CPLEAR actually measured is $A_T = 4\mathrm{Re}(\epsilon) - 4\mathrm{Re}(y)$.

Figure 16.13 Asymmetry $A_\delta$ versus the neutral-kaon decay time [...]
represents the result of the fit [107].



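\begin{align}
A_{CPT} &= \frac{|(1+y)^*[g_+(t) + 2\delta g_-(t)]|^2 - |(1-y)[g_+(t) - 2\delta g_-(t)]|^2}{|(1+y)^*[g_+(t) + 2\delta g_-(t)]|^2 + |(1-y)[g_+(t) - 2\delta g_-(t)]|^2} \\
&\simeq 4\,\frac{\mathrm{Re}(\delta)\sinh(\Delta\Gamma t/2) + \mathrm{Im}(\delta)\sin(\Delta m t)}{\cosh(\Delta\Gamma t/2) + \cos(\Delta m t)} + 2\mathrm{Re}(y) \\
&\xrightarrow{\;t \gg \tau_S\;} 4\mathrm{Re}(\delta) + 2\mathrm{Re}(y) \tag{16.152c}
\end{align}

$\mathrm{Re}(\delta)$ and $\mathrm{Re}(y)$ can be determined by observing the residual asymmetry. Figure 16.13
shows CPLEAR's experimental data on $A_{CPT}$. 15) The data show no sign of CPT
violation.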
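The first-order expression for $A_{CPT}$ can be compared with the exact ratio built from $g_\pm(t)$. The following is a sketch under assumptions: units with $\Gamma_S = 1$, $\Gamma_L/\Gamma_S \approx 1.75\times10^{-3}$, $\Delta m/\Gamma_S \approx 0.474$, $g_\pm(t) = \frac12(e^{-i\lambda_L t} \pm e^{-i\lambda_S t})$, and arbitrary small test values for $\delta$ and $y$; all names are hypothetical.

```python
import cmath
import math

# Assumed inputs in units where Gamma_S = 1.
GAMMA_S = 1.0
GAMMA_L = 1.75e-3
DELTA_M = 0.474


def g_pm(t):
    """(g_plus, g_minus) with the common mass phase dropped."""
    e_s = cmath.exp(-0.5 * GAMMA_S * t)
    e_l = cmath.exp((-1j * DELTA_M - 0.5 * GAMMA_L) * t)
    return 0.5 * (e_l + e_s), 0.5 * (e_l - e_s)


def a_cpt_exact(t, delta, y):
    """Exact ratio built from the tagged K0bar -> K0bar and K0 -> K0 amplitudes."""
    gp, gm = g_pm(t)
    n_bar = abs((1.0 + y).conjugate() * (gp + 2.0 * delta * gm)) ** 2
    n = abs((1.0 - y) * (gp - 2.0 * delta * gm)) ** 2
    return (n_bar - n) / (n_bar + n)


def a_cpt_first_order(t, delta, y):
    """First-order formula in delta and y."""
    half = 0.5 * (GAMMA_S - GAMMA_L) * t
    num = delta.real * math.sinh(half) + delta.imag * math.sin(DELTA_M * t)
    den = math.cosh(half) + math.cos(DELTA_M * t)
    return 4.0 * num / den + 2.0 * y.real
```

At late times the hyperbolic terms dominate and the asymmetry tends to the constant $4\mathrm{Re}(\delta) + 2\mathrm{Re}(y)$, which is the residual asymmetry referred to in the text.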
The obtained values are [107]

\[
\mathrm{Re}(\delta) = (2.9 \pm 2.6_{\rm stat} \pm 0.6_{\rm syst})\times10^{-4} \tag{16.153}
\]



The value of $\mathrm{Re}(y)$ can be determined using the asymmetry of semileptonic decays.
Including $\mathrm{Re}(y)$ in the expression for the asymmetry Eq. (16.95) we obtain

\begin{align}
A_{S,L} &= 2[\mathrm{Re}(\epsilon) \pm \mathrm{Re}(\delta) - \mathrm{Re}(y)] \tag{16.154a}\\
A_S + A_L &= 4[\mathrm{Re}(\epsilon) - \mathrm{Re}(y)] \tag{16.154b}
\end{align}

$A_S$ was measured in KLOE [235]. However, the most accurate value of $\mathrm{Re}(y)$ comes
from the analysis of the Bell-Steinberger relation and is given by

\[
\mathrm{Re}(y) = (0.4 \pm 2.5)\times10^{-3} \tag{16.155}
\]

$\mathrm{Im}(\delta)$ can also be determined in principle from the $K_{l3}$ decay asymmetry by observing
its time variation, as is described in Eq. (16.152c), but again its most accurate value
can be determined by using the Bell-Steinberger relation.

Analysis Using the Bell-Steinberger Relation

The Bell-Steinberger relation Eq. (16.74) can be expressed as

\begin{align}
2(\bar\Gamma + i\Delta m)[\mathrm{Re}(\epsilon) - i\,\mathrm{Im}(\delta)] &= \sum_f A_L(f)A_S^*(f)\rho(f) \tag{16.156a}\\
&= \sum_{f=2\pi}\eta_{2\pi}\,\mathrm{BR}(K_S \to \pi\pi)\Gamma_S + \sum_{f=3\pi}\eta_{3\pi}^*\,\mathrm{BR}(K_L \to 3\pi)\Gamma_L \\
&\quad + 2[\mathrm{Re}(\epsilon) - \mathrm{Re}(y) - i\,\mathrm{Im}(\delta)]\,\mathrm{BR}(K_L \to l\pi\nu)\Gamma_L \tag{16.156b}\\
\eta_{2\pi} = \frac{\langle 2\pi|H_W|K_L\rangle}{\langle 2\pi|H_W|K_S\rangle}, &\qquad \eta_{3\pi} = \frac{\langle 3\pi|H_W|K_S\rangle}{\langle 3\pi|H_W|K_L\rangle} \tag{16.156c}
\end{align}

In the neutral kaon system only a few modes give significant contributions to the
rhs of Eq. (16.156a) [see Eq. (16.112)]. Only the $\pi\pi(\gamma)$, $3\pi$ and $\pi l\nu$ modes turn
out to be relevant up to the $10^{-7}$ level [238]. The second line for the $K_{l3}$ intermediate
states derives from the following calculation:

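\begin{align}
&\left[\langle K_S|H_W|e^+\pi^-\nu\rangle\langle e^+\pi^-\nu|H_W|K_L\rangle + \langle K_S|H_W|e^-\pi^+\bar\nu\rangle\langle e^-\pi^+\bar\nu|H_W|K_L\rangle\right]\rho \\
&= \tfrac12\left[(1+\epsilon_S^*)(1+\epsilon_L)|A(K^0 \to e^+\pi^-\nu)|^2 - (1-\epsilon_S^*)(1-\epsilon_L)|A(\bar K^0 \to e^-\pi^+\bar\nu)|^2\right]\rho \\
&= \Gamma(K_L \to l\nu\pi)\,\tfrac12\left[(1+\epsilon_S^*)(1+\epsilon_L)\{1 - 2\mathrm{Re}(y)\} - (1-\epsilon_S^*)(1-\epsilon_L)\{1 + 2\mathrm{Re}(y)\}\right] \\
&= \{2\mathrm{Re}(\epsilon) - 2i\,\mathrm{Im}(\delta) - 2\mathrm{Re}(y)\}\,\mathrm{BR}(K_L \to l\nu\pi)\,\Gamma_L \tag{16.157}
\end{align}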

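Dividing both sides of Eqs. (16.156a) and (16.156b) by $\Gamma_S$ gives

\begin{align}
\left(\frac{2\bar\Gamma}{\Gamma_S} + i\tan\phi_{SW}\right)[\mathrm{Re}(\epsilon) - i\,\mathrm{Im}(\delta)] &\simeq \sum\eta_{2\pi}\,\mathrm{BR}(K_S \to 2\pi) \\
&\quad + \frac{\Gamma_L}{\Gamma_S}\left[\sum\eta_{3\pi}^*\,\mathrm{BR}(K_L \to 3\pi) + \mathrm{BR}(K_L \to l\nu\pi)\,2\{\mathrm{Re}(\epsilon) - \mathrm{Re}(y) - i\,\mathrm{Im}(\delta)\}\right] \tag{16.158}
\end{align}

As $\mathrm{BR}(K_S \to 2\pi) \approx 1$ and $\Gamma_L/\Gamma_S \simeq 1.75\times10^{-3}$, the contribution of the second line
is much smaller. Since the above equation is a relation of complex numbers, we
can determine two variables, $\mathrm{Im}(\delta)$ and $\mathrm{Re}(y)$, treating $\mathrm{Re}(\epsilon)$ as given. Alternatively,
by using Eq. (16.154) we can replace $[\mathrm{Re}(\epsilon) - \mathrm{Re}(y)]$ by an observable quantity
$(A_S + A_L)$ on the rhs. Then we can determine $\mathrm{Re}(\epsilon)$ and $\mathrm{Im}(\delta)$. Analyses by
CPLEAR/KLOE [107, 238] give

\[
\mathrm{Re}(\epsilon) = (159.6 \pm 1.3)\times10^{-5}, \qquad \mathrm{Im}(\delta) = (0.4 \pm 2.1)\times10^{-5} \tag{16.159}
\]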

Note on $\Delta S = -\Delta Q$ Components

So far we have completely neglected the $\Delta S = -\Delta Q$ contribution as it does not
exist in the quark model. If one is so skeptical as to doubt CPT invariance, it is
logical to include the $\Delta S = -\Delta Q$ contribution [denoted as $x$, $\bar x$ or $x_\pm = (x \pm \bar x)/2$],
too. If its contribution is included, from a purely experimental point of view we
have to add $\Delta S = -\Delta Q$ amplitudes for leptonic decays to those expressed in
Eqs. (16.143):

\begin{align}
\langle e^+\nu_e\pi^-|H_W|K^0\rangle &= a + b & \Delta S &= \Delta Q \tag{16.160a}\\
\langle e^-\bar\nu_e\pi^+|H_W|\bar K^0\rangle &= a^* - b^* & \Delta S &= \Delta Q \tag{16.160b}\\
\langle e^-\bar\nu_e\pi^+|H_W|K^0\rangle &= c + d & \Delta S &= -\Delta Q \tag{16.160c}\\
\langle e^+\nu_e\pi^-|H_W|\bar K^0\rangle &= c^* - d^* & \Delta S &= -\Delta Q \tag{16.160d}
\end{align}

We define $\Delta S = \Delta Q$ rule-breaking parameters by



a+b 



-(1 + y), x= ~ (1-y) 



(16.161) 

We can also separate them into CPT-conserving $x_+ = (x + \bar x)/2$ and CPT-violating
$x_- = (x - \bar x)/2$ terms. These parameters enter in the asymmetries $A_{\Delta m}$, $A_T$, $A_{CPT}$
and can be determined if the asymmetries are reanalyzed including these parameters.
So far there is no sign of them and their measured values, all consistent with zero, are given by [107,
238]

\begin{align}
\mathrm{Re}(x) &= (1.9 \pm 4.1)\times10^{-3} \\
\mathrm{Im}(x) &= (1.2 \pm 1.9)\times10^{-3} \tag{16.162}\\
\mathrm{Re}(x_-) &= (0.2 \pm 1.3)\times10^{-2} \\
\mathrm{Im}(x_+) &= (1.2 \pm 2.2)\times10^{-2}
\end{align}

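$K^0$-$\bar K^0$ Mass Difference and Decay Width Difference

We learned that CPT invariance implies

\begin{align}
\text{CPT invariance:}\quad &\to\ \Lambda_{11} = \Lambda_{22} \tag{16.163a}\\
\text{or}\quad & m_{K^0} = m_{\bar K^0}, \quad \Gamma_{K^0} = \Gamma_{\bar K^0} \tag{16.163b}
\end{align}

Let us express the mass and width difference in terms of the CPT-violating parameter
$\delta$. The CPT-violating mixing parameter is given by

\begin{align}
\delta &= \frac{\Lambda_{22} - \Lambda_{11}}{2\Delta\lambda} = \frac{\sin\phi_{SW}\,e^{i\phi_{SW}}}{2\Delta m}\left[i(M_{11} - M_{22}) + \frac{\Gamma_{11} - \Gamma_{22}}{2}\right] \tag{16.164}\\
M_{11} - M_{22} &= m_{K^0} - m_{\bar K^0}, \qquad \Gamma_{11} - \Gamma_{22} = \Gamma_{K^0} - \Gamma_{\bar K^0}
\end{align}

It is convenient to decompose $\delta$ into $\delta_\parallel$ and $\delta_\perp$ as in Eq. (16.124):

\begin{align}
\mathrm{Re}(\delta) &= \delta_\parallel\cos\phi_{SW} - \delta_\perp\sin\phi_{SW} \tag{16.165a}\\
\mathrm{Im}(\delta) &= \delta_\parallel\sin\phi_{SW} + \delta_\perp\cos\phi_{SW} \tag{16.165b}
\end{align}

where $\delta_\parallel$, $\delta_\perp$ are signed real numbers. Then

\begin{align}
m_{K^0} - m_{\bar K^0} = M_{11} - M_{22} &= \frac{2\Delta m}{\sin\phi_{SW}}\,\delta_\perp = \Delta\Gamma\{-\mathrm{Re}(\delta)\tan\phi_{SW} + \mathrm{Im}(\delta)\} \\
&\simeq 2\Delta m\{-\mathrm{Re}(\delta) + \mathrm{Im}(\delta)\} \tag{16.166a}\\
\Gamma_{K^0} - \Gamma_{\bar K^0} = \Gamma_{11} - \Gamma_{22} &= \frac{4\Delta m}{\sin\phi_{SW}}\,\delta_\parallel = 2\Delta\Gamma\{\mathrm{Re}(\delta) + \mathrm{Im}(\delta)\tan\phi_{SW}\} \\
&\simeq 2\Delta\Gamma\{\mathrm{Re}(\delta) + \mathrm{Im}(\delta)\} \tag{16.166b}
\end{align}

As we have already determined $\mathrm{Re}(\delta)$ and $\mathrm{Im}(\delta)$, the mass and decay width difference
can also be evaluated. Inserting the values of $\mathrm{Re}(\delta)$ and $\mathrm{Im}(\delta)$ in Eqs. (16.153)
and (16.159), we obtain [107]

\begin{align}
\delta_\parallel &= (1.9 \pm 2.0)\times10^{-4} \\
\delta_\perp &= (-1.5 \pm 2.0)\times10^{-4} \tag{16.167}
\end{align}

and subsequently from Eqs. (16.166) we have [107, 236]

\begin{align}
m_{K^0} - m_{\bar K^0} &= (0.5 \pm 5.8)\times10^{-19}\ \mathrm{GeV} \tag{16.168a}\\
\Gamma_{K^0} - \Gamma_{\bar K^0} &= (-1.5 \pm 2.0)\times10^{-18}\ \mathrm{GeV} \tag{16.168b}
\end{align}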

Dividing Eq. (16.168) by the mass and width of the kaon, one finds the CPT
violation can be at most [311]

$\frac{|m_{K^0} - m_{\bar K^0}|}{m_K} < 0.8 \times 10^{-18}$  (16.169a)

(16.169b)

Equation (16.169a) is considered the most stringent test of CPT invariance
at this time. Equation (16.169b) is a measure of direct CPT violation [see also
Eq. (16.175a)].

Direct CPT Violation in $2\pi$ Decays

The observed value $\arg(\omega) = -55.6°$ is in reasonable agreement with $\delta_2 - \delta_0 =
-47.6 \pm 1.5°$. Therefore the expression for $\omega$ without the CPT invariance assumption

$\omega = \frac{1}{\sqrt{2}}\,\frac{\mathrm{Re}A_2 + i\,\mathrm{Im}B_2}{\mathrm{Re}A_0 + i\,\mathrm{Im}B_0}\, e^{i(\delta_2 - \delta_0)}$  (16.170)

indicates $\mathrm{Im}B_I \ll \mathrm{Re}A_I$. With this assumption, the expressions for $A_L$, a, $\epsilon$, $\epsilon'$ in
Eqs. (16.95) and (16.102)-(16.103) are modified as follows:

1 Al _+ Re(e) _ j Re( d) - M J = Re( 6 ) - JRe(a) + Re(y)j (16.171a) 

(16.171b) 
(16.171c) 



ImA 


ilmAo + ReB _ . ImA J ReB | 


ReA 


ReA + ilmBo ~ l ReA ' (ReAoj 




l ReA ~^ eT V \ ReA ) 




rimA 2 -iReB 2 ImA -iReB ] 




W |_ReA 2 + iImB 2 ReA + iImB J 




/ImA 2 ImA \ 1 ReB 2 ReB 







(16.171d) 
VReA 2 ReA o y | ReA 2 ReA ) 

Those in large braces {···} are newly added CPT-violating terms. Relations between
the redefined $\epsilon$, $\epsilon'$ and $\eta_{00}$, $\eta_{+-}$ from Eq. (16.120) do not change.

If we approximate $\Gamma_{K^0} - \Gamma_{\bar K^0}$ by its $2\pi$ intermediate states, a calculation similar
to what we carried out to derive $\Gamma_S \simeq 2(\mathrm{Re}A_0)^2\rho$ (see Eq. (16.113)) leads to

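$\Gamma_{K^0} - \Gamma_{\bar K^0} \simeq \rho\left[\{|A(K^0 \to \pi^+\pi^-)|^2 + |A(K^0 \to \pi^0\pi^0)|^2\} - \{|A(\bar K^0 \to \pi^+\pi^-)|^2 + |A(\bar K^0 \to \pi^0\pi^0)|^2\}\right]$

$= \rho\left[(|A_0 + B_0|^2 + |A_2 + B_2|^2) - (|A_0^* - B_0^*|^2 + |A_2^* - B_2^*|^2)\right]$

$\simeq 4\rho\left[\mathrm{Re}A_0\,\mathrm{Re}B_0 + \mathrm{Re}A_2\,\mathrm{Re}B_2\right]$  (16.172)

$= 2\Gamma_S\left[\frac{\mathrm{Re}B_0}{\mathrm{Re}A_0} + \left(\frac{\mathrm{Re}A_2}{\mathrm{Re}A_0}\right)^2 \frac{\mathrm{Re}B_2}{\mathrm{Re}A_2}\right]$

$\simeq 2\Gamma_S\,\frac{\mathrm{Re}B_0}{\mathrm{Re}A_0}$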
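Substituting Eq. (16.172) into Eq. (16.164), we obtain

$\delta - \frac{\mathrm{Re}B_0}{\mathrm{Re}A_0} \simeq i \sin\phi_{SW}\, e^{i\phi_{SW}} \left[\frac{m_{K^0} - m_{\bar K^0}}{2\Delta m} + \frac{\mathrm{Re}B_0}{\mathrm{Re}A_0}\right]$  (16.173)

Relations between $\eta$, $\epsilon$, $\epsilon'$, $\delta$ and $\mathrm{Re}B_0/\mathrm{Re}A_0$ are depicted in Fig. 16.14.

We see that $\delta - \mathrm{Re}B_0/\mathrm{Re}A_0$ is parallel to $\delta_\perp$, namely it is almost perpendicular to the
superweak direction in the $2\pi$ dominance limit ($\delta\phi = 0$). In this approximation,

Figure 16.14 Relations between $\eta$, $\epsilon$, $\epsilon'$, $\delta$ and $\mathrm{Re}B_0/\mathrm{Re}A_0$ when CPT invariance is not assumed (not to scale).

Experimentally, $\phi_\epsilon - \phi_{SW} \simeq \phi_{+-} - \phi_{SW} = 0.61 \pm 0.62 \pm 1.01°$.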

An estimate of the CPT-violating $2\pi$ decay amplitude can be obtained as follows.
From Eq. (16.172)

$\frac{\mathrm{Re}B_0}{\mathrm{Re}A_0} \simeq \frac{\Gamma_{K^0} - \Gamma_{\bar K^0}}{2\Gamma_S} = \frac{\delta_\parallel}{\cos\phi_{SW}}$  (16.175a)

As was discussed in Eq. (16.138), $\arg(\epsilon') \simeq \phi_{SW}$ and Eq. (16.171d) tells us that



(16.140) \T/+-\ 



\ReA 2 ReA , 



Substituting experimental values of $\Delta\phi$ and $\delta_\parallel$ allows $\mathrm{Re}B_0/\mathrm{Re}A_0$ and $\mathrm{Re}B_2/\mathrm{Re}A_2$
to be estimated from Eqs. (16.175) [107]



= 2.6 ± 2.9 x 10" 4 , 



= 1.3 ± 4.5 x 10"' 



(16.176)

We conclude that under the present circumstances, there is no sign of CPT violation.



16.4.4 

Possible Violation of Quantum Mechanics 



Theoretically, CPT invariance is deeply rooted in quantum field theory. Its violation
means that one of Lorentz invariance, locality, unitarity or the principles of quantum
mechanics is not correct. Unitarity is related to the conservation of probability and
Lorentz invariance (or its extension, Poincaré invariance) to the conservation of energy
or angular momentum; therefore the most suspect assumption was locality
in the strong interaction. However, the success of the Standard Model makes this
possibility very small. With the advent of quantum gravity as exemplified by black
hole radiation, it was pointed out that the existence of a quantum gravitational field
might possibly affect the propagation of quantum mechanical waves [201]. In other
words, when we discuss CPT violation, we have to dare to intrude into the sacred
area of quantum mechanics. Ellis and others in [130, 210] proposed experiments to
test the breakdown of quantum coherence due to quantum gravity. In their treatment,
CPT violation is incorporated in the breakdown of quantum mechanics,
where a pure state develops into a mixed state. They expressed the effect in terms
of three parameters $\alpha$, $\beta$, $\gamma$. These parameters have dimensions of mass and are
expected to have values $\sim m_K^2/m_{\mathrm{Planck}} \sim 2 \times 10^{-20}$ GeV. Phenomenologically, their
effect most conspicuously appears as the modification (destruction) of macroscopic
interference effects, for example in neutron propagation and $K \to 2\pi$, $3\pi$, $\pi l\nu$.
In the neutral kaon decays, the effect could appear as the loss of coherence of the
neutral kaon wave function. Figure 16.15 shows how this quantum decoherence
effect would appear in the asymmetry of $K \to 2\pi$ and $K \to \pi l\nu$ [107], which we
have already shown in Fig. 16.9 and Fig. 16.1. So far no breakdown of quantum
mechanics has been observed. The experimental data can set upper limits to these
parameters, which are approaching the estimated values of $\sim 10^{-20}$ GeV.
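
The quoted scale $m_K^2/m_{\mathrm{Planck}}$ can be checked with a one-line estimate. The following is a minimal sketch; the kaon and Planck mass values are assumed inputs, not taken from the text.

```python
# Order-of-magnitude estimate of the decoherence scale m_K^2 / m_Planck.
# Both masses below are assumed approximate values (GeV), not from the text.
M_KAON_GEV = 0.4976
M_PLANCK_GEV = 1.22e19


def decoherence_scale_gev(m_gev: float = M_KAON_GEV,
                          m_planck_gev: float = M_PLANCK_GEV) -> float:
    """Return m^2 / m_Planck in GeV."""
    return m_gev ** 2 / m_planck_gev


if __name__ == "__main__":
    print(f"m_K^2 / m_Planck = {decoherence_scale_gev():.2e} GeV")
```

With these inputs the result is about $2 \times 10^{-20}$ GeV, the same order as the value stated above.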

Another test bench for the possible breakdown of quantum mechanics is provided
by DAΦNE and the B-factories, where correlated pairs of $K^0$-$\bar K^0$ (or $K_S$-$K_L$) and
$B^0$-$\bar B^0$ are copiously produced [56, 236]. Here, one can also test quantum entanglement
or the Einstein-Podolsky-Rosen (EPR) paradox [128] in connection with
quantum decoherence. For details we refer the reader to [60].




Figure 16.15 The measured decay-rate asymmetries (a) $A_{+-}$ and (b) $A_{\Delta m}$ analyzed for a possible
loss of coherence. The dashed lines represent the expected asymmetries with positive values
of $\alpha$, $\beta$ and $\gamma$, which are 10 times larger than the obtained upper limits [107].
16.5
Experiments on CP Parameters



Since all CP- and/or CPT-violation tests depend so much on precision experiments,
we need to study how they are carried out and understand their limitations. We
should also be alert to opportunities for devising improved or new techniques or
proposing new methods. Most of the CP-violation measurements made before 2000 were carried
out by fixed-target experiments, in which a high-energy proton beam bombarded
a target to produce neutral kaons (charged particles were removed by sweeping
magnets). Because of the collimated kaon beam with high momentum, most experiments
had to optimize their apparatus for specific purposes. With the advent
of collider accelerators, a comprehensive study of CP phenomena became possible.
For some specific purposes, however, where a high flux of the kaon beam is required,
the fixed-target experiments still have an advantage, although by no means an easy
one. It took two large groups (NA31 → NA48 and E731 → KTeV) 20 years to determine
just one number, the direct CP violation parameter $\epsilon'$. We will describe how
the experiments were done.

Two collider detector experiments on kaons are worth mentioning: CPLEAR at
CERN and KLOE at Frascati in Italy. Since they deal with low-energy kaons, experiments
with a large solid angle (almost $4\pi$) and covering decay regions of 0-20 $K_S$
lifetimes were possible.

The former used a low-energy antiproton beam to tag neutral kaons $K^0$ ($\bar K^0$) by
$\bar p p \to K^{\pm}\pi^{\mp}\bar K^0(K^0)$, and the latter produced correlated kaon pairs by $e^- + e^+$ at $\sqrt{s} = m_\phi$:

$e^- + e^+ \to \phi \to K^0\bar K^0$ or $K^+K^-$ with $J^{PC} = 1^{--}$  (16.177)

The detector concept is very similar in both experiments. Here we describe the
CPLEAR detector.

16.5.1 
CPLEAR 

CPLEAR was an experiment to study CP violation in kaon decays. Tagged $K^0$ and
$\bar K^0$ were produced by stopping low-energy $\bar p$ in a pressurized (16 bar) hydrogen
gas target. The beam had a flux of $10^6$ $\bar p$ s$^{-1}$ and its momentum was 200 MeV/c
with a spread of $5 \times 10^{-4}$. The accumulated total number of $\bar p$ was $1.1 \times 10^{13}$. The
use of liquid hydrogen was avoided to minimize the amount of matter in the decay
volume. The incoming $\bar p$ beam was delivered by the Low Energy Antiproton Ring
(LEAR) facility at CERN and the size of the beam was about 3 mm FWHM. $K^0$ and
$\bar K^0$ were identified by the reactions

$\bar p$ (at rest) $+ p \to (K^-\pi^+) + K^0$
$\qquad\qquad\qquad \to (K^+\pi^-) + \bar K^0$

each having a branching ratio of $\approx 2 \times 10^{-3}$. The conservation of strangeness in the
strong interaction dictates that a $K^0$ is accompanied by a $K^-$ and a $\bar K^0$ by a $K^+$.
The highest momentum of the kaon was ~750 MeV/c, the mean decay length
of $K_S$ was 4 cm and the cylinder volume with radius of $\approx$ 60 cm corresponded to
roughly 20 $K_S$ mean flight paths, covering the interesting interference region. The
whole detector was embedded in a warm, i.e. normal conducting, solenoidal magnet,
which provided a 0.44 T uniform field [109].

Figure 16.16 CPLEAR detector. Longitudinal view [107, 109].
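
The quoted $K_S$ decay length follows from $\beta\gamma c\tau = (p/m)\,c\tau$. The sketch below evaluates it at the highest kaon momentum; the kaon mass and $c\tau_S$ are assumed inputs, not values given in the text.

```python
# Mean decay length of a K_S in the laboratory: L = beta*gamma*c*tau = (p/m)*c*tau.
# Assumed inputs (not from the text): kaon mass and K_S c*tau.
M_K0_MEV = 497.6      # neutral kaon mass in MeV/c^2
C_TAU_KS_CM = 2.684   # K_S proper decay length c*tau in cm


def mean_decay_length_cm(p_mev: float,
                         mass_mev: float = M_K0_MEV,
                         c_tau_cm: float = C_TAU_KS_CM) -> float:
    """Return the mean laboratory decay length in cm for momentum p (MeV/c)."""
    return (p_mev / mass_mev) * c_tau_cm


if __name__ == "__main__":
    p_max = 750.0  # highest kaon momentum quoted in the text, MeV/c
    print(f"K_S decay length at {p_max:.0f} MeV/c: "
          f"{mean_decay_length_cm(p_max):.2f} cm")
```

At 750 MeV/c this gives about 4 cm. Slower kaons have proportionally shorter decay lengths, so a fixed detector radius spans more mean flight paths for them.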




Figure 16.17 Display of CPLEAR events. $\bar p p \to K^-\pi^+K^0$ with the neutral kaon decaying
to (a) $\pi^+\pi^-$ and (b) $e^-\pi^+\bar\nu$. Parts of detectors: T; gas target, S2, C, S1; Cherenkov
and scintillator counters that make a particle identifier. Others can be identified by
comparing with Fig. 16.16 [107, 109].
The general layout is shown in Figure 16.16. A series of cylindrical tracking detectors
(drift chambers, streamer tubes, etc.) provided information about the trajectories
of charged particles to determine their charge signs, momenta and positions.
The reconstructed track resolution was < 350 μm in $r$ and $r\phi$ and 2 mm in
$z$; 0.05% < $\Delta p/p$ < 0.1%. Particle identification (PID) was carried out by a combination
of threshold-type Cherenkov detectors, to identify charged kaons, and
scintillation counters, which measured the time of flight and $dE/dx$. The outermost
detector was a lead/gas-tube sampling calorimeter (ECAL) to detect photons
produced in $\pi^0$ decays.

Two transverse views of the detector together with typical event displays are
shown in Fig. 16.17. Both of them show $K^-\pi^+$ tracks originating in the hydrogen
target. In Fig. 16.17a, $K^0$ has decayed to two charged pions while in (b) it has
decayed according to $K^0 \to e^-\pi^+\bar\nu$. The total number of events measured was
6.4 x 10 s . 

16.5.2 
NA48/KTeV 

It may be surprising to learn that at the turn of the century, 30 years after their
discovery, all the known CP-violation phenomena were still explained by a single
number $\epsilon$ and were consistent with the superweak model, which denied the existence
of direct CP violation in the decay or in any place other than in neutral kaon
decays. Although the Standard Model predicted its existence and demonstrated its
validity in the B-meson factory experiments later, proof of direct CP violation, also
known as $\epsilon'$, was one of the biggest issues at the turn of the century. The issue was
settled by two definitive experiments, NA48 [282] at CERN and KTeV [244] at Fermilab.
They are good examples of how modern state-of-the-art experiments using
fixed targets are carried out and deserve some detailed explanation.

Assuming CPT invariance, the effect can be tested experimentally by measuring
the double ratio of $K_L$ and $K_S$ decaying to both $\pi^+\pi^-$ and $\pi^0\pi^0$:

$\frac{\Gamma(K_L \to \pi^+\pi^-)/\Gamma(K_S \to \pi^+\pi^-)}{\Gamma(K_L \to \pi^0\pi^0)/\Gamma(K_S \to \pi^0\pi^0)} = \left|\frac{\eta_{+-}}{\eta_{00}}\right|^2 \simeq 1 + 6\,\mathrm{Re}\left(\frac{\epsilon'}{\epsilon}\right)$  (16.178)

$\mathrm{Re}(\epsilon'/\epsilon) \simeq 1.6 \times 10^{-3}$  (16.179)

The above number ($10^{-3}$) may not seem impressive, but in terms of the decay asymmetry,
which is represented by the value of $\mathrm{Re}(\epsilon')$ itself (see Problem 16.7) rather
than its ratio to $\epsilon$, one realizes the difficulty of the experiment:

$A_f = \frac{\Gamma(\bar K^0 \to f) - \Gamma(K^0 \to f)}{\Gamma(\bar K^0 \to f) + \Gamma(K^0 \to f)} \simeq \begin{cases} -2\,\mathrm{Re}(\epsilon') = -5.2 \times 10^{-6} & f = \pi^+\pi^- \\ +4\,\mathrm{Re}(\epsilon') = 10.4 \times 10^{-6} & f = \pi^0\pi^0 \end{cases}$  (16.180)
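
The size of these effects can be reproduced numerically. The sketch below assumes $\mathrm{Re}(\epsilon) \approx 1.62 \times 10^{-3}$ (an input not given in this section) and that $\epsilon'/\epsilon$ is nearly real, so that $\mathrm{Re}(\epsilon') \approx \mathrm{Re}(\epsilon'/\epsilon)\,\mathrm{Re}(\epsilon)$; the function names are illustrative.

```python
# Double ratio and decay asymmetries implied by Re(eps'/eps).
RE_EPS_PRIME_OVER_EPS = 1.6e-3   # value quoted in the text
RE_EPS = 1.62e-3                 # assumed Re(epsilon), not from this section


def double_ratio(re_ratio: float) -> float:
    """|eta_+-/eta_00|^2 to first order in eps'/eps."""
    return 1.0 + 6.0 * re_ratio


def decay_asymmetries(re_ratio: float, re_eps: float = RE_EPS):
    """Return (A for pi+pi-, A for pi0pi0), assuming eps'/eps is nearly real."""
    re_eps_prime = re_ratio * re_eps
    return -2.0 * re_eps_prime, 4.0 * re_eps_prime


if __name__ == "__main__":
    a_charged, a_neutral = decay_asymmetries(RE_EPS_PRIME_OVER_EPS)
    print(f"double ratio - 1 = {double_ratio(RE_EPS_PRIME_OVER_EPS) - 1:.4f}")
    print(f"A(pi+pi-) = {a_charged:.2e}, A(pi0pi0) = {a_neutral:.2e}")
```

The double ratio deviates from unity by about 1%, whereas the individual asymmetries are at the $10^{-5}$ to $10^{-6}$ level, which illustrates why the double ratio is the preferred observable.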



Figure 16.18 Scheme of the beam arrangement for the NA48 experiments. The $K_L$ beam
comes straight from the $K_L$ target. The attenuated primary beam is bent and hits the $K_S$
target 120 m downstream of the $K_L$ target. AKS is the $K_S$ anti-counter located at the exit
of the $K_S$ collimator. The two beams are aimed at the center of the detector with crossing
angle 0.6 mrad. Note the difference between the vertical and horizontal scales [282].



In such experiments, one needs not only to accumulate high statistics using pre- 
cision detectors, which can only be constructed using modern technology, but also 
to minimize the systematic error, which constrains the ultimate precision. Mea- 
surement of the double ratio has the inherent advantage of canceling many sys- 
tematics. By measuring the four decay modes simultaneously, any time variation 
or background correction common to all four modes cancels out. 

Both experiments are similar in detector concept, differing mainly in the preparation
of the beam. NA48 makes separate $K_L$ and $K_S$ lines that converge at the
detector. KTeV makes two parallel $K_L$ beam lines and uses one to regenerate a
$K_S$ beam. Figure 16.18 shows the layout of the NA48 beams. A proton beam
(~$1.5 \times 10^{12}$ protons per pulse, 450 GeV/c), delivered by the SPS at CERN, impinges
on a target that is located 126 m upstream of the decay volume. The charged
components are removed by sweeping magnets. The neutral beam consists almost
entirely of photons, neutrons, and $K_L$ mesons, with a few residual $K_S$, $\Lambda$ and
$\Xi$ particles at the highest energies. The remaining primary beam goes through
a bent crystal and about $10^{-5}$ of it, satisfying the channeling condition,16) goes
through the tagger and hits the $K_S$ target just 6 m upstream of the decay volume.
The tagger is used to separate $K_S$ events from those of $K_L$ origin in the time domain.
The energy of the neutral kaons is 70-170 GeV with average 110 GeV. The
whole target and the collimator system are aligned along an axis pointing to the
center of the detector 120 m away, such that the two beams intersect at this point
with an angle of 0.6 mrad.

16) If a particle goes through a single crystal at an angle close to a major crystal direction, coherent
scattering on the lattice nuclei forces the particle to follow the lattice direction. The role of the
bent crystal is similar to that of a bending magnet, with reduced intensity and excellent selection
of well-defined emittance. Its effect was used to separate desired particles from the halo of the
beam [122].

Figure 16.19 Top view of the KTeV (E832) detector. The evacuated decay volume extends to
z = 159 m [244].

In KTeV, on the other hand, a primary proton beam from the Fermilab Tevatron [$3 \times 10^{12}$ protons per
pulse, 800 GeV/c in a 20 s extraction cycle ("spill")] makes two
identical $K_L$ beams, one of which hits the regenerator target 120 m downstream
of the primary target. The average kaon momentum is 70 GeV/c, somewhat lower
than that of NA48 because of the different primary energy and beam distance from
production to decay point. The regenerator consists of 84 scintillator modules of
10 × 10 × 2 cm³, each viewed by a photomultiplier. The last module defines the
beginning of the decay volume. The total length is 170 cm, corresponding to about
two hadronic interaction lengths, and the magnitude of the regeneration amplitude
is $|\rho| \approx 0.03$.

For the detector we mainly explain the KTeV apparatus, with comparisons where
appropriate, because the concepts of KTeV and NA48 are very similar. Figure 16.19
shows the top view of the KTeV (E832) detector. The evacuated decay volume extends
to z = 159 m.

Two typical event displays, one showing $K \to \pi^+\pi^-$ and the other $K \to 2\pi^0$, are
shown in Fig. 16.20.



Reconstruction of $K \to \pi^+\pi^-$ A magnetic spectrometer composed of a magnet,
which gives a kick of 412 MeV/c, drift chambers and triggering hodoscopes reconstructs
trajectories of charged particles with the momentum resolution $\sigma_p/p =
[1.7 \oplus (p/14)] \times 10^{-3}$, where $p$ is the momentum in GeV/c and $\oplus$ denotes addition
in quadrature. Kaon identification
was performed by reconstructing the invariant mass (16.44) of the two charged
particles (see Fig. 16.20, left), using the same method as the first CP-violation experiment
by Fitch and Cronin, but with far better accuracy. The position resolution
is ~100 μm and the mass resolution is $\delta m_K \approx 1.6$ MeV/c². Within the acceptance
window of $488 < m_K < 508$ MeV/c², backgrounds for $K_L \to \pi^+\pi^-$
(0.1%) come mainly from $K \to \pi e(\mu)\nu$, where $e(\mu)$ were misidentified as pions.
The $p_T^2$ distribution can separate $K_S \to 2\pi$ of the coherently regenerated beam
(cut on $p_T^2 < 250$ MeV²/c²) from that of the background (diffraction scattering), as
was shown in Fig. 16.5. It also tells us that the major background for $K_S \to \pi^+\pi^-$
(0.087%) comes from regenerator scattering.

Figure 16.20 Event display: Upper figures show the CsI configuration and deposited energy clusters.
Lines are trajectories (shown in lower figures) of reconstructed pions projected on the CsI
plane with vertex at crossing. (a) $K \to \pi^+\pi^-$; (b) $K \to \pi^0\pi^0 \to 4\gamma$ [391].
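
The quoted spectrometer resolution combines a constant term and a momentum-dependent term in quadrature. A minimal sketch of how it scales with momentum follows; the function name is illustrative, and reading the two terms as multiple scattering and position measurement is an interpretation, not stated in the text.

```python
import math


def ktev_momentum_resolution(p_gev: float) -> float:
    """sigma_p/p = [1.7 (+) p/14] x 10^-3, with (+) meaning quadrature sum.

    The constant term is commonly attributed to multiple scattering and the
    linear term to position measurement; that reading is an assumption here.
    """
    return math.hypot(1.7, p_gev / 14.0) * 1e-3


if __name__ == "__main__":
    for p in (10.0, 35.0, 70.0):
        print(f"p = {p:5.1f} GeV/c  sigma_p/p = {ktev_momentum_resolution(p):.2e}")
```

In this parametrization the two terms are equal at about 24 GeV/c (1.7 × 14), above which the momentum-dependent term dominates.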

Reconstruction of $K \to \pi^0\pi^0$ A CsI electromagnetic calorimeter is composed of
2232 inner cells (2.5 × 2.5 cm²) and 868 larger outer cells (5 × 5 cm²), covering a total
area of 1.9 × 1.9 m² with two holes in the center through which the beams pass. The
configuration is shown in Fig. 16.20. The depth of each crystal is 50 cm (27 $X_0$). The
energy resolution is $\sigma_E/E = 2\%/\sqrt{E} \oplus 0.4\%$, where $E$ is in GeV. $K \to \pi^0\pi^0 \to 4\gamma$
deposits energy at four places (see Fig. 16.20, right). The energies $E_1$, $E_2$ of the two
photons satisfy the relation

$m_{\pi^0}^2 = (E_1 + E_2)^2 - (\mathbf{p}_1 + \mathbf{p}_2)^2 = 2E_1E_2(1 - \cos\theta) = 4E_1E_2\sin^2(\theta/2) \simeq E_1E_2\,r_{12}^2/z^2$  (16.181)

where $z$ is the distance to the decay position, $r_{12}$ is the distance between the two
energy clusters in the detector and $\theta$ is their opening angle from the decay point.
Assuming any $2\gamma$'s come from $\pi^0$ decay, the vertex position is reconstructed. Out
of three ($= {}_4C_2/2$) pair combinations made of $4\gamma$'s, the pairing whose two reconstructed
vertices have the minimum separation is chosen and $m_K$ is reconstructed (Fig. 16.21b). NA48 uses a liquid krypton
ionization chamber and its energy resolution is $\sigma_E/E = (3.2 \pm 0.2)\%/\sqrt{E} \oplus (9 \pm
1)\%/E \oplus (0.42 \pm 0.05)\%$. The vertex position was determined with a method similar
to KTeV using the formula $d = \sqrt{\sum_{i>j} E_iE_j r_{ij}^2}/m_K$ with assumed $m_K$. The resolution
of the longitudinal vertex position was on the order of 20-30 cm for KTeV
and ~50 cm for NA48. Backgrounds for $K_L \to 2\pi^0$ (0.484%) come mainly from
$3\pi^0$, where $2\gamma$'s are missing, and for $K_S \to 2\pi^0$ (1.235%) mainly from the regenerator
and collimator scattering. The backgrounds for neutral decay modes are 10
times larger than those for charged decay modes and are one of the major sources
of errors. This is due to the energy resolution of the electromagnetic calorimeter
and inapplicability of the $p_T^2$ cut to eliminate the regenerator scatterings.
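
The vertex reconstruction from calorimeter clusters can be sketched in a few lines. The code below uses the small-angle relation $m^2 \simeq E_1E_2 r_{12}^2/z^2$ for a photon pair and the NA48-type sum over all cluster pairs; the masses are assumed inputs, the function names are hypothetical, and no detector effects are modelled.

```python
import math
from itertools import combinations

M_PI0_GEV = 0.1349768   # neutral pion mass (assumed input), GeV/c^2
M_K0_GEV = 0.497611     # neutral kaon mass (assumed input), GeV/c^2


def pi0_vertex_distance(e1_gev: float, e2_gev: float, r12_m: float) -> float:
    """Distance z (m) from the calorimeter to the decay point for one photon
    pair, from m_pi0^2 ~ E1*E2*r12^2/z^2 (small opening angle)."""
    return r12_m * math.sqrt(e1_gev * e2_gev) / M_PI0_GEV


def kaon_vertex_distance(clusters) -> float:
    """NA48-type estimate d = sqrt(sum_{i>j} E_i E_j r_ij^2) / m_K.

    clusters: iterable of (energy_GeV, x_m, y_m) tuples on the calorimeter face.
    """
    total = 0.0
    for (ei, xi, yi), (ej, xj, yj) in combinations(list(clusters), 2):
        total += ei * ej * ((xi - xj) ** 2 + (yi - yj) ** 2)
    return math.sqrt(total) / M_K0_GEV


if __name__ == "__main__":
    print(f"z = {pi0_vertex_distance(20.0, 20.0, 0.2):.2f} m")
```

Because $z$ scales linearly with the cluster energies, the calorimeter energy scale translates directly into the longitudinal vertex position, which is one reason the neutral mode is sensitive to the calorimeter calibration.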

Other Detectors At the back end are located steel absorbers and muon identifiers.
The decay volume and some parts of the detector are equipped with an extensive
array of photon veto counters to reduce trigger rates and backgrounds and to define
sharp apertures and edges that limit the detector acceptance.

Overall Correction Separation of regenerated $K_S \to 2\pi$ events from those of CP-violating
$K_L \to 2\pi$ is physically clear as the two beams are ~20 cm apart. The
solid angle difference due to separation and the fact that they are not aligned along
the center of the detector could be a potential source of systematic errors. The
regenerator also produces some background. KTeV observes four decay modes simultaneously,
frequently switching the regenerator position between
the two beam positions and also reversing the polarity of the magnet. Any time fluctuation
cancels when the ratio is taken and the geometrical difference is averaged out. Note,
however, that some backgrounds contribute differently to the four modes, hence do not
cancel and have to be corrected carefully for each mode. Use of high resolution
detectors is considered essential for removing such background.
The total numbers of events accumulated are

$N(K_L \to \pi^0\pi^0) \approx 3.4 \times 10^6$
$N(K_S \to \pi^0\pi^0) \approx 5.6 \times 10^6$
$N(K_L \to \pi^+\pi^-) \approx 11.2 \times 10^6$  (16.182)
$N(K_S \to \pi^+\pi^-) \approx 19.4 \times 10^6$

Total $\approx 40 \times 10^6$
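
These event counts set the scale of the achievable statistical precision. The following is a naive sketch that assumes pure Poisson fluctuations on each count, ignores backgrounds and acceptance corrections, and uses the first-order relation $R \simeq 1 + 6\,\mathrm{Re}(\epsilon'/\epsilon)$; it is not the experiments' actual error analysis.

```python
import math

# Event counts as listed in the text.
EVENT_COUNTS = {
    "KL->pi0pi0": 3.4e6,
    "KS->pi0pi0": 5.6e6,
    "KL->pi+pi-": 11.2e6,
    "KS->pi+pi-": 19.4e6,
}


def double_ratio_relative_error(counts) -> float:
    """Relative statistical error of the double ratio for Poisson counts."""
    return math.sqrt(sum(1.0 / n for n in counts.values()))


def re_eps_ratio_stat_error(counts) -> float:
    """Statistical error on Re(eps'/eps), using R ~ 1 + 6 Re(eps'/eps)."""
    return double_ratio_relative_error(counts) / 6.0


if __name__ == "__main__":
    print(f"delta R / R      = {double_ratio_relative_error(EVENT_COUNTS):.2e}")
    print(f"delta Re(e'/e)   = {re_eps_ratio_stat_error(EVENT_COUNTS):.2e}")
```

The smallest sample, $K_L \to \pi^0\pi^0$, dominates the sum, so the neutral mode limits the statistical reach.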

Before $\mathrm{Re}(\epsilon'/\epsilon)$ can be extracted from these data, backgrounds must be identified
and subtracted, and acceptance corrections must be determined. Both these tasks
require Monte Carlo simulations of the experiment. Especially important is the
decay vertex distribution, which differs widely between $K_L$ and $K_S$, as shown in
Fig. 16.22. While the $K_L$ decay distribution is almost flat in the decay volume,
that of $K_S$ halves in about 2 m. This could be a potential source of serious systematic
errors. NA48 uses a weighted z-distribution for $K_L$ vertices [weight factor
$\sim e^{-\Delta\Gamma t} = e^{-t(1/\tau_S - 1/\tau_L)}$] to make them similar, which contributes to cancellation
of many systematics but at the expense of statistics. KTeV, on the other hand,
relies on elaborate Monte Carlo simulations to assure the correctness of reconstruction.
Figure 16.23 shows examples of decay distributions for $K \to \pi^+\pi^-$
and $K_L \to \pi e(\mu)\nu$. The agreement is excellent, but the degree of disagreement
indicates the limits of systematic errors, too. In the end the largest single error
comes from the overall reproducibility of the events, exemplified by the $K_L$ decay
distribution in Fig. 16.23.

Figure 16.21 Reconstructed mass distributions of $2\pi$: (a) $\pi^+\pi^-$ invariant mass (MeV/c²), with
backgrounds estimated by Monte Carlo; (b) $\pi^0\pi^0$ invariant mass (MeV/c²), with backgrounds from
decays with photons missing [244, 347].

Figure 16.22 z-vertex distributions for the vacuum beam (dashed) and regenerator beam (line histogram) [244].
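
The lifetime weighting can be illustrated as a function of the distance along the beam. The sketch below converts proper time to flight distance via $t = z/(\beta\gamma c)$; the kaon mass and the $c\tau$ values are assumed inputs, the function names are hypothetical, and it is not NA48's actual implementation.

```python
import math

# Assumed inputs (not from the text).
M_K0_GEV = 0.497611    # neutral kaon mass, GeV/c^2
C_TAU_KS_M = 0.026844  # K_S c*tau in metres
C_TAU_KL_M = 15.34     # K_L c*tau in metres


def decay_length_m(p_gev: float, c_tau_m: float) -> float:
    """Mean laboratory decay length beta*gamma*c*tau = (p/m)*c*tau."""
    return (p_gev / M_K0_GEV) * c_tau_m


def ks_like_weight(z_m: float, p_gev: float) -> float:
    """Weight exp[-t(1/tau_S - 1/tau_L)] that reshapes a K_L decay-vertex
    distribution into a K_S-like one, with t = z/(beta*gamma*c)."""
    lam_s = decay_length_m(p_gev, C_TAU_KS_M)
    lam_l = decay_length_m(p_gev, C_TAU_KL_M)
    return math.exp(-z_m * (1.0 / lam_s - 1.0 / lam_l))


if __name__ == "__main__":
    p = 110.0  # average NA48 kaon energy quoted in the text, GeV
    print(f"K_S decay length: {decay_length_m(p, C_TAU_KS_M):.2f} m")
    for z in (0.0, 5.0, 10.0, 20.0):
        print(f"z = {z:4.1f} m  weight = {ks_like_weight(z, p):.3f}")
```

Events far downstream receive small weights, which is the loss of statistical power mentioned above.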

Both NA31/NA48 and E731/KTeV spent nearly twenty years in deriving their 
final results. During this time, they both learned from experience and from each 
other. The single biggest issue common to both experiments was to determine 
geometrical efficiency and, in the end, their differences were minor compared to 
their similarity. 



Extraction of $\mathrm{Re}(\epsilon'/\epsilon)$ The $2\pi$ events in the $K_S$ beam include the regenerated
component $\propto \rho$ and the CP-violating component $\propto \eta$, which are not distinguishable.
The number of events is proportional to the square of the sum $\eta + \rho$ [see Eq. (16.86)]
and is a function of $\rho/\eta$. The fitting must be carried out including not only
$\mathrm{Re}(\epsilon'/\epsilon)$ but also $\Delta m$, $\Gamma_S$, the phases $\phi_{00}$, $\phi_{+-}$ and $\rho$. The accuracy can be
cross-checked by floating well-known parameters such as $\Gamma_S$ and $\Delta m$ to see if they
agree with the known values. The end results of KTeV and NA48 are [51, 244, 245, 311]

$\mathrm{Re}(\epsilon'/\epsilon) = (2.07 \pm 0.26) \times 10^{-3}$  KTeV  (16.183a)
$\mathrm{Re}(\epsilon'/\epsilon) = (1.47 \pm 0.22) \times 10^{-3}$  NA48  (16.183b)
weighted average $= (1.65 \pm 0.23) \times 10^{-3}$  PDG08  (16.183c)

which clearly established the nonzero value of $\epsilon'$.

Figure 16.23 Vertex distributions of $K_L$ from $K \to \pi^+\pi^-$ (a) and $K \to \pi l\nu$ (b) for both
reconstructed data and Monte Carlo (MC) simulation. The lower figures show the ratio data/MC,
indicating good agreement [244].

16.6 

Models of CP Violation 



In discussing models of CP violation, a primary assumption is CPT invariance.
Experimentally, the relative strength of the CPT-violating strong interaction is estimated
from Eq. (16.169a) to be $< 10^{-18}$. Dividing this number by $\alpha$, the upper
limit of CPT violation in the electromagnetic interaction is found to be $< 10^{-16}$,
and by dividing by $G_F m_K^2$, that of the weak interaction $< 10^{-14}$. These values give
the most stringent test of CPT violation. For the electromagnetic interaction, there
exist tests using the magnetic moments of the electron and the muon

$[g(e^+) - g(e^-)]/g_{\mathrm{average}} = (-0.5 \pm 2.1) \times 10^{-12}$
$[g(\mu^+) - g(\mu^-)]/g_{\mathrm{average}} = (-0.11 \pm 0.12) \times 10^{-8}$

but all other tests give at most $\sim 10^{-4}$. CPT symmetry is known to hold with
excellent accuracy.

Theoretically, CPT invariance has rock-solid foundations, as described in
Sect. 16.4.4. It is very hard to construct a self-consistent theory without CPT
invariance. Therefore, we assume its invariance in the discussions below. In the
following paragraphs we list models that accommodate CP violation (denoted as
$\cancel{CP}$). The models are classified according to where one seeks the origin of CP violation.
Models other than the Standard Model are listed because
the Kobayashi-Maskawa model is considered adequate within the Standard Model, but
not sufficient to explain the matter-antimatter asymmetry of the universe, and
theories beyond the Standard Model, including GUT (grand unified theory) and
superstring models, are hot issues in the world of high-energy physics.

CP Violation in the Strong Interaction The model assumes that $\cancel{CP}$ occurs in two
steps. First, $K_L$ decays to X in a CP-conserving interaction, followed by $X \to 2\pi$
in the strong interaction. If we let $f$ be the strength of the transition amplitude
$X \to 2\pi$, $f$ has to be $\sim 10^{-3}$ to explain $|\eta| \sim 10^{-3}$ and is referred to as the
millistrong force. Since CPT invariance is assumed, the interaction is one of four
possible processes:

(1) vXi (2) ^,T (3) P'.CT' (4) fXi 

In the electromagnetic transitions of atomic levels, a small mixture of different par- 
ity components induces interference with the main component of the amplitude 
and rotates the direction of polarization. This tiny polarization shift is experimen- 
tally observed, but can be explained quite well by the neutral current effect of the 
Standard Model, hence models (2), (3) and (4), which include P violation, are ex- 
cluded [50, 70]. The possibility of millistrong interaction remains in principle, but 
no one has conceived of a natural way to embed CP violation in QCD and the pos- 
sibility does not seem to be taken seriously. 

CP Violation in the Electromagnetic Interaction If the electromagnetic interaction
is the cause of CP violation, $\alpha \sim |\eta|$ means CP has to be violated maximally.
However, experimentally observed asymmetry values in the decays of $\eta$ mesons,
$\eta \to \pi^+\pi^-\pi^0$ and $\pi^+\pi^-\gamma$, give $(0.9 \pm 1.7) \times 10^{-3}$ and $(9 \pm 4) \times 10^{-3}$, respectively [311]. Therefore
C violation in the electromagnetic interaction is small if any. The P-violating component
does not exist beyond that allowed by the Standard Model, as was explained
in the previous section. Therefore, the possibility of CP violation originating in the
electromagnetic interaction is excluded.



Superweak Model The superweak model assumes an interaction $H_{SW}$ with
$\Delta S = 2$ is the cause of CP violation. The CP-violating term $\langle K^0 | H_{SW} | \bar K^0 \rangle$ appears
in the mass matrix $\Lambda_{ij}$. Hermiticity of $H_{SW}$ dictates the CP-violating component
must satisfy $\Lambda_{21} = \Lambda_{12}^*$. Then from Eq. (16.67)

A 2 i-A 12 lm({K \H sw \K°)) 

^^21^ = - ( rs-rl)/ 2 -iAm (16 - 185) 

Therefore, the relation $\phi_\epsilon = \phi_{SW} = \tan^{-1}(2\Delta m/\Delta\Gamma)$ holds exactly (strictly speaking,
$+\pi$ is possible theoretically but excluded experimentally). Calling the strength
of the superweak interaction $\sim fG_F$, we have $f \sim \epsilon\Delta m/(G_F m_K^3) \sim 10^{-11}$. The effect
appears only in the mass matrix and does not appear in the $\Delta S = 1$ decay amplitude,
hence $\epsilon' = 0$. Therefore, the following equalities are exact in the superweak
model:

$|\eta_{+-}| = |\eta_{00}|$,  $\phi_{+-} = \phi_{00} = \phi_\epsilon = \phi_{SW}$  (16.186)

In this model, all the CP-violating effects can be described by a single parameter $\epsilon$
and do not appear in other mesons, including B mesons. Although the superweak
model was finally excluded by NA48/KTeV and by B-factory experiments, historically
it played a prominent role for nearly 40 years.
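
The superweak phase and the order of magnitude of the superweak coupling can be checked numerically. The sketch below assumes approximate values for $\Delta m$, $\tau_S$, $\tau_L$, $|\epsilon|$, $G_F$ and $m_K$ (none of them taken from this section) and the dimensionally consistent form $f \sim \epsilon\Delta m/(G_F m_K^3)$.

```python
import math

# Assumed approximate inputs (not from this section).
HBAR_GEV_S = 6.582e-25        # hbar in GeV s
DELTA_M_PER_S = 0.5292e10     # K_L - K_S mass difference in hbar/s
TAU_S = 0.8954e-10            # K_S lifetime in s
TAU_L = 5.116e-8              # K_L lifetime in s
ABS_EPSILON = 2.228e-3        # |epsilon|
G_FERMI = 1.1664e-5           # Fermi constant in GeV^-2
M_K0_GEV = 0.4976             # neutral kaon mass in GeV


def superweak_phase_deg() -> float:
    """phi_SW = arctan(2*Delta_m / Delta_Gamma), Delta_Gamma = Gamma_S - Gamma_L."""
    delta_gamma = 1.0 / TAU_S - 1.0 / TAU_L
    return math.degrees(math.atan2(2.0 * DELTA_M_PER_S, delta_gamma))


def superweak_strength() -> float:
    """Order-of-magnitude estimate f ~ eps * Delta_m / (G_F * m_K^3)."""
    delta_m_gev = DELTA_M_PER_S * HBAR_GEV_S
    return ABS_EPSILON * delta_m_gev / (G_FERMI * M_K0_GEV ** 3)


if __name__ == "__main__":
    print(f"phi_SW = {superweak_phase_deg():.1f} deg")
    print(f"f      ~ {superweak_strength():.1e}")
```

With these inputs $\phi_{SW}$ comes out near 43.5° and $f$ is a few times $10^{-12}$, that is, of order $10^{-11}$, which is why the interaction is called superweak.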

Milliweak Interaction The milliweak model assumes the CP-violating effect occurs
in the $\Delta S = 1$ interaction, whose strength is $\sim 10^{-3}$. There are varieties of models.
One model increases the number of Higgs particles to introduce complex numbers
in the Lagrangian. Another is a left-right symmetric model, in which parity
violation is assumed spontaneous. Since the Standard Model has been established,
these have become exotic models that go beyond the Standard Model but are discussed
in GUT in the context of the matter-dominant universe.



Origin of CP Violation in the Standard Model The CP violation effect in the Standard
Model belongs to the class of milliweak interactions. CP violation can be introduced
into the Lagrangian by making the coupling constant complex. In the
Standard Model, the weak interaction is mediated by $W^\pm$ and $Z^0$. They couple
to weak isospin doublets and the charged current interaction Lagrangian has the
form of Eq. (15.200), namely

Xm = -^W L Y"(r+W+ + T-W~)W L 

\uA M M (16 ' 187) 

If we denote $u_j$ ($j = 1, \ldots, 3$) $= (u, c, t)$ and $d_k$ ($k = 1, \ldots, 3$) $= (d, s, b)$, the
interaction Lagrangian of the charged W boson can be written as

$\mathcal{L}_W = -\frac{g_W}{\sqrt{2}} \sum_{j,k} \left[\bar u_{jL}\gamma^\mu V_{jk} d_{kL} W^+_\mu + \bar d_{kL}\gamma^\mu V^*_{jk} u_{jL} W^-_\mu\right]$  (16.188)

where $V_{jk}$ is the KM (Kobayashi-Maskawa) matrix element introduced in
Eq. (15.160b). If we apply the CP transformation [Appendix F, Eq. (F.4)] to the above
Lagrangian, it is transformed to

$\mathcal{L}_W^{CP} = -\frac{g_W}{\sqrt{2}} \sum_{j,k} \left[\bar d_{kL}\gamma^\mu V_{jk} u_{jL} W^-_\mu + \bar u_{jL}\gamma^\mu V^*_{jk} d_{kL} W^+_\mu\right]$  (16.189)

We see that CP invariance requires the KM matrix $V_{jk}$ to be real. As the fields $u_j$, $d_k$
have the freedom to change their phases, many KM elements $V_{jk}$ can be made real using
this freedom. Let us see how many complex numbers the KM matrix has. In
general, an $N \times N$ complex matrix has $2N^2$ independent real variables. The unitarity
requirement reduces the number to $N^2$. In this $N^2$-dimensional space of real
numbers, ${}_NC_2 = N(N-1)/2$ space rotations can be defined in terms of rotation
angles $\theta_j$. The rest of the variables, $N(N+1)/2$ in number, become phase angles $\phi_k$.
On the other hand, the $2N$ quark fields have the freedom to change their phase as

$u_j \to e^{i\alpha_j} u_j$  (16.190a)
$d_k \to e^{i\beta_k} d_k$  (16.190b)

and $2N - 1$ phases, excluding the overall common phase, can be removed by redefinition
of the quark fields. In the end,

$\frac{N(N+1)}{2} - (2N - 1) = \frac{(N-1)(N-2)}{2}$  (16.192)

phases remain. Therefore for $N \le 2$, all the matrix elements can be made real and
no CP violation effect appears unless put in explicitly by hand. The requirement
$N \ge 3$, which follows because CP is violated, was the original motivation for introducing the KM matrix. As the
measurement of the $Z^0$ decay width shows that the number of neutrinos with $m_\nu <
m_Z/2$ is limited to three (see Sect. 15.7.4), it is natural to think $N = 3$, as is required
by the Standard Model. In this case the KM matrix is expressed by three rotation
angles and one phase angle, as in Eq. (15.162). In this convention, the dominant
contribution of the phase $\delta$ is contained in $V_{ub}$ and $V_{td}$.
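
The parameter counting above can be tabulated for any number of generations. The following is a minimal sketch; the function name and the returned dictionary layout are illustrative choices.

```python
def km_parameter_count(n_generations: int) -> dict:
    """Count the real parameters of an N x N unitary quark mixing matrix.

    A unitary matrix has N^2 real parameters: N(N-1)/2 rotation angles and
    N(N+1)/2 phases. Rephasing the 2N quark fields removes 2N - 1 phases
    (the overall common phase has no effect on the matrix).
    """
    n = n_generations
    total = n * n
    angles = n * (n - 1) // 2
    phases_before = total - angles
    removable = 2 * n - 1
    physical_phases = phases_before - removable
    return {
        "angles": angles,
        "phases_before_rephasing": phases_before,
        "removable_phases": removable,
        "physical_phases": physical_phases,
    }


if __name__ == "__main__":
    for n in (2, 3, 4):
        counts = km_parameter_count(n)
        print(f"N = {n}: {counts['angles']} angles, "
              f"{counts['physical_phases']} CP-violating phase(s)")
```

For two generations no physical phase survives, while three generations leave exactly one, in line with the argument above.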

Therefore, in the Standard Model, Feynman diagrams that contain $V_{ub}$ or $V_{td}$ are
CP-violating amplitudes. Referring to the Standard Model description of the $K^0 \leftrightarrow \bar K^0$
transition amplitude, Fig. 16.3, when the intermediate states $i$ or $j$ are top quarks,
the transition $t \leftrightarrow d$ attaches $V_{td}$ to the amplitude and it becomes a CP-violating
amplitude. But the decay amplitudes are on the mass shell; no top quarks appear in the
decay diagrams in the lowest order approximation, referred to as the tree diagram,
shown in Fig. 16.24a,b. In order to insert the top quark in the intermediate states,
the higher order diagram or the so-called penguin diagram has to be introduced
(Fig. 16.24c).

Figure 16.24 $K \to \pi^+\pi^-$ decay amplitudes. Tree diagrams (a,b) do not contain KM matrix
elements with a complex phase. The penguin diagram (c) contains the top quarks in the
intermediate states. The $V_{td}$ is complex, hence diagram (c) violates CP.

The Standard Model clarifies the dynamical structure of CP violation and its calculation
gives the right order of magnitude for the experimental values, but owing
to QCD complexity accurate estimates are hard to obtain.
Hadron Structure



Historical Overview 



We have succeeded in constructing the quark model from hadron spectroscopy. 
What it can tell us is about the static nature of the hadron, for example its mass, 
electric charge, spin, isospin and strangeness. They are all characteristics that can 
be observed from outside. The quark model cannot tell us about the inner struc- 
ture of the hadron. It cannot tell us whether an elementary particle with fractional 
charge really exists inside the hadron. In order to investigate the inner structure of 
the hadron, we need a probe that can go inside the hadron and show us its struc- 
ture. Qualifications for the probe include (1) its own properties are well known, (2) 
it has small size and (3) it can penetrate deep inside the hadron. From this point of 
view, leptons are useful. One may compare this with the situation at the
beginning of the 20th century, when there were several models of atomic structure:
the $\alpha$ particle was used as a probe, and the investigation of scattering patterns from
a gold target led to the Rutherford model being established.

In the 1960s, the world of particle physics was in confusion. In the field of strong 
interactions, confidence in field theory was weak and the idea of particle democracy 
based on the analyticity of the scattering amplitude was at its peak. Above all the 
bootstrap theory told us that all particles are equal, no elementary particles exist 
and that they themselves are building blocks of other particles, namely they are 
their own fundamental constituents. Part of the reasoning came from an experi- 
mental observation of the nucleon form factor. A series of experiments by Hofs- 
tadter et al. [81, 206] showed that, unlike the atom, the nucleon does not have a 
core and the whole structure is soft like a cloud or jelly. There was no reason to 
believe in the existence of a substructure inside the nucleon. Unlike 50 years pre- 
viously, this time Thomson's viewpoint seemed more pertinent than Rutherford's. 
In addition to denying subatomic structure, particle physicists at that time found 
the unusual properties of the quark, such as fractional charge, incompatible with 
their common sense. Even Gell-Mann, the quark proposer himself, said for a while 
that it should be considered as a mathematical tool rather than a real model. 

The situation changed drastically when deep inelastic scattering data were pre- 
sented in 1969. This was the Rutherford experiment brought up to date in the sense 
that it clarified the parton structure of the nucleon, namely, that the nucleon is composed
of point-like particles. A series of deep inelastic scattering data, beginning with
those from the MIT-SLAC group, clarified that the nucleon is a quark composite 
and that the quark really has fractional charge, which paved the way for quantum 
chromodynamics (QCD) to appear as the theory of the strong interaction. 

This chapter explains the process of clarifying the inner structure of the nucleon 
to be a composite of point-like particles generically called partons, and the role of 
deep inelastic scattering of electrons (and muons) and neutrinos in identifying the 
partons as quarks and gluons. 



Form Factor 

Momentum Transfer as a Measure of the Size We Can Probe The electron is the best 
known and most easily obtainable probe of all time. In order to use the electron as 
a probe, we need to know the formula for its reaction with other particles. The basic 
equation is the Rutherford scattering formula introduced in Sect. 6.3.1: 
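$$\frac{d\sigma}{d\Omega} = \frac{m^2}{4\pi^2}\,|V_{fi}|^2 = \frac{4 m^2 Z^2 \alpha^2}{Q^4}, \qquad V_{fi} = \int d^3r\, V(\mathbf{r})\, e^{i\mathbf{Q}\cdot\mathbf{r}}, \qquad \mathbf{Q} = \mathbf{p}_i - \mathbf{p}_f$$   (17.1)

Here m is the electron mass and Q is the momentum that the electron gives to the target,
or the strength of the impact, and is called the momentum transfer. Equation (17.1) holds
when the target is a point particle. If the target is spread and has a charge distribution
$\rho(\mathbf{r})$, the potential is modified to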

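$$V(\mathbf{r}) \to V'(\mathbf{r}) = \int d^3r'\, V(\mathbf{r}-\mathbf{r}')\,\rho(\mathbf{r}')$$   (17.2)

and the matrix element to

$$V_{fi} \to V'_{fi} = \int d^3r\, V'(\mathbf{r})\, e^{i\mathbf{Q}\cdot\mathbf{r}} = \int d^3(r-r')\, V(\mathbf{r}-\mathbf{r}')\, e^{i\mathbf{Q}\cdot(\mathbf{r}-\mathbf{r}')} \times \int d^3r'\, \rho(\mathbf{r}')\, e^{i\mathbf{Q}\cdot\mathbf{r}'} = V_{fi}\, F(Q^2)$$   (17.3a)

$$F(Q^2) = \int d^3r\, \rho(\mathbf{r})\, e^{i\mathbf{Q}\cdot\mathbf{r}}$$   (17.3b)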



where, for simplicity, the distribution was assumed to be spherically symmetric.
$F(Q^2)$ is called the form factor and is a measure of the target spread. Normalizing
the charge distribution by

$$\int d^3r\, \rho(\mathbf{r}) = 1$$   (17.4)

we have

$$F(0) = 1$$   (17.5)

From the above discussion, we know that the cross-section formula for an extended
target is obtained from that for the point target by multiplying by the form factor squared:

$$d\sigma(\theta) = d\sigma_{\rm pt}\, |F(Q^2)|^2$$   (17.6)

The above equation says that if we know the cross-section formula for a point target,
we can obtain information about the target spread by measuring the scattering
pattern. As is clear from Eq. (17.3b), the part of $\rho(\mathbf{r})$ where $\mathbf{Q}\cdot\mathbf{r} \gg 1$ does not
contribute to the integral because of violent oscillation due to $\exp(i\mathbf{Q}\cdot\mathbf{r})$. This
means that the form factor is a measure of the total charge integrated over the region
$r \lesssim 1/Q$. In other words, the electron has penetrated to the depth given by $r \sim 1/Q$.
The maximum value of the momentum transfer Q gives the limit to which we can
probe the inner structure of the target.

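Target Size When $\mathbf{Q}\cdot\mathbf{r} \ll 1$, the form factor can be expanded as

$$F(Q^2) = \int d^3r\, \rho(\mathbf{r}) \left[ 1 + i\mathbf{Q}\cdot\mathbf{r} - \tfrac{1}{2}(\mathbf{Q}\cdot\mathbf{r})^2 + \cdots \right] = 1 - \frac{Q^2}{6}\langle r^2 \rangle + \cdots$$   (17.7a)

$$\langle r^2 \rangle = \int d^3r\, \rho(\mathbf{r})\, |\mathbf{r}|^2 = -6 \left.\frac{dF}{dQ^2}\right|_{Q^2=0}$$   (17.7b)

The term linear in $\mathbf{Q}\cdot\mathbf{r}$ vanishes for a spherically symmetric distribution. This means
that if we can measure the cross section only for small values of $|\mathbf{Q}|$, the
information we can obtain is limited to the total charge ($Z\alpha$) and the size of its
spread. The larger the value of $|\mathbf{Q}|$ we can obtain, i.e., the larger the energy and
scattering angle, the deeper we can probe the inner structure.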

Examples of Form Factors It helps to understand the physical meaning of the form 
factor if we know the spectral shape of typical spatial distributions. We list a few 
representative distributions. 
Spherical Distribution $\rho(r) = \rho_0$ for $r < R$ and $\rho(r) = 0$ for $r > R$ (Fig. 17.1)

$$F(Q^2) = \frac{3}{(QR)^3} \left[ \sin(QR) - QR \cos(QR) \right]$$   (17.8a)

This is similar to light going through a pinhole making several bright and dark 
rings and is called a diffraction pattern. It is a typical wave characteristic that com- 
monly occurs on going around behind a clear-cut boundary. 
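
The position of the dark rings can be made concrete numerically. The sketch below assumes the standard form factor of a uniformly charged sphere, $F = 3[\sin(QR) - QR\cos(QR)]/(QR)^3$, and locates its first zeros by bisection; the function names are illustrative and not taken from the text.

```python
import math

def sphere_form_factor(x):
    """Form factor of a uniformly charged sphere as a function of x = Q*R."""
    if x < 1e-4:
        return 1.0 - x * x / 10.0  # small-x expansion avoids 0/0
    return 3.0 * (math.sin(x) - x * math.cos(x)) / x**3

def bisect_zero(f, lo, hi, tol=1e-10):
    """Find a root of f in [lo, hi], assuming f changes sign there."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if f_lo * f(mid) <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f(mid)
    return 0.5 * (lo + hi)

# The zeros (dark rings) sit where tan(QR) = QR; scan for sign changes of F.
zeros = []
step = 0.1
x = step
while x < 12.0 and len(zeros) < 3:
    if sphere_form_factor(x) * sphere_form_factor(x + step) < 0:
        zeros.append(bisect_zero(sphere_form_factor, x, x + step))
    x += step
print(zeros)
```

The first zero lies near QR = 4.49, so a measured position of the first diffraction minimum gives a direct estimate of the radius R.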

Nuclear Charge Distribution The nucleus looks like a blurred sphere with radius
$R = r_0 A^{1/3}$. Figure 17.2b shows scattering data for an electron incident on calcium.
It resembles the diffraction pattern, but somewhat smoothed. A charge distribution
that reproduces the observed pattern well is depicted in Fig. 17.2a. R represents the
nuclear size, and $t = (4 \ln 3)\,a$ the thickness of the surface or the degree of blurriness.
One sees that the valleys become shallower as the surface is blurred. Fitting parameters
to the data, we obtain, roughly, $r_0 \simeq 1.03 \times 10^{-13}$ cm, $t \simeq 2.4 \times 10^{-13}$ cm.

Exponential Distribution When the distribution becomes denser smoothly towards
the core, it may be represented by an exponential or a Gaussian:

$$\rho(r) = \rho_0 \exp(-r/R)$$   (17.9a)
$$F(Q^2) = \frac{1}{(1 + Q^2 R^2)^2}$$   (17.9b)

A good example is the sun, which is a gas sphere with intensity slowly rising towards
the core. Note: The reader is reminded that the Yukawa potential $V(r) \sim e^{-\mu r}/r$
(which is a singular distribution and hence cannot be normalized) has the
Fourier transform $V(Q^2) \sim 1/(1 + Q^2/\mu^2)$, which resembles the exponential form
factor but is softer.
Figure 17.2 (a) $\rho(r) = \rho_0/[1 + \exp((r - R)/a)]$, $t = (4\ln 3)\,a$. (b) e-Ca scattering data
(750 MeV) with the best fit. From the fitted curves, $R = r_0 A^{1/3}$, $r_0 = 1.03 \times 10^{-13}$ cm,
$t = 2.4 \times 10^{-13}$ cm are obtained [57].



Gaussian Distribution 

$$\rho(r) = \rho_0 \exp(-r^2/2\sigma^2)$$   (17.10a)
$$F(Q^2) = \exp(-Q^2\sigma^2/2)$$   (17.10b)



Problem 17.1 

Derive Eqs. (17.9) and Eqs. (17.10). 
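
Before attempting the analytic derivation, the two results can be checked numerically. The sketch below is only an illustration: it assumes the angular integration has already been done, so that $F(Q^2) = \int r^2 \rho(r)\, [\sin(Qr)/(Qr)]\, dr \,/ \int r^2 \rho(r)\, dr$, and uses a plain midpoint rule. The names `form_factor`, `rho_exponential` and `rho_gaussian`, and the unit choices R = 1 and $\sigma$ = 1, are ours.

```python
import math

def form_factor(rho, Q, r_max, n=20000):
    """Numerically evaluate F(Q^2) for a spherically symmetric density rho(r).

    Dividing by the integral of r^2 rho(r) enforces F(0) = 1 whatever the
    normalization of rho. Midpoint rule on [0, r_max]; requires Q > 0.
    """
    h = r_max / n
    numerator = 0.0
    normalization = 0.0
    for i in range(n):
        r = (i + 0.5) * h
        weight = r * r * rho(r)
        normalization += weight
        numerator += weight * math.sin(Q * r) / (Q * r)
    return numerator / normalization

R = 1.0      # range of the exponential distribution (arbitrary units)
SIGMA = 1.0  # width of the Gaussian distribution (same units)

def rho_exponential(r):
    return math.exp(-r / R)

def rho_gaussian(r):
    return math.exp(-r * r / (2.0 * SIGMA**2))

for Q in (0.5, 1.0, 2.0):
    f_exp = form_factor(rho_exponential, Q, r_max=60.0)
    f_gauss = form_factor(rho_gaussian, Q, r_max=12.0)
    # Compare with the closed forms 1/(1 + Q^2 R^2)^2 and exp(-Q^2 sigma^2 / 2).
    print(Q, f_exp, 1.0 / (1.0 + Q * Q * R * R) ** 2,
          f_gauss, math.exp(-Q * Q * SIGMA**2 / 2.0))
```

Each printed row shows the numerical value next to the closed form for the same distribution, so any disagreement would point to an error in the derivation.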



17.3 

e-p Elastic Scattering 



Electromagnetic Form Factor of the Nucleon The proton has spin s = 1/2 and a
spatially spread structure. If it were a point particle, namely a Dirac particle, the
elastic scattering cross section with an electron

$$e(\omega, \mathbf{k}) + p(E, \mathbf{p}) \to e(\omega', \mathbf{k}') + p(E', \mathbf{p}')$$   (17.11)

could be obtained from the e-$\mu$ scattering cross section in the laboratory (LAB)
frame [Eq. (7.20)] by replacing the target mass by the proton mass M:

$$\left.\frac{d\sigma}{d\Omega}\right|_{\rm LAB} = \sigma_M\, \frac{\omega'}{\omega} \left[ 1 + \frac{Q^2}{2M^2} \tan^2\frac{\theta}{2} \right]$$   (17.12a)

$$\sigma_M = \frac{4\alpha^2 \omega'^2}{Q^4} \cos^2\frac{\theta}{2}, \qquad q^2 = -Q^2 = (k - k')^2 \simeq -2\omega\omega'(1 - \cos\theta)$$   (17.12b)

$$\omega' = \frac{\omega}{1 + \omega(1 - \cos\theta)/M}$$   (17.12c)

$Q^2 = -q^2$ is a relativistic version of $\mathbf{Q}^2$. The formula is valid for the electron
mass set to m = 0. The first term in [...] represents scattering by the electric
Coulomb field created by the target and the second by the magnetic field. When
the target spread is taken into account, the charge distribution and the magnetic
moment distribution are generally different and two different form factors need to
be introduced. In order to see how to include them, we express the most general
proton current in the form

$$e J_p^\mu(x) = e \langle p'|J^\mu(x)|p \rangle = e J^\mu(p',p)\, e^{iq\cdot x} = e\, \bar{u}(p') \left[ F_1(q^2)\gamma^\mu + \frac{\kappa}{2M} F_2(q^2)\, i\sigma^{\mu\nu} q_\nu \right] u(p)\, e^{iq\cdot x}$$   (17.13)

From Lorentz invariance alone, other terms proportional to $q^\mu = (p' - p)^\mu$, $(p' + p)^\mu$
are allowed. However, using current conservation $\partial_\mu J^\mu = 0$ and the Dirac
equation that the plane wave functions u(p), u(p') satisfy, it can be proved that the
above two terms are enough.



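Problem 17.2

Show that a term proportional to $q^\mu = (p' - p)^\mu$, if it exists in Eq. (17.13), is
forbidden by current conservation. Next, prove the equality, which is valid when
sandwiched between $\bar{u}(p')$ and u(p):

$$\gamma^\mu = \frac{1}{2M} \left[ (p' + p)^\mu + i\sigma^{\mu\nu} q_\nu \right]$$   (17.14)

which is referred to as the Gordon equality.

The Gordon equality shows that the term proportional to $(p' + p)^\mu$ is already
included in Eq. (17.13). To understand the physical meaning of Eq. (17.13), rewrite

$$J^\mu(p',p) = \bar{u}(p') \left[ F_1 \frac{(p' + p)^\mu}{2M} + (F_1 + \kappa F_2) \frac{i\sigma^{\mu\nu} q_\nu}{2M} \right] u(p)$$   (17.15)

and take the nonrelativistic limit [$p'^\mu, p^\mu \to (M, \mathbf{0})$]. Then, expressing $A_\mu = (\phi, -\mathbf{A})$,

$$(p' + p)^\mu A_\mu \to 2M\phi, \qquad i\sigma^{\mu\nu} q_\nu A_\mu \to i\sigma^{ij} q_j A_i \sim \boldsymbol{\sigma}\cdot\nabla\times\mathbf{A} = \boldsymbol{\sigma}\cdot\mathbf{B}$$   (17.16a)

If the form factors are normalized as $F_1(0) = 1$, $F_2(0) = 1$, we have

$$e J^\mu(p',p) A_\mu \to e\phi + (1 + \kappa) \frac{e}{2M}\, \boldsymbol{\sigma}\cdot\mathbf{B}$$   (17.16b)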
From this equation, we see that the first term $\sim \gamma^\mu$ contains the magnetic moment (Dirac
moment) interaction as well as the Coulomb interaction, but the contribution of
the anomalous magnetic moment is contained in the second term $\sim i\sigma^{\mu\nu} q_\nu$. As
the proton has charge e and magnetic moment $\mu_p = (1 + \kappa_p)e/2M$,

$$F_{1p}(0) = 1, \qquad F_{2p}(0) = 1, \qquad \kappa = \kappa_p$$   (17.17)

Similarly, as the neutron has total charge 0, although the inside distribution may
not vanish,

$$F_{1n}(0) = 0, \qquad F_{2n}(0) = 1, \qquad \kappa = \kappa_n$$   (17.18)

The experimental values are given by

$$\kappa_p = 1.79, \qquad \kappa_n = -1.91$$   (17.19)

Rosenbluth Formula Using the proton current Eq. (17.13) and the electron current

$$-e\, j^\mu(x) = -e \langle k'|j^\mu(x)|k \rangle = -e\, \bar{u}(k')\gamma^\mu u(k)\, e^{-iq\cdot x}, \qquad q = k - k'$$   (17.20)



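the electron-proton elastic scattering amplitude is given in the one-photon-exchange
approximation as

$$iT_{fi} = -e^2 \int d^4x\; j_\mu(x) \left( \frac{-i}{q^2} \right) J^\mu(x)$$   (17.21)

and the cross section becomes

$$\left.\frac{d\sigma}{d\Omega}\right|_{\rm LAB} = \sigma_M\, \frac{\omega'}{\omega} \left[ F_1^2 + \frac{\kappa^2 Q^2}{4M^2} F_2^2 + \frac{Q^2}{2M^2} (F_1 + \kappa F_2)^2 \tan^2\frac{\theta}{2} \right]$$   (17.22)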
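Defining

$$G_E(q^2) = F_1(q^2) + \frac{\kappa q^2}{4M^2} F_2(q^2), \qquad G_M(q^2) = F_1(q^2) + \kappa F_2(q^2)$$   (17.23)

the cross section Eq. (17.22) is rewritten as

$$\left.\frac{d\sigma}{d\Omega}\right|_{\rm LAB} = \sigma_M\, \frac{\omega'}{\omega} \left[ \frac{G_E^2 + \tau G_M^2}{1 + \tau} + 2\tau G_M^2 \tan^2\frac{\theta}{2} \right]$$   (17.24a)

$$\tau = \frac{Q^2}{4M^2} = -\frac{q^2}{4M^2}$$   (17.24b)

$$Q^2 = -(k - k')^2 = 4\omega\omega' \sin^2\frac{\theta}{2}, \qquad \omega' = \frac{\omega}{1 + (2\omega/M)\sin^2(\theta/2)}$$   (17.24c)

This is referred to as the Rosenbluth formula. In the limit $q^2 \to 0$, $G_E \to 1$ and
$G_M \to 1 + \kappa$, so $G_E(q^2)$, $G_M(q^2)$ are considered as expressing the electric charge and
magnetic moment distributions, respectively.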
Early data on elastic e-p scattering came from electron beams at Stanford University
and SLAC (Stanford Linear Accelerator Center). As the cross section has the form

$$\frac{1}{\sigma_M}\, \frac{\omega}{\omega'}\, \frac{d\sigma}{d\Omega} = A(Q^2) + B(Q^2) \tan^2\frac{\theta}{2}$$   (17.25)

by changing the angle $\theta$ at fixed $Q^2$, A and B and consequently $G_E$, $G_M$ can be obtained
as functions of $Q^2$. Figure 17.3 shows data on the proton magnetic moment.
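
The separation procedure just described amounts to a straight-line fit. The following is a minimal sketch, assuming the Rosenbluth bracket $(G_E^2 + \tau G_M^2)/(1 + \tau) + 2\tau G_M^2 \tan^2(\theta/2)$ with $\tau = Q^2/4M^2$; the pseudo-data are generated from a dipole shape purely for illustration, and all function names are ours. In a real measurement, changing $\theta$ at fixed $Q^2$ also requires changing the beam energy, which the sketch ignores by working directly with the bracket.

```python
import math

M_PROTON = 0.938272  # proton mass in GeV
MU_P = 2.79          # proton magnetic moment in nuclear magnetons, 1 + kappa_p

def dipole(Q2):
    """Dipole shape 1/(1 + Q^2/0.71)^2 with Q^2 in (GeV/c)^2."""
    return 1.0 / (1.0 + Q2 / 0.71) ** 2

def reduced_cross_section(Q2, theta, GE, GM):
    """Bracket of the Rosenbluth formula: (dsigma/dOmega) / (sigma_M * omega'/omega)."""
    tau = Q2 / (4.0 * M_PROTON**2)
    return ((GE**2 + tau * GM**2) / (1.0 + tau)
            + 2.0 * tau * GM**2 * math.tan(theta / 2.0) ** 2)

def rosenbluth_separation(Q2, thetas, reduced):
    """Least-squares line reduced = A + B tan^2(theta/2); returns (GE^2, GM^2)."""
    xs = [math.tan(t / 2.0) ** 2 for t in thetas]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(reduced) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, reduced))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    tau = Q2 / (4.0 * M_PROTON**2)
    GM2 = slope / (2.0 * tau)                  # B = 2 tau GM^2
    GE2 = intercept * (1.0 + tau) - tau * GM2  # A = (GE^2 + tau GM^2)/(1 + tau)
    return GE2, GM2

Q2 = 1.0  # (GeV/c)^2
GE_true, GM_true = dipole(Q2), MU_P * dipole(Q2)
thetas = [math.radians(a) for a in (20, 40, 60, 90, 120)]
pseudo_data = [reduced_cross_section(Q2, t, GE_true, GM_true) for t in thetas]
GE2, GM2 = rosenbluth_separation(Q2, thetas, pseudo_data)
print(math.sqrt(GE2), GE_true, math.sqrt(GM2), GM_true)
```

With noise-free pseudo-data the fit returns the input form factors; with real data the slope and intercept carry correlated errors, which is why the magnetic term is usually better determined at large $Q^2$.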

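The observed data can be fitted well with empirical formulae in the range
$0 < Q^2 < 30\ ({\rm GeV}/c)^2$:

$$G_E^p(Q^2) = \frac{G_M^p(Q^2)}{\mu_p} = \frac{G_M^n(Q^2)}{\mu_n} = \frac{1}{\left[ 1 + Q^2[({\rm GeV}/c)^2]/0.71 \right]^2}$$   (17.26a)

$$G_E^n(Q^2) = 0$$   (17.26b)

where $\mu_p$, $\mu_n$ are the magnetic moments in units of the nuclear magneton.
Comparing the expression with Eq. (17.9), we see that the charge distribution inside the
neutron is neutral, as it is outside, and all other distributions are very smooth from
outside to the core and well represented by an exponential shape. This means the
size of the nucleon is about

$$\sqrt{\langle r^2 \rangle} \simeq 0.7 \times 10^{-13}\ {\rm cm} \sim 1/(2m_\pi)$$   (17.27)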



If the nucleon has a core and the core has charge $\lambda e$, the form factor should approach $\lambda$
asymptotically. However, the data do not show such a tendency and can be interpreted
as showing that the charge distribution is very soft right to the core.
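
The size estimate can be reproduced from the slope of the form factor at $Q^2 = 0$. The sketch below assumes the dipole shape with parameter 0.71 (GeV/c)$^2$ and the relation $\langle r^2 \rangle = -6\, dF/dQ^2$ at $Q^2 = 0$; the conversion constant and the pion mass are standard values inserted by us, not numbers from the text.

```python
import math

HBARC = 0.1973269804  # GeV fm, converts GeV^-1 to fm (1 fm = 1e-13 cm)
LAMBDA2 = 0.71        # dipole parameter in (GeV/c)^2
M_PION = 0.13957      # charged pion mass in GeV

def dipole(Q2):
    """Dipole form factor 1/(1 + Q^2/Lambda^2)^2."""
    return 1.0 / (1.0 + Q2 / LAMBDA2) ** 2

# <r^2> = -6 dF/dQ^2 at Q^2 = 0, estimated with a small forward difference.
eps = 1e-6
r2_numeric = -6.0 * (dipole(eps) - dipole(0.0)) / eps  # in GeV^-2
r2_exact = 12.0 / LAMBDA2                              # analytic slope of the dipole

rms_fm = math.sqrt(r2_numeric) * HBARC
print(f"rms radius = {rms_fm:.2f} fm")
print(f"1/(2 m_pi) = {HBARC / (2.0 * M_PION):.2f} fm")
```

The dipole slope gives an rms radius of about 0.8 fm, the same order as the $0.7 \times 10^{-13}$ cm scale set by $1/(2m_\pi)$.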




Figure 17.3 (a) Form factor of the proton magnetic moment [364]. (b,c) Expected charge
distributions of the proton and the neutron in early models.
Before the appearance of the quark model, the nucleon was considered to emit
and reabsorb pions:

$$p \leftrightarrow n + \pi^+, \qquad n \leftrightarrow p + \pi^-$$   (17.28)

In this picture, the charge distribution is expected to be of the form depicted in
Fig. 17.3b,c. The proton charge distribution will be positive at the center and the
neutron distribution is expected to be positive in the core and negative in the
periphery. The data do not support this concept. In the quark model, p = uud,
n = udd. Since both the charge and magnetic moment distributions are considered
to represent those of the constituent quarks, the similarity of $G_E^p$, $G_M^p$ and $G_M^n$ is
easily explained. The net charge of the neutron vanishes everywhere if the three
udd distributions are the same. As the three quarks are separate and do not make
any particular hard core if distributed smoothly, the exponential form and absence
of a hard core are also explained.



17.4 

Electron Proton Deep Inelastic Scattering 

17.4.1 

Cross-Section Formula for Inelastic Scattering 

In quasielastic scattering, where the final state is composed of a scattered electron
and a recoiling particle with mass $M_X$,

$$e(k) + p(p) \to e(k') + X(P_F), \qquad \text{generally } M_X \neq M_p$$   (17.29)

there is a one-to-one correspondence between the scattered electron angle $\theta$ and
the energy loss $\nu = \omega - \omega'$ if the incident energy $\omega$ and $M_X$ are fixed. Measurement
of one variable determines the other, too (Problem 17.3).



Problem 17.3

Fixing the incident electron energy on a target at rest and setting $W = M_X$, derive
the relation

$$W^2 = M^2 + 2M(\omega - \omega') - Q^2$$   (17.30)

between the momentum transfer $Q^2$ and the scattered energy $\omega'$. Since $Q^2$ is a function
of $\omega$, $\omega'$ and $\cos\theta$, this equation means the scattering angle $\theta$ is uniquely
determined once $\omega'$ is known. In particular, for elastic scattering (W = M), show that

$$\omega' = \frac{\omega}{1 + (2\omega/M)\sin^2(\theta/2)}$$   (17.31)

Assume the electron mass m = 0.
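
The kinematic relations of this problem are easy to explore numerically. The following is a minimal sketch, assuming a massless electron, a proton at rest, $Q^2 = 4\omega\omega'\sin^2(\theta/2)$, $\nu = \omega - \omega'$ and $W^2 = M^2 + 2M\nu - Q^2$; the function names are illustrative.

```python
import math

M = 0.938272  # proton mass in GeV

def dis_kinematics(omega, omega_prime, theta):
    """Return (Q2, nu, W, x) for electron energies omega -> omega_prime at lab angle theta (rad)."""
    Q2 = 4.0 * omega * omega_prime * math.sin(theta / 2.0) ** 2
    nu = omega - omega_prime
    W2 = M * M + 2.0 * M * nu - Q2
    x = Q2 / (2.0 * M * nu)
    return Q2, nu, math.sqrt(W2), x

def elastic_omega_prime(omega, theta):
    """Scattered energy for elastic scattering (W = M)."""
    return omega / (1.0 + (2.0 * omega / M) * math.sin(theta / 2.0) ** 2)

omega, theta = 10.0, math.radians(6.0)
w_elastic = elastic_omega_prime(omega, theta)
print(dis_kinematics(omega, w_elastic, theta))  # elastic point: W = M and x = 1
print(dis_kinematics(omega, 7.0, theta))        # an inelastic point at the same angle
```

At fixed beam energy and angle, lowering the scattered energy below the elastic value moves W continuously above M, which is the region explored by the inelastic measurements.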
For inelastic scattering, $W (= M_X > M)$ is arbitrary and takes continuous values
within the kinematically allowed region. In this region, the scattering angle $\theta$ and
the scattered energy $\omega'$ are independent. If we observe only scattered electrons, the
cross section has the generic form

$$\frac{d^2\sigma}{d\Omega\, d\omega'} = \sigma_M \left[ W_2(Q^2,\nu) + 2W_1(Q^2,\nu) \tan^2\frac{\theta}{2} \right]$$   (17.32)

Here, $W_1$, $W_2$ are called structure functions, analogous to form factors in elastic
scattering. Unlike the elastic case, they are functions of both $Q^2$ and $\nu$. Equation
(17.32) can be derived from Eq. (17.21). The transition amplitude for inelastic
scattering can be written as

$$iT_{fi} = i(2\pi)^4 \delta^4(k + p - k' - P_F)\; j_{e\mu}\, \frac{e^2}{q^2}\, J^\mu$$   (17.33a)

$$j_e^\mu = \bar{u}(k')\gamma^\mu u(k), \qquad J^\mu = \langle X(P_F)|J_{\rm EM}^\mu(0)|p(p) \rangle$$   (17.33b)

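Following the standard procedure, the cross section can be derived as

$$d\sigma = \frac{1}{4M\omega}\, \frac{e^4}{q^4}\, L_{\mu\nu} \left( 4\pi M\, W^{\mu\nu} \right) \frac{d^3k'}{(2\pi)^3\, 2\omega'}$$   (17.34a)

$$L_{\mu\nu} = \frac{1}{2} \sum_{\rm spin} \bar{u}(k')\gamma_\mu u(k)\, \bar{u}(k)\gamma_\nu u(k') = 2 \left[ k'_\mu k_\nu + k'_\nu k_\mu + (q^2/2)\, g_{\mu\nu} \right]$$   (17.34b)

$$W^{\mu\nu} = \frac{1}{4\pi M} \sum_X (2\pi)^4 \delta^4(p + q - P_F) \left[ \frac{1}{2} \sum_{\rm spin} \langle p|J^\mu(0)|X \rangle \langle X|J^\nu(0)|p \rangle \right]$$   (17.34c)

As $J^\mu$ is hermitian,

$$W^{\mu\nu} = W^{\nu\mu *}$$   (17.35)

$W^{\mu\nu}$ is a Lorentz tensor. Moreover, it is a function of q and p only, because the
proton spin average has been taken. Therefore it can be expanded in terms of the available
Lorentz tensors $g^{\mu\nu}$, $q^\mu q^\nu$, $p^\mu p^\nu$, $p^\mu q^\nu$, $p^\nu q^\mu$, $\epsilon^{\mu\nu\rho\sigma} p_\rho q_\sigma$. The current
conservation $\partial_\mu J^\mu = 0$ constrains

$$q_\mu W^{\mu\nu} = q_\nu W^{\mu\nu} = 0$$   (17.36)

Three Lorentz tensors satisfy this condition:

$$\left( -g^{\mu\nu} + \frac{q^\mu q^\nu}{q^2} \right), \qquad \left( p^\mu - \frac{p\cdot q}{q^2} q^\mu \right) \left( p^\nu - \frac{p\cdot q}{q^2} q^\nu \right), \qquad \epsilon^{\mu\nu\rho\sigma} p_\rho q_\sigma$$   (17.37)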



The third term is an antisymmetric tensor and has opposite parity relative to the 
first two. Since the electromagnetic interaction conserves parity, this term does not 
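enter. In fact, by multiplying by $L_{\mu\nu}$, it is easily proved that their product vanishes.
Therefore, $W^{\mu\nu}$ can be expressed as

$$W^{\mu\nu} = W_1 \left( -g^{\mu\nu} + \frac{q^\mu q^\nu}{q^2} \right) + \frac{W_2}{M^2} \left( p^\mu - \frac{p\cdot q}{q^2} q^\mu \right) \left( p^\nu - \frac{p\cdot q}{q^2} q^\nu \right)$$   (17.38)

which defines $W_1$ and $W_2$. Using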

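$$q^\mu L_{\mu\nu} = q^\nu L_{\mu\nu} = 0$$   (17.39)

we can show

$$L_{\mu\nu} W^{\mu\nu} = 4\omega\omega' \left[ W_2 \cos^2\frac{\theta}{2} + 2W_1 \sin^2\frac{\theta}{2} \right]$$   (17.40)

From this we finally obtain the desired formula

$$\frac{d^2\sigma}{d\Omega\, d\omega'} = \frac{4\alpha^2 \omega'^2}{Q^4} \left[ W_2(Q^2,\nu) \cos^2\frac{\theta}{2} + 2W_1(Q^2,\nu) \sin^2\frac{\theta}{2} \right] = \sigma_M \left[ W_2 + 2W_1 \tan^2\frac{\theta}{2} \right]$$   (17.41)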

Problem 17.4 
Prove Eq. (17.41). 
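
A numerical cross-check can guide the algebra of this problem. The sketch below assumes the lepton tensor $L_{\mu\nu} = 2[k'_\mu k_\nu + k'_\nu k_\mu + (q^2/2) g_{\mu\nu}]$ and the hadron tensor $W^{\mu\nu} = W_1(-g^{\mu\nu} + q^\mu q^\nu/q^2) + (W_2/M^2) P^\mu P^\nu$ with $P = p - (p\cdot q/q^2)\, q$, for a massless electron and a proton at rest, and compares their contraction with $4\omega\omega'[W_2\cos^2(\theta/2) + 2W_1\sin^2(\theta/2)]$. The numerical values of $W_1$, $W_2$ are arbitrary test inputs.

```python
import math

METRIC = (1.0, -1.0, -1.0, -1.0)  # diagonal of g_{mu nu} = g^{mu nu}

def dot(a, b):
    """Minkowski product of two contravariant four-vectors."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))

def contraction(omega, omega_prime, theta, W1, W2, M=0.938272):
    """Contract L_{mu nu} with W^{mu nu} for a massless electron and a proton at rest."""
    k = (omega, 0.0, 0.0, omega)
    kp = (omega_prime, omega_prime * math.sin(theta), 0.0, omega_prime * math.cos(theta))
    p = (M, 0.0, 0.0, 0.0)
    q = tuple(a - b for a, b in zip(k, kp))
    q2 = dot(q, q)
    # Projected proton momentum P = p - (p.q/q^2) q, which satisfies q.P = 0.
    P = tuple(pi - dot(p, q) / q2 * qi for pi, qi in zip(p, q))
    total = 0.0
    for mu in range(4):
        for nu in range(4):
            g = METRIC[mu] if mu == nu else 0.0
            # Indices of k and k' are lowered with the diagonal metric.
            L = 2.0 * (METRIC[mu] * kp[mu] * METRIC[nu] * k[nu]
                       + METRIC[nu] * kp[nu] * METRIC[mu] * k[mu]
                       + 0.5 * q2 * g)
            W = W1 * (-g + q[mu] * q[nu] / q2) + W2 / M**2 * P[mu] * P[nu]
            total += L * W
    return total

omega, omega_prime, theta = 10.0, 6.0, math.radians(20.0)
W1, W2 = 0.7, 1.3  # arbitrary test values
lhs = contraction(omega, omega_prime, theta, W1, W2)
rhs = 4.0 * omega * omega_prime * (W2 * math.cos(theta / 2.0) ** 2
                                   + 2.0 * W1 * math.sin(theta / 2.0) ** 2)
print(lhs, rhs)
```

The two numbers agree to rounding accuracy for any choice of energies and angle, which is a useful sanity check before carrying out the proof by hand.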



Problem 17.5 

Assuming the proton to be a structureless Dirac particle, and setting the final state
of the proton as p' (i.e. elastic scattering), calculate the structure functions by
inserting $J^\mu = \bar{u}(p')\gamma^\mu u(p)$ in Eq. (17.34c), or by comparing Eq. (17.12a) and
Eq. (17.41), and show that

$$W_2(Q^2,\nu) = \delta\left( \nu - \frac{Q^2}{2M} \right)$$   (17.42a)

$$W_1(Q^2,\nu) = \frac{Q^2}{4M^2}\, \delta\left( \nu - \frac{Q^2}{2M} \right)$$   (17.42b)
Hint: Use the following formula: 
17.4.2

Bjorken Scaling 

In 1969, the SLAC-MIT group carried out an experiment in which an electron beam
was incident on a proton target and the intensity of the scattered electrons was
measured. Figure 17.4 [149] shows some of the data: a differential cross section as a
function of the final hadronic mass W for a fixed incident energy $\omega$ of 10 GeV and
scattering angle $\theta = 6^\circ$. Here, one sees a sharp peak (scaled down by 1/8.5) at
the left side corresponding to elastic scattering, and a few other quasielastic peaks,
corresponding to resonance production

$$e + p \to e + N^*, \qquad W = M_{N^*}$$   (17.44)



Below the peaks one sees a broad hill-like bump, which constitutes inelastic scat- 
tering, where multiple particles are produced with invariant mass W continuously 
changing in the range allowed kinematically. One also refers to this region as the 



Data similar to those in Fig. 17.4, sliced in narrow bands of W and plotted
as a function of $Q^2$, are shown in Fig. 17.5 [76]. One sees that the cross section
of elastic scattering falls off rapidly as a function of $Q^2$, which is a manifestation
of the target being an extended object, but those of inelastic scattering at large W
(loosely dubbed deep inelastic scattering) show a relatively slow decrease.

Actually, such asymptotic behavior was expected by Bjorken on a theoretical basis [62].
A more quantitative description of the asymptotic behavior at large hadron
mass W is expressed as follows: in the limit

$$\omega,\ Q^2,\ \nu \to \infty \qquad \text{with } x \equiv Q^2/2M\nu \text{ fixed}$$   (17.45)

$$\nu W_2(Q^2,\nu) \to F_2(x), \qquad MW_1(Q^2,\nu) \to F_1(x)\ ^{1)}$$   (17.46)

Figure 17.4 $d^2\sigma/d\Omega\, d\omega'$ (E = 10 GeV, $\theta = 6^\circ$) plotted as a function of
W, the invariant mass of the hadronic final states [69, 149, 227]. The elastic peak
(W = M) is scaled down by 1/8.5.



Equation (17.45) is often referred to as the scaling limit. Equations (17.46) mean
that in the limit, the structure functions are not functions of two independent
variables but depend only on their ratio, which is referred to as Bjorken scaling. In
Figure 17.6, $MW_1$ and $\nu W_2$ are plotted as functions of the variable
$(W^2 + Q^2)/Q^2 = 1/x + M^2/Q^2$, which is $\simeq 1/x$ in the scaling limit. This variable is
used to take account of the mass effect in the low-energy region.

They do not depend on $Q^2$, as is shown in Fig. 17.7, a clear manifestation of
Bjorken scaling. Bjorken's reasoning for the scaling is as follows. Since $MW_1$, $\nu W_2$
are dimensionless quantities, physically fixed variables such as the proton mass
can be neglected in the scaling limit and they should depend only on an available
dimensionless variable, which is x.

1) These $F_1$, $F_2$ are not to be confused with the form factors that appeared in Eq. (17.13).

Figure 17.6 Bjorken scaling. (a) Structure functions plotted as a function of
$(W^2 + Q^2)/Q^2 = 1/x + M^2/Q^2$ (denoted as $\omega$ in the figure) [148, 149]. They do not
depend on $Q^2$, as is shown in Fig. 17.7. The plot consists of data with W > 1.8 GeV.

Figure 17.7 The structure function plotted against $Q^2$ [(GeV/c)$^2$]; it does not depend
on $Q^2$.

We will explain the physical meaning of the Bjorken scaling in terms of partons
in the next section. In summary, the soft structure inferred from observation of
elastic scattering had to be reconsidered in light of the deep inelastic scattering
data.


17.5 

Parton Model 

The parton model was proposed by Feynman to analyze high-energy hadron colli- 
sions [142, 143] and applied by Bjorken and Paschos to e-p inelastic scattering [65] 
(see also [394]). 

17.5.1 

Impulse Approximation 

Let us assume that the nucleon is a collection of point-like particles (partons) and
that the scattering cross section can be approximated as the sum of the scattering
by partons (impulse approximation). Denoting the partons by $q_i$, the impulse
approximation means

$$d\sigma\big|_{ep} = \sum_i d\sigma\big|_{eq_i}$$   (17.47)

Summation of the individual scattering events requires that each scattering is
independent of and does not affect the others. For this assumption to be valid, the
time $\tau_{\rm int}$ necessary to finish the interaction must be sufficiently short compared
to the time $\tau_{\rm free}$, which is a measure of how long each parton flies freely:

$$\tau_{\rm int} \ll \tau_{\rm free}$$   (17.48)

As the parton is inside the nucleon, it is bound to other partons. The motion of a
parton induces a drag effect on other partons due to forces working between them.
The drag time would be of the order of the inverse of the binding energy

$$\tau_{\rm free} \sim \frac{1}{\text{binding energy}} = \frac{1}{E_{\rm free} - E} \gtrsim \frac{1}{M}$$   (17.49)

The magnitude of the binding energy is expected to be at most the nucleon mass
(M). It is not small, but fixed, and does not increase with the nucleon energy
$E = \sqrt{p^2 + M^2}$. Viewed from the incident electron, the high-energy nucleon looks like
a flat pancake (Fig. 17.8a) because of Lorentz contraction. The contraction factor is
given by $1/\gamma \simeq M/E$.

Figure 17.8 (a) Because of Lorentz contraction, the target proton looks like a pancake
viewed from the incoming electron. The interaction time, which is the time for the electron
to pass through the nucleon, is much shorter than the partons' mutual interaction time.
Partons are free in this approximation. (b) Parton kinematics. Only a
parton with fractional momentum x of the parent proton interacts with the electron. Other
partons are spectators and pass through without interacting. Partons are considered free
before and after scattering. When they fly away beyond the hadron size, they transform
themselves into many hadrons (hadronization).

The interaction time is considered to be the time that the electron spends in the
nucleon or that it takes to pass through the nucleon. As the nucleon has the size
$\sim 1/m_\pi$ in its rest frame, the interaction time is given by

$$\tau_{\rm int} \sim \frac{1}{m_\pi}\, \frac{M}{E}$$   (17.50)

Consequently, for high energies where

$$E \gg M^2/m_\pi \simeq 6.3\ {\rm GeV} \quad \to \quad \tau_{\rm int} \ll \tau_{\rm free}$$   (17.51)

the condition Eq. (17.48) for the impulse approximation will be satisfied.
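
The energy scale in this condition is a one-line estimate. The sketch below assumes $\tau_{\rm int} \sim (1/m_\pi)(M/E)$ and $\tau_{\rm free} \sim 1/M$ in natural units, as argued above; the mass values are standard numbers supplied by us and the function name is illustrative.

```python
M_NUCLEON = 0.938272  # GeV
M_PION = 0.13957      # GeV

def time_ratio(E):
    """tau_int / tau_free for nucleon energy E (GeV), in natural units."""
    tau_int = (1.0 / M_PION) * (M_NUCLEON / E)  # contracted nucleon size
    tau_free = 1.0 / M_NUCLEON                  # inverse of the maximal binding energy
    return tau_int / tau_free

E_threshold = M_NUCLEON**2 / M_PION  # energy at which the two time scales are equal
print(f"M^2/m_pi = {E_threshold:.1f} GeV")
for E in (10.0, 100.0, 1000.0):
    print(E, time_ratio(E))
```

The ratio falls as 1/E, so the impulse approximation improves steadily once the nucleon energy is well above a few GeV.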

Next we rewrite the condition in terms of the parton variables. Because of Lorentz
contraction, the longitudinal momentum $p_\parallel$ of the nucleon is much larger than the
transverse momentum $p_\perp$ of the parton inside the nucleon. Therefore, for the
i-th parton's longitudinal momentum we have $p_{i\parallel} \gg p_{i\perp} \sim \langle p_\perp \rangle$, where $\langle p_\perp \rangle$
is the average transverse momentum of the parton inside the nucleon. Its value
(~330 MeV/c [335]) can be estimated from transverse momentum distributions of
the secondary particles in p-p collisions (see Fig. 13.30). When the electron collides
with the proton, the latter is excited to a virtual state with mass W. The inverse of
the energy transfer to the nucleon is a measure of the interaction time, $\tau_{\rm int} \sim 1/q^0$,
and the energy transfer is given by

$$q^0 \simeq \sqrt{p^2 + W^2} - \sqrt{p^2 + M^2} \simeq \frac{W^2 - M^2}{2p} = \frac{2M\nu - Q^2}{2p}$$   (17.52)



where we have used Eq. (17.30) in deriving the last equality. On the other hand,
if we set the longitudinal parton momentum as a fraction x of that of the parent nucleon
($p_{i\parallel} = x_i p_\parallel$), the time for which the nucleon can be separated into free partons
would be given by

$$\tau_{\rm free} \sim \frac{1}{|\Delta E|}, \qquad \Delta E = E - \sum_i E_i = \sqrt{p^2 + M^2} - \sum_i \sqrt{(x_i p)^2 + p_{i\perp}^2 + \mu_i^2}$$   (17.53)

where $\mu_i$ is the mass of the i-th parton and we have attached subscript 1 to the
parton that interacts with the electron. If the nucleon has infinite energy [or
approximately higher energy than that satisfying Eq. (17.51)], we may consider that
all the partons move in the same direction as the parent nucleon (i.e. $x_i > 0$). Since
the sum of momenta of all the partons has to be equal to that of the nucleon, we use

$$\sum_i x_i = 1, \qquad E_i \simeq x_i p + \frac{\mu_i^2 + p_{i\perp}^2}{2 x_i p}$$   (17.54)

and calculate $\Delta E$, which gives

$$\Delta E \simeq \frac{1}{2p} \left[ M^2 - \sum_i \left( \frac{\mu_i^2 + p_{i\perp}^2}{x_i} \right) \right]$$   (17.55)

The above equation means that the impulse approximation is not valid for $x_i \to 0$,
$x_i \to 1$. A measure for the approximation to be valid is given by

$$1 \gg x_i \gg \mu/p,\ p_\perp/p$$   (17.56)

The parton has energy-momentum given by

$$p_i^\mu = \left( \sqrt{(x_i p_\parallel)^2 + p_{i\perp}^2 + \mu_i^2},\ \mathbf{p}_{i\perp},\ x_i p_\parallel \right)$$   (17.57)

but within the above conditions Eq. (17.57) can be approximated by $p_i^\mu \simeq x_i p^\mu$.
Using the condition that the parton is free before and after the scattering ($p_i^2 = \mu_i^2$,
see Fig. 17.8b)

$$\mu_1^2 = (x_1 p)^2 = (q + x_1 p)^2$$   (17.58a)

$$\therefore \quad x \equiv x_1 = -\frac{q^2}{2(p\cdot q)} = \frac{Q^2}{2M\nu}$$   (17.58b)
Figure 17.9 Kinematics of deep inelastic scattering. The gray triangular area indicates
the allowed region. Elastic scattering is represented by the line with $W = M_p$ (x = 1) and
quasielastic by $W = M_{A1}, M_{A2}$. The darker gray half-oval corresponds to the deep
inelastic scattering (DIS) region. The higher the energy, the closer the DIS region comes
to the boundary.



In summary, conditions for the impulse approximation to hold are

$$E \gg M, \qquad Q^2 \gg M^2, \qquad 2M\nu \gg M^2$$   (17.59a)

$$0 < x = \frac{Q^2}{2M\nu} < 1$$   (17.59b)

These are the same conditions as for Bjorken scaling to hold. The scattering that
satisfies the conditions is called deep inelastic scattering. The coordinate frame
satisfying the conditions is called the infinite momentum ($p_\infty$) frame. Figure 17.9
illustrates the allowed region for $Q^2$, $2M\nu$, and how x, y, W are related to them.
The variable y is introduced for later applications. According to the observed data,
the scaling seems to hold already for Q > 1 GeV/c (Fig. 17.6b). We next investigate
the physical meaning of the structure functions.

17.5.2 

Electron-Parton Scattering 



Scaling Variables As will be clarified later, the partons consist of quarks and gluons.
The gluons are electrically neutral and do not contribute to electron-proton
scattering. Here we assume the parton is a quark. It is a point-like particle having
spin 1/2, mass $\mu$ and electric charge $q_{\rm parton} = \lambda_q e$. The scattering cross section
in the parton's rest frame is given by making the changes $M \to \mu$, $\alpha \to \lambda_q \alpha$ in
Eq. (17.12):

$$\left.\frac{d\sigma}{d\Omega}\right|_{\rm LAB} = \frac{4(\lambda_q \alpha)^2 \omega'^3}{Q^4\, \omega} \cos^2\frac{\theta}{2} \left[ 1 + \frac{Q^2}{2\mu^2} \tan^2\frac{\theta}{2} \right]$$   (17.60)

In reality, the parton is inside the nucleon with relative momentum given by
Eq. (17.59b). Therefore we need to define Lorentz-invariant variables and rewrite
Eq. (17.60) in terms of them. Let us first show that the variables so far treated
as those in the laboratory frame can, in fact, be considered as Lorentz-invariant
variables. Neglecting both proton and electron masses in the limit $E \gg M$, the
following equalities hold in the LAB frame:

$$s = (k + p)^2 \simeq 2(k\cdot p) = 2\omega M$$   (17.61a)

$$y = \frac{(p\cdot q)}{(p\cdot k)} = \frac{\nu}{\omega} = \frac{\omega - \omega'}{\omega}$$   (17.61b)

$$x = \frac{Q^2}{2M\nu} = \frac{Q^2}{sy}$$   (17.61c)

$$\sin^2\frac{\theta}{2} = \frac{Q^2}{4\omega\omega'} = \frac{M^2 xy}{s(1 - y)}$$   (17.61d)

One sees that variables in the laboratory frame [$\omega$, $\nu$, $\sin^2(\theta/2)$, etc.] can be
expressed in terms of Lorentz-invariant variables (s, x, y).
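
The change of variables can be verified with a short round trip. The sketch below assumes a massless electron, a proton at rest, and $s \simeq 2M\omega$ (the proton mass squared is neglected in s, as in the text); it computes (s, x, y) from lab quantities and then rebuilds the lab quantities from (s, x, y) alone. The function name and numerical inputs are illustrative.

```python
import math

M = 0.938272  # proton mass in GeV

def invariants(omega, omega_prime, theta):
    """Return (s, x, y) from lab quantities, neglecting M^2 in s and the electron mass."""
    Q2 = 4.0 * omega * omega_prime * math.sin(theta / 2.0) ** 2
    nu = omega - omega_prime
    s = 2.0 * M * omega
    y = nu / omega
    x = Q2 / (2.0 * M * nu)
    return s, x, y

omega, omega_prime, theta = 20.0, 12.0, math.radians(10.0)
s, x, y = invariants(omega, omega_prime, theta)

# Rebuild the lab variables from (s, x, y) alone.
omega_back = s / (2.0 * M)
nu_back = y * omega_back
sin2_back = M * M * x * y / (s * (1.0 - y))
print(omega_back, nu_back, sin2_back, math.sin(theta / 2.0) ** 2)
```

The rebuilt values coincide with the inputs, which confirms that the lab description and the invariant description carry the same information.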

Problem 17.6
Confirm

$$0 < \frac{Q^2}{2M\nu}\ (= x) < 1, \qquad 0 < \frac{2M\nu}{s}\ (= y) < 1$$   (17.62)



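Parton Distribution Function For elastic scattering in the laboratory frame (LAB),
$\theta$ and $\omega'$ are related as given in Eq. (17.31). Marking variables in the center of mass
frame (CM) with an asterisk, we can rewrite y in CM as

$$y = \frac{(p\cdot q)}{(p\cdot k)} = \frac{E^*(\omega^* - \omega'^*) - (-\mathbf{k}^*)\cdot(\mathbf{k}^* - \mathbf{k}'^*)}{E^*\omega^* - (-\mathbf{k}^*\cdot\mathbf{k}^*)} \simeq \frac{1 - \cos\theta^*}{2}$$   (17.63)

We obtain

$$\frac{d\sigma}{dy} = \omega \left.\frac{d\sigma}{d\omega'}\right|_{\rm LAB} = \frac{4\pi M^2}{s(1 - y)^2} \left.\frac{d\sigma}{d\Omega}\right|_{\rm LAB} = 4\pi \left.\frac{d\sigma}{d\Omega}\right|_{\rm CM}$$   (17.64)

The above equation holds for two-body scattering with target mass M. When we
apply it to the electron-parton two-body system, we make the replacements $M \to \mu$,
$s \to s_q = (k + p_q)^2$. Noting that for elastic scattering

$$Q^2 = 2\mu\nu = s_q y$$   (17.65)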
we rewrite Eq. (17.60) to obtain

$$\left.\frac{d\sigma}{dy}\right|_{eq} = \frac{4\pi \lambda_q^2 \alpha^2}{Q^4}\, s_q \left[ (1 - y) + \frac{y^2}{2} \right]$$   (17.66)

As the parton's momentum in the nucleon is given by xp, we have

$$s_q = (k + xp)^2 \simeq 2x(k\cdot p) \simeq xs$$   (17.67)

Writing the i-th parton's momentum distribution function in the nucleon as $q_i(x)$,
the probability that the momentum is in the interval from x to x + dx is given by
$q_i(x)dx$. Then the scattering cross section of the electron with the i-th parton in
the nucleon is given by

$$\left.\frac{d\sigma}{dy}\right|_{eq_i} = \frac{4\pi \alpha^2}{Q^4}\, s \left[ (1 - y) + \frac{y^2}{2} \right] \lambda_i^2\, x\, q_i(x)\, dx$$   (17.68)

Consequently, the electron-nucleon scattering cross section is obtained by
summing over all the partons and becomes

$$\left.\frac{d^2\sigma}{dx\, dy}\right|_{ep} = \frac{4\pi \alpha^2}{Q^4}\, s \left[ (1 - y) + \frac{y^2}{2} \right] \sum_i \lambda_i^2\, x\, q_i(x)$$   (17.69)



Note that the cross section is factorized as a product of two functions each
containing x or y only. We have derived the e-p scattering cross section as a sum of
e-q scatterings. Note that the partons are elastically scattered by the electron, but the
recoil parton is not an observable by itself. The quarks, also known as partons,
are confined, and when they come out of the nucleon they are observed as pions
and many other hadrons. However, characteristics of e-q elastic scattering can be
obtained by observing the behavior of the electron only. Note that Eq. (17.69) is
expressed only in terms of electron variables, which can be observed experimentally.
Rewriting the generic expression Eq. (17.41) for inelastic scattering in terms of x
and y (Problem 17.7), we have

$$\left.\frac{d^2\sigma}{dx\, dy}\right|_{\rm inel} = \frac{4\pi \alpha^2}{Q^4}\, s \left[ \nu W_2\, (1 - y) + 2xMW_1 \left( \frac{y^2}{2} \right) \right]$$   (17.70)

This equation becomes in the scaling limit

$$\left.\frac{d^2\sigma}{dx\, dy}\right|_{\rm inel} = \frac{4\pi \alpha^2}{Q^4}\, s \left[ F_2(x)(1 - y) + 2xF_1(x) \left( \frac{y^2}{2} \right) \right]$$   (17.71)

Comparing Eq. (17.71) with Eq. (17.69), we see that in the quark-parton model the
structure functions are given by

$$F_2(x) = x \sum_i \lambda_i^2\, q_i(x)$$   (17.72a)

$$F_1(x) = \frac{1}{2x} F_2(x)$$   (17.72b)

We have proved that by considering e-p inelastic scattering as a sum of e-q elastic
scatterings, the structure functions $MW_1(Q^2,\nu)$ and $\nu W_2(Q^2,\nu)$, both of which are
functions of two independent variables, become, in the scaling limit, functions of
$x = Q^2/2M\nu$ only. Bjorken scaling is evidence that the hadrons have substructure
and are made of point-like particles. We have also succeeded in finding a physical
interpretation of the structure functions in Bjorken scaling. They represent the
momentum distribution of partons in the nucleon.
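
To see how parton distributions translate into structure functions, consider the toy model below. It is a sketch under stated assumptions: the valence shapes $q(x) \propto x^{-1/2}(1 - x)^3$ are hypothetical and chosen only for illustration (they are not fitted to data), the proton is taken to contain two u quarks and one d quark with no sea quarks or gluons, and the cross section uses $Q^2 = sxy$. The relations implemented are $F_2(x) = x \sum_i \lambda_i^2 q_i(x)$ and $F_1 = F_2/2x$.

```python
import math

def beta(a, b):
    """Euler beta function B(a, b)."""
    return math.gamma(a) * math.gamma(b) / math.gamma(a + b)

# Hypothetical valence shapes q(x) = N x^(a-1) (1-x)^(b-1); exponents are illustrative only.
A_EXP, B_EXP = 0.5, 4.0
N_U = 2.0 / beta(A_EXP, B_EXP)  # normalized to two u quarks in the proton
N_D = 1.0 / beta(A_EXP, B_EXP)  # normalized to one d quark

PARTONS = [  # (charge lambda_i in units of e, distribution q_i(x))
    (+2.0 / 3.0, lambda x: N_U * x ** (A_EXP - 1.0) * (1.0 - x) ** (B_EXP - 1.0)),
    (-1.0 / 3.0, lambda x: N_D * x ** (A_EXP - 1.0) * (1.0 - x) ** (B_EXP - 1.0)),
]

def F2(x):
    """F2(x) = x * sum_i lambda_i^2 q_i(x)."""
    return x * sum(lam**2 * q(x) for lam, q in PARTONS)

def F1(x):
    """F1(x) = F2(x) / (2x), the relation that follows for spin-1/2 partons."""
    return F2(x) / (2.0 * x)

def d2sigma_dxdy(x, y, s, alpha=1.0 / 137.036):
    """Parton-model cross section in natural units (GeV^-2), with Q^2 = s x y."""
    Q2 = s * x * y
    return 4.0 * math.pi * alpha**2 / Q2**2 * s * ((1.0 - y) * F2(x) + x * y * y * F1(x))

for x in (0.1, 0.3, 0.6):
    print(x, F2(x), 2.0 * x * F1(x))
```

In this toy model the quarks carry only one third of the proton momentum, a reminder that the measured momentum sum is what reveals the presence of electrically neutral partons.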

Problem 17.7 

Show that Eq. (17.41) becomes Eq. (17.70) if one rewrites it in terms of x and y. 



17.6 

Scattering with Equivalent Photons 

In this section we try to interpret deep inelastic scattering in terms of equivalent 
photons, those emitted by the electron. It offers a different perspective and helps to 
give an intuitive interpretation of the picture. Changing the way of expression does 
not necessarily increase information, but in some cases it is easier to interpret. 
Besides, the viewpoint presented in this section can be directly extended to QCD 
applications. 

17.6.1 

Transverse and Longitudinal Photons 

The formalism we have developed so far is based on the assumption that one-pho- 
ton exchange gives a good approximation. Then what the nucleon sees is not the 
electron but a photon. If we take the view that the nucleon interacts with the pho- 
ton, the electron is just a supplier of photons. The cross section can be considered 
as a convolution of the photon-nucleon interaction cross section and the inten- 
sity of the photon provided by the electron integrated over the photon spectrum. 
The cross section formula expressed this way is called the Weizsäcker-Williams
approximation [78, 96].
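The interaction of the photon with a charged particle is given by

$$\mathcal{L}_{\rm INT} = -Qe\int d^4x\,A_\mu(x)\,J^\mu(x) \qquad (17.73)$$

the transition amplitude is given by

$$T_{fi} = (2\pi)^4\delta^4(q + p - P_F)\,\epsilon_\mu(\lambda)\,\langle P_F|J^\mu(0)|p\rangle \qquad (17.74)$$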
and the photon absorption cross section in the LAB frame can be written as



x 2 E (PF\^M)J"\p)(PF\erWr\p)* (17.75) 

where $W^{\mu\nu}$ is the same structure function for the e-p inelastic scattering as given
by Eq. (17.34c). K is the photon energy in the LAB frame, M the target mass and
$\epsilon_\mu(\lambda)$ the polarization vector of helicity $\lambda$.

There are two problems associated with the photon being virtual. One is the
appearance of an additional ($\lambda = 0$) polarization. The virtual photon is like a massive
vector particle but its four-momentum q is space-like ($q^2 < 0$). The polarization
vectors have to be orthogonal to the photon's energy-momentum vector q, namely,

$$q\cdot\epsilon(\lambda) = q_\mu\epsilon^\mu(\lambda) = 0, \qquad \lambda = \pm, 0 \qquad (17.76)$$

If the particle is a massive free vector particle, q is time-like (q 2 > 0) and all three 
polarization vectors satisfying Eq. (17.76) are space-like. But the virtual photon is 
space-like, and one of the polarization vectors must be time-like. Consequently, we 
define three polarization vectors as 

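$$\epsilon^\mu(\pm) = \mp(0, 1, \pm i, 0)/\sqrt{2} \qquad (17.77a)$$

$$\epsilon^\mu(0) = \frac{1}{\sqrt{-q^2}}\,(q^3, 0, 0, q^0) = \frac{1}{\sqrt{Q^2}}\left(\sqrt{\nu^2 - q^2},\, 0,\, 0,\, \nu\right) \qquad (17.77b)$$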



It is possible to make $\nu = 0$ by choosing a frame referred to as the Breit frame,
which we will discuss soon. In this frame

$$\epsilon^\mu(0) = (1, 0, 0, 0) \qquad (17.78)$$

For this reason, the photon with $\lambda = 0$ is sometimes called a scalar photon. 2) The
polarization vectors satisfy orthogonality and completeness conditions:

$$\epsilon(\lambda)\cdot\epsilon(\lambda')^* = \epsilon_\mu(\lambda)\,\epsilon^\mu(\lambda')^* = (-1)^\lambda\,\delta_{\lambda\lambda'} \qquad (17.79a)$$



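$$\sum_\lambda (-1)^{\lambda+1}\,\epsilon_\mu(\lambda)\,\epsilon_\nu(\lambda)^* = -\left(g_{\mu\nu} - \frac{q_\mu q_\nu}{q^2}\right) \qquad (17.79b)$$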



The metric is different from the ordinary definition of the completeness condition.
The metric used here takes into account that $\epsilon^\mu(0)$ is time-like [$\epsilon(0)\cdot\epsilon(0)^* = +1$].
See also arguments in Appendix C.
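The orthogonality statements above can be checked numerically. The following is a minimal sketch, assuming a photon moving along +z with energy transfer $\nu$ and $Q^2 = -q^2 > 0$; the function names are illustrative and not from the text. It builds the three polarization vectors of Eq. (17.77) and evaluates the products that appear in Eqs. (17.76) and (17.79a).

```python
import math

def minkowski_dot(a, b):
    """Return a . b* with metric (+, -, -, -)."""
    bc = [complex(v).conjugate() for v in b]
    return a[0] * bc[0] - a[1] * bc[1] - a[2] * bc[2] - a[3] * bc[3]

def polarization_vectors(nu, q2_big):
    """Photon momentum q and the vectors of Eq. (17.77).

    nu is the energy transfer, q2_big = Q^2 = -q^2 > 0.
    The photon direction (+z) is an assumption of this sketch.
    """
    q_abs = math.sqrt(nu ** 2 + q2_big)   # |q| follows from q^2 = -Q^2
    big_q = math.sqrt(q2_big)
    s = 1.0 / math.sqrt(2.0)
    q = (nu, 0.0, 0.0, q_abs)
    eps = {
        +1: (0.0, -s, -1j * s, 0.0),                # -(0, 1, +i, 0)/sqrt(2)
        -1: (0.0, +s, -1j * s, 0.0),                # +(0, 1, -i, 0)/sqrt(2)
        0: (q_abs / big_q, 0.0, 0.0, nu / big_q),   # scalar photon
    }
    return q, eps

q, eps = polarization_vectors(nu=3.0, q2_big=2.0)
for lam, vec in eps.items():
    # |q . eps| should vanish; eps . eps* should equal (-1)^lambda
    print(lam, abs(minkowski_dot(q, vec)), minkowski_dot(vec, vec).real)
```

The two transverse vectors come out space-like (norm -1) and the scalar one time-like (norm +1), which is the sign pattern the modified metric in the completeness condition accounts for.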

2) Confusingly though, it is conventionally referred to as the longitudinal photon. 




Problem 17.8 

Show that the polarization vector of the scalar photon can be expressed as 

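$$\epsilon^\mu(0) = a\left(p^\mu - \frac{(p\cdot q)}{q^2}\,q^\mu\right), \qquad a = \left(M^2 - \frac{(p\cdot q)^2}{q^2}\right)^{-1/2} \qquad (17.80)$$

where $p^\mu$ is the four-momentum of the target proton.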

The second problem is the definition of the incident photon energy K. For the
virtual photon, any variable that converges to the real photon energy in the limit
$Q^2 \to 0$ can be a candidate. Here we adopt a definition of the photon energy in
terms of the flux Eq. (6.84) [168]. 3)

$$4KM|_{\rm LAB} = 4\left[(q\cdot p)^2 - q^2p^2\right]^{1/2} = 4M(\nu^2 + Q^2)^{1/2} \qquad (17.82)$$
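As a small illustration, the flux definition of Eq. (17.82), $K = (\nu^2 + Q^2)^{1/2}$, can be compared with Hand's definition quoted in footnote 3. This is a sketch only; the proton mass value and the function names are assumptions made for the example.

```python
import math

M_PROTON = 0.938  # GeV, approximate proton mass (assumed value)

def k_flux(nu, q2_big):
    """Photon energy from the flux definition, Eq. (17.82)."""
    return math.sqrt(nu ** 2 + q2_big)

def k_hand(nu, q2_big, mass=M_PROTON):
    """Hand's definition, Eq. (17.81): K = nu - Q^2 / 2M."""
    return nu - q2_big / (2.0 * mass)

for q2_big in (0.0, 0.1, 1.0):
    print(q2_big, k_flux(5.0, q2_big), k_hand(5.0, q2_big))
```

Both definitions reduce to the real photon energy $\nu$ at $Q^2 = 0$ and move apart in opposite directions as $Q^2$ grows, which is why the choice is a convention.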



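Using Eqs. (17.75), (17.77) and (17.79), we can express the cross sections for
scattering by transverse and longitudinal photons as

$$\sigma_T = \frac{4\pi^2\alpha}{K}\,\frac{1}{2}\sum_{\lambda=\pm}\epsilon_\mu(\lambda)\,\epsilon_\nu(\lambda)^*\,W^{\mu\nu} = \frac{4\pi^2\alpha}{K}\,W_1 \qquad (17.83a)$$

$$\sigma_L = \frac{4\pi^2\alpha}{K}\,\epsilon_\mu(0)\,\epsilon_\nu(0)^*\,W^{\mu\nu} = \frac{4\pi^2\alpha}{K}\,W_L \qquad (17.83b)$$

$$W_L = \left(1 + \frac{\nu^2}{Q^2}\right)W_2 - W_1 \qquad (17.83c)$$

In the region where the scaling law holds

$$\sigma_T \to \frac{4\pi^2\alpha}{KM}\,F_1 \qquad (17.84a)$$

$$\sigma_L \to \frac{4\pi^2\alpha}{KM}\,F_L \qquad (17.84b)$$

$$F_L = \frac{F_2}{2x} - F_1 \qquad (17.84c)$$

Problem 17.9

Derive Eq. (17.83).

3) Hand's definition [194], which can be defined by observable variables of the final state only, is

$$K = (W^2 - M^2)/2M = \nu - Q^2/2M \qquad (17.81)$$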
17.6.2

Spin of the Target 



Callan-Gross Relation As is clear from Eq. (17.72), when the target has spin s =
1/2, the equality

$$F_2 = 2xF_1 \qquad (17.85)$$

holds, which is referred to as the Callan-Gross relation [85]. In this case, $\sigma_L/\sigma_T = 0$.
On the other hand, if the target has spin s = 0, there is no magnetic scattering,
which means $F_1 = 0$ or $\sigma_T/\sigma_L = 0$. In other words, interpreting the reaction as
that by the photon, the target spin can be easily recognized.
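The two limiting cases can be made concrete with a short sketch. It assumes the scaling-limit expressions of Eq. (17.84), so that the ratio of longitudinal to transverse cross sections is $F_L/F_1$ with $F_L = F_2/2x - F_1$; the function name and the sample numbers are illustrative only.

```python
def longitudinal_to_transverse(f1, f2, x):
    """sigma_L / sigma_T in the scaling limit, from Eq. (17.84)."""
    f_l = f2 / (2.0 * x) - f1
    return f_l / f1

# Spin-1/2 partons: F2 = 2 x F1, so the ratio vanishes.
print(longitudinal_to_transverse(f1=1.0, f2=0.5, x=0.25))

# A hypothetical 20% excess of F2 over 2 x F1 gives a ratio of about 0.2.
print(longitudinal_to_transverse(f1=1.0, f2=0.6, x=0.25))
```

For a spin-0 target $F_1 = 0$ and the ratio is undefined in this form; in that case one would quote $\sigma_T/\sigma_L = 0$ instead.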

Breit Frame An intuitive explanation can be given in the Breit frame (also called
the brick wall frame). In this frame (Fig. 17.10), the electron, moving in the x-z
plane, changes its momentum in the z-direction only, without changing $p_x$. In
other words, there is no energy transfer and the electron gives only recoil momentum
to the target. On the other hand, the target proton (or parton inside) comes in from
the +z direction, receives recoil momentum only and bounces back in the +z
direction. In this frame, both particles move as if they had bounced off a brick wall.
In terms of the virtual photon, it comes in from the -z direction and is absorbed by
the proton. Since a transversely polarized photon gives spin component $s_z = 1$ to
the target, an s = 0 target cannot absorb the photon and $\sigma_T = 0$. If the target has
s = 1/2, spin transfer $\Delta s_z = 1$ causes spin flip. If the photon is longitudinally
polarized, spin transfer $\Delta s_z = 0$ may also be possible, allowing the target to interact
with both photons. But notice that the electromagnetic interaction is of vector type,
which keeps the particle's chirality invariant. This means the particle preserves its
helicity (namely its spin has to be flipped, because its momentum is reversed) in the
mass-zero limit and forces $\sigma_L = 0$. Experimentally, partons in the proton have
dominant s = 1/2 components (Fig. 17.11) as expected.

Figure 17.10 In the Breit frame, the electron and the proton recoil from each other
without changing energy and $p_x$ but changing $p_z$, as if they had bounced off a brick
wall. The proton enters from the +z direction and recoils back in the +z direction,
while the photon ejected by the electron enters with transverse momentum q, which
is absorbed by the proton. As the electron's spin is 1/2 and the virtual photon's 1 (if
transverse wave) or 0 (if longitudinal wave), angular momentum conservation
dictates that only the transverse wave can be absorbed by the target parton.
Therefore $\sigma_L = 0$. On the other hand, if the target parton has spin 0, it can absorb
the longitudinal wave only.

Figure 17.11 The Callan-Gross equality means $R = \sigma_L/\sigma_T = 0$. Observation shows it is
generally true except in the region x < 0.1 [16, 59].

Problem 17.10 

When the target has mass $\mu$ or transverse momentum $p_T$, show that

$$\frac{\sigma_L}{\sigma_T} = \frac{4(\mu^2 + p_T^2)}{Q^2} \qquad (17.86)$$

Hint: Calculate the cross section Eq. (17.83) in the Breit frame. In this frame
the virtual photon has momentum only and no energy. The scalar polarization
vector is expressed in a simple form, too:

$$q^\mu = (0, 0, 0, \sqrt{Q^2}), \qquad \epsilon^\mu(0) = (1, 0, 0, 0) \qquad (17.87)$$

Note, if $\mu^2 + p_T^2 \sim O(0.25\ {\rm GeV}^2)$, then $\sigma_L/\sigma_T \sim O(1/Q^2)$. If they have x
dependence, Fig. 17.11 can be explained.

17.6.3 
Photon Flux 

Highlighting the role of the photon in the deep inelastic scattering, we also intro- 
duce the concept of the effective photon flux that the electron provides. 
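We can rewrite the cross section as

$$\frac{d^2\sigma}{d\Omega\,d\omega'}(ep \to e'X) = \Gamma\,(\sigma_T + \epsilon\,\sigma_L) \qquad (17.88a)$$

$$\Gamma = \frac{\alpha}{2\pi^2}\,\frac{K}{Q^2}\,\frac{\omega'}{\omega}\,\frac{1}{1-\epsilon}, \qquad \epsilon = \left[1 + 2\left(1 + \frac{\nu^2}{Q^2}\right)\tan^2\frac{\theta}{2}\right]^{-1} \qquad (17.88b)$$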

Using d 3 fe' = dk'dkj/jz = co(l — y)dydk\jn and Eqs. (17.61), the above expres- 
sion becomes in the infinite momentum frame 



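$$d\sigma(ep \to e'X) = \left[\sigma_T + \epsilon(y)\,\sigma_L\right]\Gamma\,d\Omega\,d\omega' \longrightarrow \sigma_T\,dP(y, p_T^2)\,dy \qquad (17.89a)$$

$$dP(y, p_T^2)\,dy = \frac{\alpha}{2\pi}\,P_{\gamma e}(y)\,\frac{dp_T^2}{p_T^2}\,dy, \qquad p_T^2 = (\omega'\sin\theta)^2 \qquad (17.89b)$$

One sees that the quantity $dP(y, p_T^2)\,dy$ can be interpreted as the photon flux
prepared by the electron. Equations (17.89) are called the Weizsäcker-Williams
formula [78, 96, 381, 390].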

Problem 17.11 
Derive Eqs. (17.88). 



Problem 17.12 
Derive Eqs. (17.89). 

The expression diverges for $p_T^2 \to 0$, which is the familiar Coulomb force
singularity at r = 0. Integration over $p_T^2$ can be carried out as follows. The denominator
$p_T^2$ is really $Q^2$ in the original expression, which is expressed in the infinite
momentum frame as

$$Q^2 = -(k - k')^2 \simeq \frac{p_T^2}{1-y} + \frac{y^2}{1-y}\,m_e^2 \qquad (17.90)$$

Therefore, we can set the lower limit of the $p_T^2$ integration to not 0 but $m_e^2y^2$ if the
divergence is to be avoided. The upper limit of the integration is given by $(p_T^2)_{\max} \simeq
(k'\theta_{\max})^2 \simeq \omega^2(1-y)^2\theta_{\max}^2$. Neglecting $\ln\theta_{\max}^2$ compared to $\ln(\omega^2/m_e^2)$, we have

$$\int d\ln p_T^2 \simeq 2\ln(\omega/m_e) \qquad (17.91)$$

which gives the integral form

$$\sigma(eN \to e'N') = \frac{\alpha}{\pi}\,\ln\frac{\omega}{m_e}\int\frac{1 + (1-y)^2}{y}\,\sigma(\gamma N \to N'; y)\,dy \qquad (17.92)$$
Equation (17.92) is also referred to as the Weizsäcker-Williams formula in integral
form.

The Weizsäcker-Williams formula considers electron-proton scattering as a two-
step process: the electron-photon and photon-proton reactions. The electron's role
is simply to provide the photon flux. The formula is valid only in the infinite
momentum frame but it becomes a very useful tool in QCD, where such frames are
routinely provided.
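The two-step picture can be turned into a small numerical sketch of the integral form. It assumes the flux factor $(\alpha/\pi)\ln(\omega/m_e)\,[1 + (1-y)^2]/y$ of the Weizsäcker-Williams integral, and it takes the photon-nucleon cross section as a user-supplied function of y. The constants, the function names and the constant toy cross section are assumptions of the example, not values from the text.

```python
import math

ALPHA = 1.0 / 137.036   # fine-structure constant
M_E = 0.511e-3          # electron mass in GeV

def splitting(y):
    """Photon spectrum shape [1 + (1 - y)^2] / y."""
    return (1.0 + (1.0 - y) ** 2) / y

def photon_flux(y, omega):
    """Equivalent-photon flux per unit y after the p_T^2 integration."""
    return (ALPHA / math.pi) * math.log(omega / M_E) * splitting(y)

def sigma_en(sigma_gamma, omega, y_min, y_max, steps=2000):
    """Midpoint-rule estimate of the integral over y.

    sigma_gamma(y) is the photon-nucleon cross section; its units
    carry over to the result.
    """
    h = (y_max - y_min) / steps
    total = 0.0
    for i in range(steps):
        y = y_min + (i + 0.5) * h
        total += photon_flux(y, omega) * sigma_gamma(y) * h
    return total

# Toy input: a constant photon-nucleon cross section of 1 (arbitrary units).
print(sigma_en(lambda y: 1.0, omega=20.0, y_min=0.1, y_max=0.9))
```

The lower cut on y is needed because the 1/y spectrum is not integrable down to y = 0; in practice it is set by the minimum photon energy for the reaction.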



17.7 

How to Do Neutrino Experiments 

Deep inelastic scattering by the electron clarified the parton structure of the nucle- 
on. We now want to do the same experiments with the neutrino to take advantage 
of its ability to distinguish the quark flavor. It took 25 years before the neutrino was 
detected after Pauli's prediction in 1930 and another 30 years before high-intensity
neutrino beams could be used for systematic investigations. The neutrino is the
only particle that participates exclusively in the weak interaction. 4) However, once it
became possible to do experiments with intense neutrino beams, they provided an 
ideal probe to investigate the characteristics of the weak interaction, which under 
normal circumstances is hidden behind the strong and/or electromagnetic inter- 
actions. The neutrino is electrically neutral, and because of its weak reaction rate 
with matter, neutrino reactions do not happen easily. To do neutrino experiments 
by actively detecting it, construction of a large accelerator and commensurate de- 
velopment of technology were indispensable. We describe briefly the essence of 
how to do neutrino experiments. 

17.7.1 
Neutrino Beams 

High-energy ($\gg$ 10 MeV) and high-intensity neutrino $\nu_\mu(\bar\nu_\mu)$ beams are mainly
obtained using

$$\pi^+(\pi^-) \to \mu^+(\mu^-) + \nu_\mu(\bar\nu_\mu)$$

$$K^+(K^-) \to \mu^+(\mu^-) + \nu_\mu(\bar\nu_\mu)$$

To obtain $\nu_e(\bar\nu_e)$

are used. For low-energy $\bar\nu_e$ neutrinos (1-4 MeV), $\beta$ decays of fission material
produced in atomic reactors are also used.

Gravity becomes important when 
In early experiments before about 1970, the requirement of a higher intensity
beam was much stronger than its quality. For this purpose, it was necessary to 
focus pions and kaons produced by the primary proton beam in the forward direc- 
tion. It is impossible to focus all of them but a horn nearly achieves it. For realistic 
explanations we take examples from Fermilab and CERN in the 1970s, which sup- 
plied primary protons with energy ~ 400 GeV. The horn is shaped as shown in 
Fig. 17.12a, with a cross-sectional view through its center shown in (b). When a 
large current (say ~ $10^4$ A) is supplied to the horn, a circular magnetic field outside
and surrounding the horn is generated. Such a large current can only be made by 
storing charge in a condenser (capacitor) bank and discharging it instantaneous- 
ly by shorting the circuit. Because of this, the beam has to be condensed in time 
(fast extraction), which makes the operation of neutrino experiments incompatible 
with others, because most of them require slow extraction to use time coincidence 
techniques to select reactions. In this sense, neutrino experiments are costly. 

The target is placed at the mouth of the horn, where it is irradiated by the proton 
beam. Produced secondary pions (and kaons) go forward, but those emitted at a 
large angle enter the magnetic field and are bent forward. The horn is designed to 
focus secondary particles emitted at almost any angle within a limit. The particles 
then go through a long (~ 350 m) tunnel, where they decay to produce neutrinos. 
Pions that did not decay are stopped by a thick shield at the end of the tunnel. Since 
hadrons interact strongly, a concrete or iron shield of a few meters is enough to 
absorb all of them. But muons interact only electromagnetically and lose typically
only ~ 1.8 MeV per g/cm$^2$. To filter muons of, for example, 200 GeV, it takes ~ $10^5$ g/cm$^2$ =
100 kg/cm$^2$, or about 120 m of iron. If cost matters, earth may be substituted, but its density is




Figure 17.12 (a) Principle of horn focusing. When current I is supplied to the
cylindrical horn, a circular magnetic field is produced, which focuses one sign of
charged pions in the forward direction. (b) Cross-sectional view of the horn.
(c) WBB (wide band beam) and NBB (narrow band beam) spectra of the neutrino
beam. NBB is produced by monochromatic secondary pions and kaons [352].
[Figure 17.13 labels: target, beam monitor, secondary beam transport, beam ion
chambers, beam dump, beam stopper, earth shield (muon filter), neutrino detector;
scale 500 m]

Figure 17.13 Layout of the dichromatic neutrino beam line at Fermilab circa 1970.
The primary proton beam hits the target and produces pions (and kaons). The pions
are made monochromatic by a beam focusing system consisting of dipole and
quadrupole magnets, then they are injected into the decay pipe, where they decay to
muons and neutrinos. Undecayed pions are absorbed by a beam dump and muons
are stopped by muon filters made of earth (or iron).



small (2-3 g/cm$^3$), and the necessary length would be ~ 3 times longer, which causes
a ~ 10-fold intensity loss [$\propto 1/({\rm distance})^2$]. The neutrino beam obtained in this way
has a wide spectrum and is called a wide band neutrino beam (WBB).
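The shielding estimate can be reproduced with a few lines of arithmetic. The sketch below uses the energy loss of 1.8 MeV per g/cm$^2$ quoted above; the densities (7.87 g/cm$^3$ for iron, 2.5 g/cm$^3$ for earth) and the function name are assumptions of the example, and the constant energy loss ignores radiative losses at high energy.

```python
DEDX = 1.8  # MeV per g/cm^2, the typical value quoted in the text

def filter_length_m(muon_energy_gev, density_g_cm3):
    """Length of absorber (in meters) needed to range out a muon,
    assuming a constant energy loss per unit column density."""
    column = muon_energy_gev * 1000.0 / DEDX     # required g/cm^2
    return column / density_g_cm3 / 100.0        # cm -> m

iron = filter_length_m(200.0, 7.87)    # assumed iron density
earth = filter_length_m(200.0, 2.5)    # assumed earth density
print(iron, earth)
print(earth / iron, (earth / iron) ** 2)   # length ratio and 1/r^2 flux penalty
```

With these inputs the iron filter comes out at the 100 m scale, the earth filter is about three times longer, and the inverse-square penalty on the neutrino flux at the detector is about a factor of ten, in line with the figures given in the text.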

A typical spectrum is shown in Fig. 17.12c. Oppositely charged particles are bent
in opposite directions and diverge, which can be used to separate $\nu$ from $\bar\nu$
somewhat, but particles in the forward direction inside the horn are not separated, which
is the reason for a large admixture of $\bar\nu$ in $\nu$. Figure 17.13 shows the layout of a
Fermilab neutrino beam. A dichromatic beam is described below but the overall layout
is similar to that of a WBB beam, except that the horn is replaced by a set of dipole 
and quadrupole magnets. 

As the technology advanced, one wanted to have a beam of good quality with good 
control of energy and intensity. For this purpose narrow band beams (NBBs) began 
to be used. The neutrino being neutral, its energy cannot be measured directly, 
but taking advantage of two-body decay [$\pi(K) \to \mu\nu$] in producing it, one can
determine its energy kinematically. If we mark variables in the CM frame with * 
the neutrino energy in the LAB frame is connected to those in the CM frame by 
Lorentz transformation: 



$$E_{\nu,{\rm LAB}} = \gamma\,(E_\nu^* + \beta p^*\cos\theta^*) \qquad (17.95a)$$

$$p^* = E_\nu^* = (M^2 - m_\mu^2)/2M \qquad (17.95b)$$

where M is the parent's ($\pi$ or K) mass and $\beta$, $\gamma$ are Lorentz factors that are
expressed in terms of the parent's energy (E) and momentum (P) as

$$\gamma = E/M, \qquad \beta = P/E \qquad (17.96)$$

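As the parent is spinless, the decay distribution in the CM frame is isotropic.
Therefore, the distribution of particles in the LAB frame is

$$\frac{dN}{N} = \frac{1}{4\pi}\,d\Omega^* = \frac{1}{2}\,d\cos\theta^* = \frac{dE_{\nu,{\rm LAB}}}{2\beta\gamma p^*} \qquad (17.97)$$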
Namely, the energy spectrum of the neutrino is flat between the minimum and the
maximum of its energy

$$\min(E_{\nu,{\rm LAB}}) = E_\nu^*\,\gamma(1-\beta) \simeq 0 \qquad (17.98a)$$

$$\max(E_{\nu,{\rm LAB}}) = E_\nu^*\,\gamma(1+\beta) \simeq 2\gamma\,\frac{M^2 - m_\mu^2}{2M} = E\left(1 - \frac{m_\mu^2}{M^2}\right) \qquad (17.98b)$$

$$= \begin{cases} 0.427\,E & \pi\ \text{decay} \\ 0.954\,E & K\ \text{decay} \end{cases} \qquad (17.98c)$$

Examples of such flat spectra are shown in Fig. 17.12c. One sees that the maximum
neutrino energy from K decay is about two times that from $\pi$ decay. However, the
pion production cross section is much higher than that of kaons. As the decay angle
$\theta_\nu$ and the energy $E_\nu$ of the neutrino in the LAB frame are related by (Fig. 17.14a)

$$E_\nu = \frac{M^2 - m_\mu^2}{2(E - P\cos\theta_\nu)} \qquad (17.99)$$

if there is independent information about the neutrino decay angle $\theta_\nu$, its energy
can be determined. Using $E \gg M$, $\cos\theta \simeq 1 - \theta^2/2$, we obtain

$$E_\nu \simeq E_{\nu\max}\,\frac{\Theta^2}{\theta_\nu^2 + \Theta^2}, \qquad \Theta = M/P \qquad (17.100)$$

The distribution curve is plotted in Fig. 17.14b. $\theta_\nu$ can be determined from the
distance of the interaction point at the detector from the beam center. Figure 17.14c
shows a scatter plot of $\theta_\nu$ versus $E_\nu$. The two bands correspond to neutrinos from
K and $\pi$. In this way, at fixed angle one can obtain a dichromatic beam with two
energy peaks. The low-lying flat spectra of Fig. 17.12c are integrated spectra of the
dichromatic beam.
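The kinematics of the narrow band beam can be sketched in a few lines. The code below assumes the two-body relation $E_\nu = (M^2 - m_\mu^2)/[2(E - P\cos\theta_\nu)]$ for a parent of energy E and momentum P, and compares it with the small-angle form with characteristic angle M/P. The particle masses are approximate values in GeV and the function names are illustrative.

```python
import math

M_PI, M_K, M_MU = 0.13957, 0.49368, 0.10566   # GeV, approximate masses

def max_fraction(parent_mass):
    """Maximum neutrino energy as a fraction of the parent energy."""
    return 1.0 - (M_MU / parent_mass) ** 2

def e_nu(parent_energy, theta, parent_mass):
    """Neutrino LAB energy at decay angle theta (two-body kinematics)."""
    p = math.sqrt(parent_energy ** 2 - parent_mass ** 2)
    return (parent_mass ** 2 - M_MU ** 2) / (
        2.0 * (parent_energy - p * math.cos(theta)))

def e_nu_small_angle(parent_energy, theta, parent_mass):
    """Small-angle approximation with characteristic angle M / P."""
    p = math.sqrt(parent_energy ** 2 - parent_mass ** 2)
    big_theta = parent_mass / p
    e_max = parent_energy * max_fraction(parent_mass)
    return e_max * big_theta ** 2 / (theta ** 2 + big_theta ** 2)

print(max_fraction(M_PI), max_fraction(M_K))
for theta in (0.0, 0.5e-3, 1.0e-3, 2.0e-3):   # radians
    print(theta, e_nu(200.0, theta, M_PI), e_nu(200.0, theta, M_K))
```

At a fixed angle the pion and kaon bands give two distinct neutrino energies, which is the origin of the name dichromatic beam.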

Recent Developments More recently, high intensity monochromatic beams of 
varying energy have become available [362]. This apparatus uses horns with a sec- 
ondary beam set at an off-axis angle from the primary beam, taking advantage of 
the differing spectra of the secondaries depending on the production angle. The 
primary beam intensity is also increased. The so-called superbeam uses a proton 
beam having 1-4 MW power. 

With future neutrino oscillation experiments in mind, the concepts of "neutrino
factory" and "beta beam" are being developed [42]. The former uses an intense
tertiary muon beam that is phase rotated and cooled 5) for better optical
properties, accelerated, stored and allowed to decay. In this way $10^{20}$ muons/year are
envisaged. The beta beam accelerates $\beta$-active unstable isotopes, stores them and

5) Phase rotation is an accelerator technique that uses a magnetic field to make the energy spread 
narrow at the expense of making time spread wide or vice versa. Cooling is another technique to 
reduce transverse divergence of the beam. Those techniques are necessary because the secondary 
and tertiary beams spread widely both in energy and direction. 
Figure 17.14 (a) If the parent particle is monochromatic, the neutrino energy $E_\nu$
and the decay angle $\theta_\nu$ are related. (b) When $E \gg M_{\pi,K}$, $E_\nu \simeq E_{\nu\max} \times \Theta^2/(\theta^2 + \Theta^2)$.
(c) Scatter plot of measured neutrino energy versus $\theta_\nu$.


lets them decay. Since $\beta$ decay has very low Q value in the isotope's rest frame,
neutrinos from accelerated isotopes have good quality. They are monochromatic,
collimated and with virtually no background.

17.7.2 

Neutrino Detectors 



Event rates of neutrino experiments are given by $N = N_\nu\,\sigma_{\rm tot}\,N_{\rm target}$. Since $\sigma_{\rm tot}$
is the subject of the investigation, it is beyond the control of the experimenters.
However, the fact that the cross section increases in proportion to the neutrino
energy ($\sigma_{\rm tot} \propto E_\nu$) can be used to the experimenters' advantage by raising the
energy. Making $N_\nu$ and $N_{\rm target}$ large is where the experimenters' skill comes in.
In neutrino experiments, one makes a gigantic target to increase $N_{\rm target}$, but in
order not to lose generated events the target itself has to serve as a detector. A 
bubble chamber is an ideal target cum detector capable of reconstructing all the 
kinematic variables. Its operating principle was briefly described in Fig. 14.2. It can 
often record a single event that definitively detects a new phenomenon, depending 
on what one is pursuing. Good examples are the discovery of the $\Omega$ particle to
test flavor SU(3) (Fig. 14.6) and a neutral current event to confirm the
Glashow-Weinberg-Salam electroweak theory [199, 200].

For precision neutrino experiments, however, one wants to have a more massive 
target/detector to gain high statistics. The size of the bubble chamber is inherently 
constrained by its use in a magnetic field. Gargamelle and BEBC (Big European 
Bubble Chamber) at CERN and the 15 foot bubble chamber at Fermilab are made 
with the highest technology available using superconducting magnets and yet lim- 
ited to ~ 10 tons at most because of the mentioned constraint [1]. More suitable is 
an electronic counter detector, the mass of which can be made arbitrarily large to 
gain high statistics. 

Generated particles in the neutrino interactions other than e and $\mu$ are almost all
pions and nucleons, which are hadronized remnants of a quark jet (a phenomenon 
in which many hadrons are ejected in the same direction). Detailed identification 
of each hadron's motion is not so important as the collective behavior of the jet. 
The bubble chamber is overqualified for such a purpose. Eliminating unnecessary 
information drastically and concentrating on detecting the leptons and the total en- 
ergy of the hadrons, the detector structure can be simpler but massive. Figure 17.15 
shows a typical detector, that of the CDHS (CERN-Dortmund-Heidelberg-Saclay) 
group at CERN [59, 116]. Basically, they are made of iron modules (toroidal and 
magnetized) with drift chambers and scintillation counters interspersed among 
the modules to measure muon tracks and hadronic energy. The whole detector is 
divided into 19 supermodules and each supermodule is further subdivided into 5 or 
10 cm thick iron plates. The combination of iron/scintillator modules constitutes a 
sampling calorimeter for hadrons. The total weight is 1235 ton, and yet the detec- 
tor is electronics personified, equipped with a great number of readout modules. 
Information that the detector can supply is: 

1. Coordinates of the interaction point: from the transverse position the energy
($\omega$) of the incoming neutrino can be determined using Eq. (17.100).

2. Scattering angle $\theta$ of the muon and its energy ($\omega'$): the energy (momentum to
be exact) is determined by the curvature of the muon tracks going through the
magnetized iron.

3. Hadronic total energy ($E_{\rm had}$): measured by the hadron calorimeter.
Figure 17.15 (a) View along the 19 modules of the CDHS electronic detector at SPS.
The black lightguides and phototubes, which are used to measure the hadron energy,
can be seen sticking out of the magnetized iron modules. The hexagonal structures
are the drift chambers that measure the muon trajectories [352]. PHOTO CERN.
(b) Computer reconstruction of a charged-current neutrino interaction in the
detector. The neutrino enters from the left. The top row gives the pulse height in the
scintillator planes along the detector. This is a measure of energy deposited to
determine the total hadron energy. The panels below show the measured coordinates
in the three drift-chamber projections together with the fit curve for the muon
reconstruction. The curvature tells the momentum of the muon [3].



From the above information, s, x, y of charged current interactions ($\nu_\mu + N \to
\mu$ + hadrons) can be determined as well as a self-consistency check made ($E_{\rm had}$ =

CDHS used thick iron to gain mass, which is good for detecting muons and 
hadronic showers, the former because of its long range and the latter because of 
the collective behavior, but then the electron, which makes a short electromagnetic 
shower, is absorbed by the iron and undetectable. Charged current interactions
induced by $\nu_e$ cannot be distinguished from neutral current interactions, which
makes it impossible to detect actively, say, the $\nu_\mu \to \nu_e$ transition, an important
phenomenon known as neutrino oscillation. In this case only the disappearance

6) Neutral current phenomena ($\nu_\mu + N \to \nu_\mu$ + hadrons), where no muons are observed, can be
measured also, but in this case the lepton (neutrino) information is missing and no x information
is obtained.
of the $\nu_\mu$ can be measured by comparing fluxes at different places, which is much
more difficult. If one wants to identify electrons and reconstruct their kinematics, 
one needs fine-grained sampling, which uses more readouts and reduces the target 
density, hence the total mass. This type would be the next generation of detectors. 



17.8 

v-p Deep Inelastic Scattering 



17.8.1 

Cross Sections and Structure Functions 



Feynman diagrams describing neutrino inelastic scattering are shown in
Fig. 17.16b. In the low-energy region that we are dealing with ($E_\nu$ < 100 GeV),
the W mass can be set as infinity and the interaction is well described by the
four-Fermi Lagrangian. But to make the comparison with e-p scattering clear we draw
the W line in the diagram. Both processes are considered as two-body scattering of
quarks by leptons in the parton model. Setting the electron and neutrino kinematics
identical and fixing the incoming energy, we see their scattering is a function of
x and y only. If the scaling rule holds, x represents the parton's initial momentum,
the distribution of which is given by q(x) irrespective of incoming particle species.
Denoting leptonic and hadronic weak (charged) current as $l^\mu$ and $J^\mu$, respectively,
we can express the Lagrangian $\mathcal{L}_F$ as
v separately but in a similar form to e-p scattering,




(a) (b) 

Figure 17.16 (a) e-p inelastic scattering, (b) v-N inelastic scattering in the quark model. 
J2 &\P + q~Pr) Q) Y,{p\J?\Pt){P*\J?\p) 



( (P ■ q)q P \ ( (p-q)qa \ W 2 (Q?,v) 

V P q 2 JV" q 2 ) M^ 



+ [~ (q P qo)] + [~ (p P q a + q P Po)] + [~ i(p P q a ~ q P Po)] 

(17.102) 

where $P_F$ is the sum of the momenta of the final particles. The weak current contains
the axial current, which is not conserved. The constraint due to the current conservation is
absent and $W^{\rho\sigma}$ cannot be simplified like the electromagnetic current. However,
the last term in Eq. (17.102) breaks T invariance, and the three terms in the last 
line are proportional to lepton mass (divided by incident neutrino energy) when 
multiplied by the lepton current and can be neglected compared to the first three 
terms. The third term is a new term, which is introduced by parity violation. The 
cross section can be calculated to give 

d 2 a v ^ (Gp/VI) 2 „„ - d 3 fe' 

dQdco' 4(k-p) L "" (27r) 3 2 W ' 

C = \ E »(fcV(l - y 5 )u(fe)u(fe)y"(l - y 5 )u(k') (17.103a) 

= 2 \k ,p k" + k'°k p - (k ■ k')g p " + ie paali k a k'A (17.103b) 

The neutrino being left-handed, a factor 1/2 for spin average is not necessary.
Calculating Eq. (17.103), we obtain cross sections for $\nu$, $\bar\nu$-N scattering:

Changing variables to x, y, we find the structure functions become in the scaling
limit

$$MW_1(Q^2, \nu) \to F_1(x), \qquad \nu W_2(Q^2, \nu) \to F_2(x),$$

$$\nu W_3(Q^2, \nu) \to -F_3(x) \qquad (17.105)$$

The minus sign in front of $F_3$ is for convenience to make $F_3 > 0$. The cross section
becomes

$$\frac{d^2\sigma^{\nu,\bar\nu}}{dx\,dy} = \frac{G_F^2\,s}{2\pi}\left[(1-y)\,F_2(x) + \frac{y^2}{2}\,2xF_1(x) \pm y\left(1 - \frac{y}{2}\right)xF_3(x)\right] \qquad (17.106)$$

Problem 17.13 
Derive Eq. (17.104). 

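As the charge symmetry operation ($e^{i\pi I_2}$) changes $p \to -n$ and $J^\mu \to J^{\mu\dagger}$,
the following equalities hold:

$$F_j^{\nu p} = F_j^{\bar\nu n}, \qquad F_j^{\bar\nu p} = F_j^{\nu n}, \qquad j = 1, 2, 3 \qquad (17.107)$$

Then for an isoscalar target ($N_p = N_n$)

$$F_j^{\nu N} = (F_j^{\nu p} + F_j^{\nu n})/2 = (F_j^{\bar\nu p} + F_j^{\bar\nu n})/2 = F_j^{\bar\nu N} \qquad (17.108)$$

and the difference between $\sigma^\nu$ and $\sigma^{\bar\nu}$ is only the sign of $F_3$. In conclusion, the
$\nu$, $\bar\nu$-N cross section on an isoscalar target becomes

$$\frac{d^2\sigma^{\nu,\bar\nu}}{dx\,dy} = \frac{G_F^2\,s}{2\pi}\left[(1-y)\,F_2^{\nu N}(x) + \frac{y^2}{2}\,2xF_1^{\nu N}(x) \pm y\left(1 - \frac{y}{2}\right)xF_3^{\nu N}(x)\right] \qquad (17.109)$$
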

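Comparing with Eq. (17.70), we see that, apart from the $F_3$ term, the neutrino cross
section can be formally obtained from the e-p cross section by making the change

$$\frac{4\pi\alpha^2}{Q^4} \to \frac{G_F^2}{2\pi} \qquad (17.110)$$

This is natural if we consider the weak Lagrangian and the equivalent second-order
electromagnetic Lagrangian. If the parton spin is 1/2 and the Callan-Gross equality
($2xF_1 = F_2$) holds, Eq. (17.109) can be rewritten as

$$\frac{d^2\sigma^{\nu N}}{dx\,dy} = \frac{G_F^2\,s}{2\pi}\left[\frac{F_2 + xF_3}{2} + \frac{F_2 - xF_3}{2}\,(1-y)^2\right] \qquad (17.111a)$$

$$\frac{d^2\sigma^{\bar\nu N}}{dx\,dy} = \frac{G_F^2\,s}{2\pi}\left[\frac{F_2 - xF_3}{2} + \frac{F_2 + xF_3}{2}\,(1-y)^2\right] \qquad (17.111b)$$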

Note, the structure functions $F_i$ in the $\nu$-N cross section and those in the e-p
cross sections differ in general. However, if the parton model is right, the structure 
functions are the partons' distribution functions in the nucleon and they should be 
related simply. Let us see how they are related in the quark model. 
17.8.2

$\nu$, $\bar\nu$-q Scattering

To calculate $\nu$ ($\bar\nu$)-q scattering cross sections, we make maximum use of the
helicity-tagged e-$\mu$ scattering cross section we derived in Chap. 7. As the weak
Lagrangian is given by

$$-\mathcal{L}_W = (G_F/\sqrt{2})\,[\bar u\gamma_\mu(1 - \gamma^5)d]\,[\bar\mu\gamma^\mu(1 - \gamma^5)\nu]$$

the expressions for the e-$\mu$ scattering cross section decomposed in terms of helicity
[Eqs. (7.31)-(7.33)] are directly applicable. We substitute



t 41 

n the equations for the e-fi c 

do LL I a 2 i 

"^"Icm"^" 5 ' 



and use $y = (1 - \cos\theta^*)/2$. Since the difference between the neutrino and
antineutrino appears only in its helicity as far as the cross section is concerned, conversion
from e-$\mu$ to $\nu$-q can be easily made:

$$\frac{d\sigma}{dy} = \frac{G_F^2}{\pi}\,s \qquad (\nu q,\ \bar\nu\bar q\ \text{scattering}) \qquad (17.115a)$$

$$\frac{d\sigma}{dy} = \frac{G_F^2}{\pi}\,s\,(1-y)^2 \qquad (\nu\bar q,\ \bar\nu q\ \text{scattering}) \qquad (17.115b)$$

Note that we do not take the spin average for neutrino scattering.

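When we deal with quarks in the nucleon, we replace s by xs and multiply by the
distribution function $q_i(x)$, just as we did in the e-p inelastic scattering. As there
are also antiquarks in the nucleon, whose distribution we denote by $\bar q_i(x)$, we have

$$\frac{d^2\sigma^\nu}{dx\,dy} = \frac{G_F^2}{2\pi}\,s\sum_i 2x\left[q_i(x) + \bar q_i(x)\,(1-y)^2\right] \qquad (17.116a)$$

$$\frac{d^2\sigma^{\bar\nu}}{dx\,dy} = \frac{G_F^2}{2\pi}\,s\sum_i 2x\left[q_i(x)\,(1-y)^2 + \bar q_i(x)\right] \qquad (17.116b)$$

Comparing with Eq. (17.111), we obtain

$$F_2^\nu = \sum_i 2x\left[q_i(x) + \bar q_i(x)\right] \qquad (17.117a)$$

$$xF_3^\nu = \sum_i 2x\left[q_i(x) - \bar q_i(x)\right] \qquad (17.117b)$$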
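The identification of the structure functions with quark distributions can be checked numerically. The sketch below assumes the general form of the cross section with $F_1$, $F_2$ and $xF_3$ (in units of $G_F^2 s/2\pi$), the Callan-Gross relation, and the parton-model assignments $F_2 = 2x(q + \bar q)$, $xF_3 = 2x(q - \bar q)$ for a single flavor. The toy distributions `q_toy` and `qbar_toy` are hypothetical shapes chosen only for the test.

```python
def sigma_structure(x, y, f1, f2, xf3, sign):
    """d^2 sigma / dx dy in units of G_F^2 s / 2 pi.

    sign = +1 for neutrinos, -1 for antineutrinos.
    """
    return ((1.0 - y) * f2 + (y * y / 2.0) * 2.0 * x * f1
            + sign * y * (1.0 - y / 2.0) * xf3)

def sigma_parton(x, y, q, qbar, sign):
    """Same quantity written directly with quark distributions."""
    if sign > 0:
        return 2.0 * x * (q + qbar * (1.0 - y) ** 2)
    return 2.0 * x * (q * (1.0 - y) ** 2 + qbar)

def q_toy(x):
    return (1.0 - x) ** 3          # hypothetical quark distribution

def qbar_toy(x):
    return 0.1 * (1.0 - x) ** 7    # hypothetical antiquark distribution

x, y = 0.3, 0.6
q, qbar = q_toy(x), qbar_toy(x)
f2 = 2.0 * x * (q + qbar)
xf3 = 2.0 * x * (q - qbar)
f1 = f2 / (2.0 * x)                # Callan-Gross relation
for sign in (+1, -1):
    print(sigma_structure(x, y, f1, f2, xf3, sign),
          sigma_parton(x, y, q, qbar, sign))
```

The two columns agree for any x and y, which is the algebraic content of reading off $F_2$ and $xF_3$ by comparison.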
17.8.3

Valence Quarks and Sea Quarks 



There are two kinds of quarks in the nucleon, valence quarks, which define the
quantum number of the nucleon (uud in the proton or udd in the neutron), and
sea quarks, whose quantum numbers are the same as those of the vacuum. As
the sea quarks consist of $q_i\bar q_i$ pairs and for them $q_i(x) = \bar q_i(x)$, Eq. (17.117) means
$F_2$ represents all the quarks and antiquarks and $F_3$ the valence quarks only. If we
denote the distribution functions of u, d, s quarks in the proton as u(x), d(x), s(x),
charge symmetry requires those in the neutron to be d(x), u(x), s(x). If we neglect
s quarks in the nucleon and c quark production, charge and lepton number
conservation requires that reactions between neutrinos and quarks are limited to



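$$\nu_\mu + d \to \mu^- + u, \qquad \nu_\mu + \bar u \to \mu^- + \bar d \qquad (17.118a)$$

$$\bar\nu_\mu + u \to \mu^+ + d, \qquad \bar\nu_\mu + \bar d \to \mu^+ + \bar u \qquad (17.118b)$$

From this we can write the structure functions of the proton and neutron as

$$F_2^{\nu p}(x) = 2x\left[d(x) + \bar u(x)\right] \qquad (17.119a)$$

$$F_2^{\nu n}(x) = 2x\left[u(x) + \bar d(x)\right] \qquad (17.119b)$$
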

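Therefore for isoscalar targets

$$F_2^{\nu N} = (F_2^{\nu p} + F_2^{\nu n})/2 = x\left[u + d + \bar u + \bar d\right] \qquad (17.120a)$$

$$xF_3^{\nu N} = x(F_3^{\nu p} + F_3^{\nu n})/2 = x\left[u + d - \bar u - \bar d\right] = x\left[u_v + d_v\right] \qquad (17.120b)$$

We have marked the distribution function of valence quarks with the subscript v.
Equation (17.116) is rewritten as

$$\frac{d^2\sigma^{\nu N}}{dx\,dy} = \frac{G_F^2}{2\pi}\,s\,x\left[q(x) + \bar q(x)\,(1-y)^2\right] \qquad (17.121a)$$

$$\frac{d^2\sigma^{\bar\nu N}}{dx\,dy} = \frac{G_F^2}{2\pi}\,s\,x\left[q(x)\,(1-y)^2 + \bar q(x)\right] \qquad (17.121b)$$

But this time $q(x) = u(x) + d(x)$, $\bar q(x) = \bar u(x) + \bar d(x)$. For e-N scattering the cross
section is proportional to the charge squared and we obtain from Eq. (17.72) the
following expressions:

$$F_2^{ep} = x\left[\frac{4}{9}(u + \bar u) + \frac{1}{9}(d + \bar d)\right] \qquad (17.122a)$$

$$F_2^{en} = x\left[\frac{4}{9}(d + \bar d) + \frac{1}{9}(u + \bar u)\right] \qquad (17.122b)$$

Therefore for an isoscalar target

$$F_2^{eN} = \frac{5}{18}\,x\left[u + d + \bar u + \bar d\right] = \frac{5}{18}\,x\left[q(x) + \bar q(x)\right] \qquad (17.123)$$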
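The factor 5/18 is simply the average squared charge of the u and d quarks, and this can be verified with exact rational arithmetic. In the sketch below the numbers assigned to the u and d content are arbitrary placeholders (they stand for $u + \bar u$ and $d + \bar d$ at some fixed x), and the function names are illustrative.

```python
from fractions import Fraction

CHARGE_SQ = {"u": Fraction(4, 9), "d": Fraction(1, 9)}

def f2_electron_over_x(u_total, d_total):
    """F2 / x for e-p scattering; u_total = u + ubar, d_total = d + dbar."""
    return CHARGE_SQ["u"] * u_total + CHARGE_SQ["d"] * d_total

def f2_isoscalar_electron_over_x(u_total, d_total):
    """Average of proton and neutron; the neutron swaps u and d."""
    proton = f2_electron_over_x(u_total, d_total)
    neutron = f2_electron_over_x(d_total, u_total)
    return (proton + neutron) / 2

def f2_isoscalar_neutrino_over_x(u_total, d_total):
    """F2 / x for neutrino scattering on an isoscalar target."""
    return u_total + d_total

u_total, d_total = Fraction(7, 10), Fraction(4, 10)   # placeholder values
ratio = (f2_isoscalar_electron_over_x(u_total, d_total)
         / f2_isoscalar_neutrino_over_x(u_total, d_total))
print(ratio)   # 5/18
```

The ratio does not depend on the placeholder values, because the isoscalar average weights u and d equally.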
Therefore, if the quark model is right, the structure function for the isoscalar target
has to satisfy

$$F_2^{eN}(x) = \frac{5}{18}\,F_2^{\nu N}(x) \qquad (17.124)$$



17.8.4 

Comparisons with Experimental Data 

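Total Cross Section Integrating Eq. (17.121) over x and y, we obtain

$$\sigma^\nu = \sigma_0\,(Q + \bar Q/3) \qquad (17.125a)$$

$$\sigma^{\bar\nu} = \sigma_0\,(Q/3 + \bar Q) \qquad (17.125b)$$

$$\sigma_0 = \frac{G_F^2}{2\pi}\,s, \qquad Q = \int dx\,x\,q(x), \qquad \bar Q = \int dx\,x\,\bar q(x) \qquad (17.125c)$$

This means that as a result of Bjorken scaling, the total cross section of the neutrino
increases in proportion to the incident neutrino energy

$$\sigma_{\rm tot} \propto s = 2M\omega \qquad (17.126)$$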

The compilation of neutrino total cross section given in Fig. 17.17 confirms the 
above conclusion well. 
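The coefficients 1 and 1/3 in Eq. (17.125) are just the y integrals of the two point-like shapes, 1 and (1 - y)^2. A minimal numerical sketch, assuming a simple midpoint rule and hypothetical function names, is:

```python
def integrate(f, a, b, n=10_000):
    """Midpoint rule on [a, b] with n slices."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))


def sigma_over_sigma0(Q, Qbar, antineutrino=False):
    """Integrate the y dependence of the cross section over 0 <= y <= 1.

    Q and Qbar are the momentum fractions carried by quarks and antiquarks.
    """
    if antineutrino:
        shape = lambda y: Q * (1 - y) ** 2 + Qbar
    else:
        shape = lambda y: Q + Qbar * (1 - y) ** 2
    return integrate(shape, 0.0, 1.0)


# Illustrative inputs only (not measured values).
print(sigma_over_sigma0(0.4, 0.1))                     # close to 0.4 + 0.1/3
print(sigma_over_sigma0(0.4, 0.1, antineutrino=True))  # close to 0.4/3 + 0.1
```

Since $\sigma_0$ is proportional to s, both totals inherit the linear rise with neutrino energy.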

If the rule $\sigma_{\rm tot} \propto \omega$ breaks down, it may be due to either 1) the effect of the
W mass or a higher order effect of the weak interaction or 2) the parton model is
bad (needs QCD correction). To date, the slope of the neutrino cross section for
$\omega < 350$ GeV is constant and no sign of breakdown is seen.

Antiquarks in the Nucleon  If we set $\beta = \bar Q/(Q + \bar Q)$, which is the fraction of the
quark momentum carried by antiquarks, $\beta$ can be calculated from the total cross-section ratio:

$\frac{Q + 3\bar Q}{3Q + \bar Q} = \frac{\sigma^{\bar\nu}/E}{\sigma^{\nu}/E}$ (17.127)

which gives $\beta = 0.175$. 7)
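The cross-section ratio can be inverted for $\beta$ in closed form. Dividing numerator and denominator by $Q + \bar Q$ gives a ratio of $(1 + 2\beta)/(3 - 2\beta)$, hence $\beta = (3r - 1)/[2(1 + r)]$. The short sketch below (hypothetical function names) encodes both directions; it uses no experimental numbers beyond the value of $\beta$ quoted in the text.

```python
def ratio_from_beta(beta):
    """Antineutrino/neutrino total cross-section ratio for antiquark fraction beta."""
    return (1 + 2 * beta) / (3 - 2 * beta)


def beta_from_ratio(r):
    """Invert ratio_from_beta: antiquark momentum fraction from the measured ratio r."""
    return (3 * r - 1) / (2 * (1 + r))


# The value beta = 0.175 quoted in the text corresponds to a ratio slightly above 1/2.
r = ratio_from_beta(0.175)
print(r, beta_from_ratio(r))
```

Two limits are a quick sanity check: no antiquarks ($\beta = 0$) gives a ratio of 1/3, and equal quark and antiquark content ($\beta = 1/2$) gives a ratio of 1.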



dσ/dy  Integrating Eq. (17.121) over x, we obtain the y distribution

$\frac{d\sigma^{\nu}}{dy} = \sigma_0\left[Q + \bar Q(1-y)^2\right]$ (17.128a)

$\frac{d\sigma^{\bar\nu}}{dy} = \sigma_0\left[Q(1-y)^2 + \bar Q\right]$ (17.128b)


7) Rigorously, the distribution functions are not functions of x only but contain a $Q^2$-dependent
correction (QCD effect), which makes $\beta$ energy dependent. We ignore its effect here.
In the limit $Q \gg \bar Q$,

$\frac{d\sigma^{\nu}}{dy} \simeq \sigma_0 Q$, $\frac{d\sigma^{\bar\nu}}{dy} \simeq \sigma_0 Q(1-y)^2$ (17.129)



Remembering that the cross sections of $\nu$, $\bar\nu$ with a point particle are given by
Eq. (17.115) and have the shapes $d\sigma/dy(\nu q) \sim 1$, $d\sigma/dy(\bar\nu q) \sim (1-y)^2$, we see
that the cross sections of $\nu$, $\bar\nu$ with nucleons directly reflect those of point particles.
We present actual data [3] in Fig. 17.18. The assumption of no antiquarks in the
nucleon, Eq. (17.129), is almost valid, but the effect of a small mixture of antiquarks
is seen as a deviation at $y \approx 1$.

Electric Charge of the Quark  According to the quark/parton model, the structure
functions represent momentum distributions in the nucleon. They are given by
Eqs. (17.120) and (17.123) and represent the same function apart from their coefficients.
Their ratio should be 5/18 if the quark model is right. Figure 17.18b shows
a comparison of neutrino data with e-p and μ-p data [3]. The agreement with predictions
is excellent and supports the idea of the parton being the quark.

Later observations on various processes, including the $R = \sigma(e^-e^+ \to \text{all hadrons})/\sigma(e^-e^+ \to \mu^-\mu^+)$
annihilation process and $\pi^0 \to 2\gamma$, show good
agreement with the quark model (Sect. 14.3). It was also shown that no parton
models with integer charge can explain all the data, but the quark model can give
Figure 17.18 (a) Except for a small shift due
to a small mixture of antiquarks, the parton
model formulae $d\sigma^{\nu}/dy \sim 1$, $d\sigma^{\bar\nu}/dy \sim (1-y)^2$
agree well with the data. (b) Com-
$F_2(x) = q(x) + \bar q(x)$, $xF_3(x) = q(x) - \bar q(x)$
and $\bar q(x)$ are plotted. •, ○, ■ are ν data,
□ = μp, △ = eD data [3].



satisfactory explanations for all of them [246]. Thus it was proved without doubt 
that the parton is nothing but the quark itself. 



17.8.5 
Sum Rules 



Expressing the quantum numbers of the nucleon in terms of quarks, we have the 
following sum rules. Here we include s quarks. 

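Baryon number: $B = 1 = \int dx\,\frac13\left[u - \bar u + d - \bar d + s - \bar s\right]$ (17.130a)

Isospin: $I_3 = 1/2 = \int dx\,\frac12\left[u - d - \bar u + \bar d\right] = \int dx\,\frac12\left[u_v - d_v\right]$ (17.130b)

Charge: $Q = 1 = \int dx\left[\frac23(u - \bar u) - \frac13(d - \bar d + s - \bar s)\right]$ (17.130c)

Strangeness: $S = 0 = \int dx\,[s - \bar s]$ (17.130d)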
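From Eq. (17.122) we also have

$\int\frac{dx}{x}\left[F_2^{ep}(x) - F_2^{en}(x)\right] = \frac13\int dx\left[(u_v - d_v) + 2(\bar u - \bar d)\right]$ (17.131)

Combining the SU(3) symmetry assumption $\bar u(x) = \bar d(x) = \bar s(x)\,(= s(x))$ with
Eq. (17.130b), we have another equality

$U \equiv \int\frac{dx}{x}\left[F_2^{ep}(x) - F_2^{en}(x)\right] = \frac13$ (17.132)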



They are not all independent but are related by Nishijima-Gell-Mann's law. 

Equation (17.130a) is referred to as the Gross-Llewellyn Smith (GLS) sum
rule [188], Eq. (17.130c) as Adler's sum rule [6] and Eq. (17.132) as Gottfried's sum
rule [182]. Verification of these sum rules is not easy, as accurate data in the region
x ~ 0 are necessary. Experimentally, data are available for 0.003 < x < 0.8 and
B ~ 0.9 ± 0.2, U ~ 0.227 ± 0.016. We may say the GLS and Gottfried's sum rules
are roughly valid.
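The Gottfried sum follows from the charge weights in Eq. (17.122): the difference of the proton and neutron structure functions leaves a valence term plus twice the sea asymmetry, each weighted by 1/3. As a rough sketch (hypothetical function names; the valence integral is set to 1 as implied by Eq. (17.130b)), one can see how a measured U translates into a sea asymmetry if the measured value is taken at face value and the unmeasured x region is ignored:

```python
from fractions import Fraction


def gottfried_sum(valence_diff, sea_diff):
    """U = (1/3) * [int (u_v - d_v) dx + 2 * int (ubar - dbar) dx]."""
    return Fraction(1, 3) * (valence_diff + 2 * sea_diff)


def sea_diff_from_u(u_measured, valence_diff=1):
    """Integral of (ubar - dbar) implied by a given value of U."""
    return (3 * u_measured - valence_diff) / 2


print(gottfried_sum(1, 0))     # 1/3 for a flavor-symmetric sea
print(sea_diff_from_u(0.227))  # negative: more dbar than ubar under these assumptions
```

This is a bookkeeping exercise only; it does not account for the experimental uncertainty or for the extrapolation to x ~ 0 mentioned above.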

Existence of the Gluon  As the total momentum of the quarks and gluons should
add up to that of the proton,

$1 = \int dx\left[\sum_i x q_i(x) + x g(x)\right]$ (17.133)

where g(x) denotes the gluon momentum distribution function. However, as
the gluon does not interact with the electron or the neutrino, the total momentum
$P_q$ of quarks that appear in Eq. (17.133) is given by

$P_q = \int dx \sum_i x q_i(x) = 1 - \int dx\, x g(x)$ (17.134)



The total momentum of quarks and antiquarks is given by $Q + \bar Q$. From
Eq. (17.125) and the experimental value of the total cross section, we have

$Q + \bar Q = \frac{3\pi}{4G_F^2 M\omega}\left(\sigma^{\nu}_{\rm tot} + \sigma^{\bar\nu}_{\rm tot}\right) = 0.5 \pm 0.04$ (17.135)

Since the sum of momenta obtained from e-p scattering must give an identical value,
there are particles which carry the rest of the momentum but do not interact weakly
or electromagnetically. This was the first indication of the gluon's existence. The
gluon carries half of the nucleon momentum, which means it exists in the nucleon
not instantaneously but permanently.
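To see how Eq. (17.135) turns cross-section slopes into a momentum fraction, the sketch below evaluates it numerically. The input slopes (in cm^2/GeV) are round illustrative numbers of the typical size found in compilations and are an assumption, not values taken from this text; the constants are standard rounded values, and the function name is hypothetical.

```python
import math

G_F = 1.1664e-5          # Fermi constant in GeV^-2
M_NUCLEON = 0.939        # nucleon mass in GeV
HBARC2_CM2 = 0.3894e-27  # (hbar c)^2 in GeV^2 cm^2, converts cm^2 to GeV^-2


def quark_momentum_fraction(slope_nu, slope_nubar):
    """Q + Qbar from Eq. (17.135); slopes are sigma_tot/omega in cm^2/GeV."""
    slope_sum_natural = (slope_nu + slope_nubar) / HBARC2_CM2  # now in GeV^-3
    return 3 * math.pi / (4 * G_F**2 * M_NUCLEON) * slope_sum_natural


# Assumed illustrative slopes for neutrino and antineutrino scattering.
fraction = quark_momentum_fraction(0.677e-38, 0.334e-38)
print(round(fraction, 2))      # roughly one half
print(round(1 - fraction, 2))  # remainder attributed to gluons
```

With inputs of this size the quarks and antiquarks account for about half of the nucleon momentum, in line with the value quoted in Eq. (17.135).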

In summary, we have shown that the parton consists of the quark and the gluon.
Direct evidence of the gluon was later presented by three-jet production in the
electron-positron annihilation process $e^-e^+ \to q + \bar q + g$ [43, 291] (Fig. 17.19),
and that it has spin 1 was also confirmed [363].

Figure 17.19 Direct production of the gluon
$q + \bar q + g$ in the JADE experiment [291].



17.9 

Parton Model in Hadron-Hadron Collisions



17.9.1 

Drell-Yan Process 

The Drell-Yan process is a way of producing a (large mass) muon pair in hadronic
reactions. It is a good test bench for applying the parton model adopted in lepton
deep inelastic scattering to hadronic processes. We have learned that the nucleon,
a representative hadron, can be treated as a collection of partons in the infinite
momentum limit. If high-energy hadron reactions, which produce a great number
of hadrons, can be interpreted as elastic or quasielastic parton scattering,
interpretations of hadronic processes would be greatly simplified. If an
antiparton in one of the hadrons annihilates with a parton in the other hadron,
producing a lepton pair via electromagnetic interaction, it would provide an easily
recognizable object, especially if it is hard, i.e. has a large invariant mass. Let us
consider the reaction

$p\ (\text{or } \pi, \bar p) + p \to \mu^- + \mu^+ + X$ (17.136)

where X means anything. In the parton model, the process can be considered as
the one depicted in Fig. 17.20: a quark q having momentum
$p_1 = x_1 P_1$ pair-annihilates with an antiquark $\bar q$ having $p_2 = x_2 P_2$ to become a
muon pair. We denote the kinematic variables as

$q(p_1) + \bar q(p_2) \to \mu^-(k_1) + \mu^+(k_2)$ (17.137)

Figure 17.20 In the parton model, $p + p \to \mu^-\mu^+ + X$ is replaced by $q\bar q \to \gamma^* \to \mu^-\mu^+$.

Writing the invariant mass and momentum of the muon pair as m, $q^\mu$ and neglecting
the parton mass, we have in the center of mass (CM) frame of the colliding
protons,

$p_1 = x_1 P_1 \simeq x_1(E, 0, 0, P)$ (17.138a)

$p_2 = x_2 P_2 \simeq x_2(E, 0, 0, -P)$ (17.138b)

$q = x_1 P_1 + x_2 P_2 = [(x_1 + x_2)E, 0, 0, (x_1 - x_2)P]$ (17.138c)

where we have neglected the proton mass and set $E \simeq P$. From this we can define
the invariant mass of the muon pair

$m^2 = q^2 = (x_1 + x_2)^2E^2 - (x_1 - x_2)^2P^2 \simeq 4x_1x_2E^2 = x_1x_2 s \equiv \tau s$ (17.139)

Note, here, $q^2 > 0$, namely it is time-like, in contrast to $q^2 < 0$ in deep inelastic
scattering. The cross section for $q\bar q \to \mu^-\mu^+$ can be obtained from that of
$e^-e^+ \to \mu^-\mu^+$ [Eq. (7.38)]

$\sigma_{\rm tot}(e^-e^+ \to \mu^-\mu^+) = \frac{4\pi\alpha^2}{3s}$ (17.140b)

by replacing $e^-e^+$ by $q\bar q$. Namely, we replace s by $m^2$ and correct for the quark
charge $\lambda_q$ and color degrees of freedom. As there are three colors R, G, B for
the quarks but the muon pair is colorless, only combinations of $R\bar R$, $G\bar G$ and
$B\bar B$ are allowed. Since each color exists with probability 1/3, the color factor
is $(1/3)^2 \times 3$. Setting the probability of the quarks having momentum $x_1, x_2$ as
$q(x_1)\bar q(x_2)dx_1dx_2$, we obtain

$d^2\sigma = \frac{4\pi\alpha^2}{9m^2}\sum_i \lambda_i^2\left[q_i(x_1)\bar q_i(x_2) + \bar q_i(x_1)q_i(x_2)\right]dx_1dx_2$ (17.141)
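The color factor quoted above can be reproduced by brute-force counting. This is a small illustrative sketch (the function name is hypothetical): each quark color and each antiquark anticolor is assigned probability 1/3, and only matching color-anticolor pairs can annihilate into the colorless muon pair.

```python
from fractions import Fraction
from itertools import product

COLORS = ("R", "G", "B")


def color_factor():
    """Probability that a random quark color matches the antiquark's anticolor."""
    p = Fraction(1, len(COLORS))
    total = Fraction(0)
    for quark_color, antiquark_anticolor in product(COLORS, repeat=2):
        if quark_color == antiquark_anticolor:  # R-Rbar, G-Gbar or B-Bbar
            total += p * p
    return total


print(color_factor())  # 1/3, i.e. (1/3)^2 x 3
```

Multiplying the point-like cross section $4\pi\alpha^2/(3m^2)$ by this factor gives the coefficient $4\pi\alpha^2/(9m^2)$.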
We introduce a directly observable variable referred to as the "Feynman x"

$x_F = x_1 - x_2 = \pm 2|\boldsymbol q|/\sqrt{s}$ (17.142)






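where $\boldsymbol q$ is the momentum of the muon pair in the pp CM frame. Using the Feynman
x 8) we can rewrite

$dx_1dx_2 = \frac{dm^2\,dx_F}{(x_1 + x_2)s}$ (17.143a)

$\frac{d^2\sigma}{dm^2dx_F} = \frac{4\pi\alpha^2}{9m^2s}\sum_i\lambda_i^2\frac{\left[q_i(x_1)\bar q_i(x_2) + \bar q_i(x_1)q_i(x_2)\right]}{(x_1 + x_2)}$ (17.143b)

$= \frac{4\pi\alpha^2}{9\tau s^2}\sum_i\lambda_i^2\frac{\left[q_i(x_1)\bar q_i(x_2) + q_i(x_2)\bar q_i(x_1)\right]}{(x_1 + x_2)}$ (17.143c)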
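$x_1$, $x_2$ can be calculated from two observables $\tau$, $x_F$:

$x_1 = \frac12\left[\sqrt{x_F^2 + 4\tau} + x_F\right]$ (17.144a)

$x_2 = \frac12\left[\sqrt{x_F^2 + 4\tau} - x_F\right]$ (17.144b)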
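The inversion follows from the two defining relations $x_1 - x_2 = x_F$ and $x_1 x_2 = \tau$. A minimal sketch (hypothetical function names and sample values) that solves for the momentum fractions and checks both relations:

```python
import math


def momentum_fractions(tau, x_f):
    """Solve x1 - x2 = x_F and x1 * x2 = tau for the parton momentum fractions."""
    root = math.sqrt(x_f**2 + 4 * tau)
    return 0.5 * (root + x_f), 0.5 * (root - x_f)


def pair_mass(tau, sqrt_s):
    """Invariant mass of the lepton pair, m = sqrt(tau * s)."""
    return math.sqrt(tau) * sqrt_s


# Hypothetical kinematic point.
x1, x2 = momentum_fractions(tau=0.04, x_f=0.3)
print(x1, x2)
print(x1 - x2, x1 * x2)      # recovers x_F and tau up to rounding
print(pair_mass(0.04, 27.4)) # pair mass in the same units as sqrt(s)
```

At $x_F = 0$ the two fractions coincide, $x_1 = x_2 = \sqrt{\tau}$.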

The significance of Eq. (17.143c) lies in the fact that we can use parton distribution 
functions determined by deep inelastic scattering. Note, however, that the kinemat- 
ical region of its applicability has to be chosen with care. The parton model is valid 
only in hard reactions, namely, energy and momentum transfer associated with re- 
actions have to be large. The parton's momentum itself has to be larger than 1 GeV. 
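From experience in e(ν)-N scattering, it would be safe if x > 1/E. Assuming the
conditions are satisfied, we examine Eq. (17.143b) at $x_F = 0$ ($x_1 = x_2 = \sqrt{\tau}$). It is
expressed as

$\left.\frac{d^2\sigma}{dm^2dx_F}\right|_{x_F=0} = \frac{4\pi\alpha^2}{9m^2 s\sqrt{\tau}}\sum_i\lambda_i^2\,q_i(\sqrt{\tau})\,\bar q_i(\sqrt{\tau})$ (17.145)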

This means we can observe the product of distribution functions and check the
validity of the parton model. To obtain $d\sigma/dm^2$, we may integrate Eq. (17.143b),
but it is more convenient to go back to Eq. (17.141) and use the relation

$1 = \int\delta(m^2 - x_1x_2s)\,dm^2$ (17.146)

After changing the order of integration we obtain

$m^4\frac{d\sigma}{dm^2} = \frac{4\pi\alpha^2}{9}\,\tau\frac{dL}{d\tau}$ (17.147a)

$\frac{dL}{d\tau}(\tau) = \int dx_1dx_2\,\delta(x_1x_2 - \tau)\sum_i\lambda_i^2\left[q_i(x_1)\bar q_i(x_2) + q_i(x_2)\bar q_i(x_1)\right]$ (17.147b)

$dL/d\tau(\tau)$ can be interpreted as the luminosity of the parton collision. According to
Eq. (17.147), the lhs does not depend on energy if $\tau = m^2/s$ is kept constant; in
other words, a scaling law holds for the Drell-Yan cross section. Figure 17.21a [79]
shows the Drell-Yan cross sections at 200, 300 and 400 GeV. Scaling is confirmed
and we have also shown the validity of the parton model in hadronic reactions.
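The scaling statement can be made concrete with a toy calculation. The sketch below assumes a single quark flavor with charge squared 4/9 and invented parton densities (the shapes of `q` and `qbar` are hypothetical, chosen only to be smooth and to vanish at x = 1). It evaluates the parton luminosity by integrating out the delta function, which leaves an integral over $x_1$ from $\tau$ to 1 with measure $dx_1/x_1$ and $x_2 = \tau/x_1$.

```python
import math

ALPHA = 1 / 137.036  # fine structure constant


def q(x):
    """Toy quark density (hypothetical shape)."""
    return (1 - x) ** 3 / math.sqrt(x)


def qbar(x):
    """Toy antiquark density (hypothetical shape)."""
    return 0.2 * (1 - x) ** 7 / x


def dl_dtau(tau, charge_sq=4 / 9, n=2000):
    """Parton luminosity: integral over x1 in (tau, 1) with x2 = tau / x1.

    Uses the substitution u = ln(x1), so that dx1/x1 = du, and a midpoint rule.
    """
    lo = math.log(tau)
    h = -lo / n
    total = 0.0
    for i in range(n):
        x1 = math.exp(lo + (i + 0.5) * h)
        x2 = tau / x1
        total += q(x1) * qbar(x2) + q(x2) * qbar(x1)
    return charge_sq * h * total


def m4_dsigma_dm2(m, sqrt_s):
    """Scaled cross section m^4 dsigma/dm^2, which depends on tau = m^2/s only."""
    tau = m**2 / sqrt_s**2
    return 4 * math.pi * ALPHA**2 / 9 * tau * dl_dtau(tau)


# Two hypothetical (m, sqrt(s)) pairs with the same tau = 0.0625.
print(m4_dsigma_dm2(5.0, 20.0), m4_dsigma_dm2(7.5, 30.0))
```

The two printed numbers agree because the mass and energy enter only through $\tau$; a different $\tau$ gives a different value.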

8) The "x" of deep inelastic scattering is referred to as the Bjorken x. 
(b) Angular distribution of muon pairs showing $1 + \cos^2\theta$ characteristic of $q\bar q \to \mu\bar\mu$ [79, 207].

Note, however, that although momentum and angular distributions are well reproduced,
the absolute cross section exhibits a small discrepancy; the difference is
referred to as the K factor

$K = \frac{(d\sigma/dm^2)_{\rm exp}}{(d\sigma/dm^2)_{\rm DY}}$ (17.148)

Observation [79] shows that K ~ 1.6 in this energy region and is considered to
arise from QCD higher order corrections.



17.9.2 

Other Hadronic Processes 



The formalism we developed for the Drell-Yan process can easily be extended to
other hadronic reactions. Let us consider a process where hadron A and hadron B
collide and emit a parton or a lepton, which we denote generically as c:

$A + B \to c + X$ (17.149)




The reaction cross section of the process can be expressed in parton language as
that of parton a in hadron A and b in B [$\equiv \hat\sigma(ab \to cX)$] multiplied by parton distribution
functions. If we use $f_a^A(x_a)dx_a$ to denote the probability of parton a being
found in hadron A with momentum fraction $x_a$, the hadronic cross section is expressed as

$\sigma(AB \to cX) = \int dx_a dx_b\,\hat\sigma(ab \to cX)\left[f_a^A(x_a)f_b^B(x_b) + (A \leftrightarrow B\ \text{if}\ a \neq b)\right]$ (17.150)

Here $\hat\sigma$ denotes the cross section of the elementary process by partons and is to be understood as the
average of the color degrees of freedom for the initial state and the sum over the
final states. Writing the total energy squared of the partons a and b as $\hat s$, we have

$\hat s = x_a x_b s = \tau s$ (17.151)

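The formula corresponding to Eqs. (17.147) can be written as

$\frac{d\sigma}{d\tau}(AB \to cX) = \frac{dL}{d\tau}(\tau)\,\hat\sigma(ab \to cX;\ \hat s = \tau s)$ (17.152a)

$\frac{dL}{d\tau}(\tau) = \int dx_a dx_b\,\delta(x_a x_b - \tau)\left[f_a^A(x_a)f_b^B(x_b) + (A \leftrightarrow B\ \text{if}\ a \neq b)\right]$ (17.152b)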
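As for observables, rapidity $\eta$ is commonly used in addition to Feynman x, where

$E_{ab} = x_a E + x_b E = (x_a + x_b)E$ (17.153a)

$P_{ab} = x_a P + x_b(-P) \simeq (x_a - x_b)E$ (17.153b)

$\eta = \frac12\ln\frac{E_{ab} + P_{ab\parallel}}{E_{ab} - P_{ab\parallel}} = \frac12\ln\frac{x_a}{x_b}$ (17.153c)

$x_a$, $x_b$ can be obtained using Eq. (17.144) if $\tau$ and $x_F$ are known, but when $\tau$ and $\eta$
are known, they can be obtained from

$x_a = \sqrt{\tau}\,e^{+\eta}$, $x_b = \sqrt{\tau}\,e^{-\eta}$ (17.154)

In terms of rapidity, the cross section is expressed as

$\frac{d^2\sigma}{d\tau\,d\eta} = \frac{d^2\sigma}{dx_a dx_b} = \left[f_a^A(x_a)f_b^B(x_b) + (A \leftrightarrow B\ \text{if}\ a \neq b)\right]\hat\sigma$ (17.155)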

These formulae are used for expressing cross sections for production of W, Z and 
jets in hadronic reactions. 
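The equality of the two differential cross sections in Eq. (17.155) rests on the change of variables from $(x_a, x_b)$ to $(\tau, \eta)$ having unit Jacobian. The following sketch (hypothetical function names, finite-difference derivatives) checks this numerically together with the round trip between the two sets of variables.

```python
import math


def fractions_from_tau_eta(tau, eta):
    """Eq. (17.154): momentum fractions from tau and rapidity."""
    root = math.sqrt(tau)
    return root * math.exp(eta), root * math.exp(-eta)


def tau_eta_from_fractions(x_a, x_b):
    """Inverse map: tau = x_a * x_b and eta = (1/2) ln(x_a / x_b)."""
    return x_a * x_b, 0.5 * math.log(x_a / x_b)


def jacobian(tau, eta, h=1e-6):
    """|d(x_a, x_b) / d(tau, eta)| estimated with central differences."""
    xa_p, xb_p = fractions_from_tau_eta(tau + h, eta)
    xa_m, xb_m = fractions_from_tau_eta(tau - h, eta)
    dxa_dtau, dxb_dtau = (xa_p - xa_m) / (2 * h), (xb_p - xb_m) / (2 * h)
    xa_p, xb_p = fractions_from_tau_eta(tau, eta + h)
    xa_m, xb_m = fractions_from_tau_eta(tau, eta - h)
    dxa_deta, dxb_deta = (xa_p - xa_m) / (2 * h), (xb_p - xb_m) / (2 * h)
    return abs(dxa_dtau * dxb_deta - dxa_deta * dxb_dtau)


# Hypothetical kinematic point.
x_a, x_b = fractions_from_tau_eta(0.01, 0.8)
print(tau_eta_from_fractions(x_a, x_b))  # recovers (0.01, 0.8) up to rounding
print(jacobian(0.01, 0.8))               # close to 1
```

A unit Jacobian means $d\tau\,d\eta = dx_a\,dx_b$, so no extra kinematic factor appears when the cross section is quoted per unit rapidity.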



17.10 

A Glimpse of QCD's Power

We have learned that partons are quarks and gluons and that gluons carry a fair 
amount of their parents' momentum. The dynamics of quarks and gluons have 
Figure 17.22 Deep inelastic scattering in QCD. Partons are made of quarks
and gluons and are constantly interacting with each other. For a complete QCD
treatment, the contribution of the gluon has to be included.



been investigated successfully using QCD, a color SU(3) gauge theory. In its framework,
the deep inelastic scattering process can be thought of as the electron interacting
with one of the quarks (Fig. 17.22). In the parton model, partons are treated
as mutually independent and the whole reaction can be thought of as a sum of
independent e-q elastic scatterings. In terms of the equivalent photon scattering,
the virtual photon carrying momentum q and mass squared $q^2 = -Q^2$ is absorbed by a
quark. In QCD, the quarks are accompanied by gluon clouds, just like the electron
or the proton is surrounded by photon or pion clouds. When the probe used to look
into the inner structure of the target is soft (namely when the momentum transfer
it carries is small), it does not have much resolving power. The virtual photon is
simply absorbed by a parton carrying fractional momentum x of the target proton



Figure 17.23 Deep inelastic scattering process in QCD. (a) As the probe becomes harder,
it begins to see a structure in the quark. The quark is split from the gluon cloud.
(b) The parton model is equal to the Born term. (c,d) The next higher order
Compton-like processes.



changing its momentum from p = xP to p' = q + xP. In QCD the process can 
be considered as the Born term depicted in Fig. 17.23b. But as the probe becomes 
harder and penetrates deep into the object, it begins to resolve the core from the 
cloud. 

When the photon sees a magnified quark, the probability of finding a quark in 
the quark-gluon cloud composite is not 100%. In the infinite momentum frame, 
we may consider that the quark in the cloud carries a fraction z of the total mo- 
mentum, and the virtual photon interacts with it. The process can be expressed 
pictorially as in Fig. 17.23a. The proton carrying momentum P supplies a parton 
with fractional momentum p = yP. Subsequently the photon sees a magnified 
quark interacting with the quark in the cloud and picks up fraction z of p = yP.
The parton with p = yP comes out as one carrying q + zp = q + xP, x = zy.

Just as the parton model treated the proton as a collection of partons, the photon
with larger $Q^2$ sees a parton split into a collection of partons. The parton model is
then the Born term (Fig. 17.23b) of the whole reaction (a), and one can think of the
next-order QCD process as depicted in Fig. 17.23c-f. In Fig. 17.23c,d, the photon




Figure 17.24 $F_2(x, Q^2)$ plotted as a function of $Q^2$ for different values of x. The data
are a compilation of those obtained from e-p and μ-p scattering and include those of
the ZEUS [399], NMC [295], BCDMS [52] and E665 [4] groups. The lines are NLO (next to
leading order) QCD fitting.

sees a quark split into a quark and a gluon and in Fig. 17.23e,f, a gluon split into a
$q\bar q$ pair.

The whole process is formulated as the DGLAP (Dokshitzer-Gribov-Lipatov- 
Altarelli-Parisi) evolution equation [17, 18, 123, 187]. We do not go into the details 
of its calculation here, but let us catch a glimpse of its power. Figure 17.24 shows 
an example of the DGLAP formula fitted to the structure function. The data clearly
show variation of the cross section as a function of $Q^2$, i.e. the breakdown of
scaling and hence of the naive parton model. The higher order QCD effect is at work,
as shown by the various slopes of the distribution of the data at fixed x, and can
reproduce the data correctly for a vast range of kinematic variables. The qualitative
features of the data can be understood as follows. As $Q^2$ grows, the virtual photon
sees more and more gluons, which in turn create $q\bar q$ pairs. This means the number
of quarks in the large x region decreases while that in the small x region increases.
This sort of behavior originates from multiple emission of gluons, which is common
to all gauge field theories. But this amazing data reproducibility together with
other evidence we mentioned in describing the quark model in Chap. 14 firmly
established the validity of QCD as the correct theory of quarks and gluons.



18 

Gauge Theories 



Historical Prelude 

The development of particle physics during the period beginning in the 1970s com- 
pletely changed our view of matter. For the first time in history, we began to think 
we could answer the question about what rules control the world of matter. Some 
may feel it exaggerated to claim that much, but the success of gauge theory and 
the resultant changes of our view on matter were so revolutionary as to justify 
these words. It is an epoch-making phenomenon comparable only to the establish- 
ing of relativity and quantum mechanics. The gauge principle is now considered 
a fundamental cosmic principle in conjunction with relativity and quantum me- 
chanics. Yet the gauge principle thus established was by no means a new idea. It 
was well known that the Maxwell equations satisfy it, and it was in 1918, imme- 
diately after the proposal of general relativity, that Weyl published the first gauge 
theory [387, 388]. Why did it take such a long time to recognize it as one of the 
most fundamental principles governing nature? The reader may refer to [304] for a 
history of how gauge theory developed. 

First of all, a long time was spent establishing self-consistent quantum field the- 
ories. It was in the late 1940s that QED, the first quantum field theory based on 
the gauge principle, was established. Divergence, an inherent difficulty of quan- 
tum field theory, was treated by renormalization. Gauge theory could have made 
more progress triggered by this opportunity, but there were several factors hin- 
dering it. First, renormalization formalism depends on a recipe in which a finite 
term is extracted by subtracting infinity from infinity, and this was not considered a 
refined theory mathematically. Second, field theoretical treatment of strong interac- 
tion phenomena faced the difficulty that the coupling constant was large (~ 10) and 
perturbation approaches did not work well. Other methods were no better. Above 
all, in the 1960s the idea of a particle democracy, in particular the bootstrap model, 
according to which all the particles are made of other particles, gained considerable 
support. The notion of fundamental particles and confidence in field theory itself 

Furthermore, the weak and strong interaction phenomena did not reveal the 
characteristics that any gauge theory should respect. First, the force has to be long 

Elementary Particle Physics, Volume 1: Quantum Field Theory and Particles. Yorikiyo Nagashima 
"" ' ; GmbH & Co. KGaA, Weinheim 
range, like the Coulomb force, but both were short range. Today, we know this
characteristic is hidden for different reasons, the weak force due to spontaneous 
symmetry breaking and the strong force due to confinement. We were finally able 
to understand their true nature as a result of our perception that quarks and lep- 
tons are the fundamental constituents of matter. Based on the quark model, it was 
possible to sort out and organize essential features from a proliferation of hadronic 
and weak interaction phenomena, thus enabling us to see through this framework 
how the fundamental forces operate. 

The second characteristic of gauge theory is its universality. The strength of grav- 
ity/electromagnetism depends only on the mass/electric charge and not on matter 
species or its form. General relativity is also a sort of gauge theory, but because of its 
mathematical complexity it took a long time to fully appreciate this [202, 230, 376]. 

The third characteristic is that the source of the force in gauge theory must be 
a conserved quantity. Energy momentum/electric charge, the sources of the gravi- 
tational/electromagnetic force, were known to be strictly conserved quantities. On 
the other hand, there were no such conserved quantities in strong and weak inter- 
action. At least they were invisible to our eyes. The reason is the same as for their 
short-range character. 

The fourth characteristic is that in strong and weak interactions the action of the
force changes the identity of the particle (exchange force). For instance, the proton
and the neutron interchange with each other under the action of the nuclear force and
the neutrino is transformed to the charged lepton. It was only after Yang and Mills's
mathematical formulation [396] that extended the applicability of gauge theory to
exchange forces that they were reconsidered in its framework.

Despite its great success, the gauge principle is not an easy concept and is pre- 
sented in a formal way in most textbooks. Here, we try to explain it more physically, 
using known concepts and analogies, for an intuitive understanding but with some 
loss of mathematical rigor. The explanation will take a somewhat roundabout route, 
so let us list the logical steps we follow from now on. 

1. First we define the gauge principle, mathematically and formally. 

2. Then we try to apply a geometrical interpretation to the mathematical formula 
and attach physical meanings, making correspondence with gravity (general 
relativity). 

3. We introduce an experiment to demonstrate the Aharonov-Bohm effect as 
proof that the gauge potential is an observable in quantum mechanics and plays 
a more fundamental role than the electromagnetic field strength. 

4. We extend gauge symmetry to nonabelian groups using SU(2) (isospin) symmetry.
We show, applying the geometrical interpretation to gauge theory, that
extension to nonabelian groups is straightforward and their formulae are easily
derived in a natural and logical way.

5. QCD is a direct application of SU(3) gauge theory to quark colors. We derive the
asymptotic freedom qualitatively, whose discovery elevated QCD to the theory
of the strong interaction historically. We also discuss confinement in a qualitative
way.
6. Establishing the weak interaction theory is more complicated in two ways. First,
it mixes with the electromagnetic interaction and the symmetry has to be slightly
modified to $SU(2)_L \times U(1)_Y$. Glashow's scheme to combine the two forces is
discussed first.

7. Second, the weak force carrier, the $W^\pm$ boson, has a large mass. If the weak
force respects the gauge principle, the symmetry is broken and as a result the 
theory is unrenormalizable. To rescue the situation, the concept of spontaneous 
symmetry breaking is introduced. Within its framework, a symmetry-breaking 
phenomenon can be realized without breaking the symmetry mathematically. 
Then the Higgs mechanism to attach finite mass to particles is discussed. 

8. Combining (6) and (7) the GWS (Glashow-Weinberg-Salam) model, also 
known as electroweak theory, is constructed. 

QCD and the GWS model are collectively dubbed gauge theories based on
$SU(3)_{\rm color} \times SU(2)_L \times U(1)_Y$ and referred to as the "Standard Model" of particle
physics.



18.2 

Gauge Principle 



Formal Definition 

We review QED as the model of the gauge theory to start from. From the Lagrangian's
invariance under global (or first-kind) gauge transformation for a
fermion field $\psi$

$\psi(x) \to \psi'(x) = e^{-i\alpha}\psi(x)$, $\bar\psi(x) \to \bar\psi'(x) = \bar\psi(x)e^{i\alpha}$ (18.1)

Noether's theorem guarantees the existence of a conserved current

$J_{\rm em}^{\mu} = q\bar\psi\gamma^\mu\psi$, $\partial_\mu J_{\rm em}^{\mu} = 0$ (18.2)

where q is the signed electric charge in units of proton charge. A conserved charge
operator Q is defined as the integral of the time component of the conserved current,
which is a generator of the gauge transformation (see arguments in Chap. 5)

$Q = q\int d^3x\,\bar\psi\gamma^0\psi$, $e^{iQ\alpha}\psi e^{-iQ\alpha} = e^{-iq\alpha}\psi$ (18.3)

where the factor q is extracted from the phase considering the different charge
that each fermionic field carries. The gauge field (photon) is said to couple to the
conserved current when the interaction Lagrangian is expressed as

$\mathcal L_{\rm em,int} = -q\bar\psi\gamma^\mu\psi A_\mu$ (18.4)
The photon field satisfies the Maxwell equations

$\partial_\mu F^{\mu\nu} = q j^\nu$ (18.5a)

$\partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu} = 0$ (18.5b)

$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ (18.5c)

which are invariant under the gauge transformation of the photon field

$A_\mu(x) \to A'_\mu(x) = A_\mu(x) + q^{-1}\partial_\mu\alpha(x)$ (18.6)

Here $\alpha$ is an arbitrary well-behaved continuous function of x, not related to $\alpha$ in
Eq. (18.1) at this stage. The field $\psi$ satisfies the free Dirac equation, but if it is gauge
transformed with $\alpha(x)$, which depends on x (local or second kind), according to

$\psi(x) \to \psi'(x) = e^{-i\alpha(x)}\psi(x)$, $\bar\psi(x) \to \bar\psi'(x) = \bar\psi(x)e^{i\alpha(x)}$ (18.7)

the transformed field $\psi'$ no longer satisfies the Dirac equation because the differential
operator brings $\partial_\mu\alpha(x)$ into the equation of motion. However, a Dirac field
coupled with the gauge field has Lagrangian

$\mathcal L_{\rm em} = \bar\psi\left[i\gamma^\mu(\partial_\mu + iqA_\mu) - m\right]\psi$ (18.8)

which is invariant under simultaneous local gauge transformations of both $\psi$ and
$A_\mu$ if one uses the same $\alpha(x)$. In other words, if the force field is inserted in the Lagrangian
in such a form that the differential operator $\partial_\mu$ is replaced by the covariant
differential operator $D_\mu$

$\partial_\mu \to D_\mu = \partial_\mu + iqA_\mu$ (18.9)

the equation of motion is invariant under the local gauge transformations provid- 
ed both the fermion and the force fields are transformed simultaneously. This fact 
has been known for quite a while under the name "minimum interaction". Gauge 
theory has promoted it to a principle. 
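The key property behind this statement is that the covariant derivative of the transformed field equals the same phase factor times the covariant derivative of the original field, whereas the ordinary derivative picks up an extra term proportional to the derivative of $\alpha$. The sketch below checks this numerically in a one-dimensional toy setting. Everything in it is an illustrative assumption: the sample functions `psi`, `a_field` and `alpha`, the charge value, and the use of a single coordinate with finite-difference derivatives.

```python
import cmath
import math

Q_CHARGE = -1.0  # hypothetical charge in units of the proton charge


def psi(x):
    """Toy complex matter field (hypothetical)."""
    return cmath.exp(2.0j * x) * (1 + 0.3 * x)


def a_field(x):
    """Toy gauge potential (hypothetical)."""
    return 0.5 * math.sin(x)


def alpha(x):
    """Toy local gauge function (hypothetical)."""
    return 0.7 * x**2


def deriv(f, x, h=1e-5):
    """Central finite-difference derivative."""
    return (f(x + h) - f(x - h)) / (2 * h)


def covariant_derivative(field, potential, x):
    """D = d/dx + i q A, the one-dimensional analogue of Eq. (18.9)."""
    return deriv(field, x) + 1j * Q_CHARGE * potential(x) * field(x)


def psi_prime(x):
    """Locally transformed matter field, as in Eq. (18.7)."""
    return cmath.exp(-1j * alpha(x)) * psi(x)


def a_prime(x):
    """Transformed potential, as in Eq. (18.6)."""
    return a_field(x) + deriv(alpha, x) / Q_CHARGE


def covariance_residual(x):
    """|D' psi' - exp(-i alpha) D psi|, which should vanish up to numerical error."""
    lhs = covariant_derivative(psi_prime, a_prime, x)
    rhs = cmath.exp(-1j * alpha(x)) * covariant_derivative(psi, a_field, x)
    return abs(lhs - rhs)


def plain_residual(x):
    """Same comparison with the ordinary derivative, which is not covariant."""
    return abs(deriv(psi_prime, x) - cmath.exp(-1j * alpha(x)) * deriv(psi, x))


print(covariance_residual(0.4))  # tiny: the covariant derivative transforms like psi
print(plain_residual(0.4))       # not small: the extra term from d(alpha)/dx survives
```

Since the covariant derivative of the field transforms with the same phase as the field itself, any Lagrangian built from bilinears of the field and its covariant derivative is unchanged.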



Gauge Principle 

"The equation of motion is invariant under a local gauge transformation." 



It may also be stated as follows. When the Lagrangian we deal with is invariant 
under local gauge transformations, a conserved object, which may be called elec- 
tric charge (or more generally hypercharge), is the source of the force field. The 
interaction Lagrangian that couples the matter (fermion) field with the force field 
is obtained by replacing the differential operator in it by the covariant differential. 
The force field introduced this way is called the gauge field. If we make an interact- 
ing system from a free field Lagrangian, which respects a global gauge symmetry, 
by replacing the derivative by the covariant derivative, we say the system is gauged. 
Conserved quantities can be obtained by symmetry transformations, but not all
of them are sources of a force. Nowadays, all the fundamental forces (strong,
electromagnetic, weak and gravity) are considered as based on the gauge principle, and
only those conserved objects based on the gauge principle are strictly conserved.
Those not based on them (including lepton and baryon conservation) are considered
ineligible for the name. In fact, in GUTs (grand unified theories), which deal
with very high energy phenomena that could only be realized at the time of the big
bang, they are broken.

In nonabelian gauge symmetry the phase transformation is extended to multicomponent
fields in matrix representations. Examples are those of SU(2) (isospin)
and SU(3) (unitary spin) we met in Chap. 14.



18.2.2 

Gravity as a Geometry 



According to Einstein's theory of general relativity, a particle always moves on a
straight line (geodesic line to be exact) in space-time. According to it, space-time
is not flat as we perceive intuitively but curved. How it is curved is determined by
the presence of matter (energy-momentum tensor). As it is hard to imagine the
deformation of space-time in the four dimensions in which we live, we consider
a two-dimensional space. Then, figuratively speaking, the space behaves like a
stretched rubber sheet. When it is empty without matter, it is flat (Euclidean space)
and Cartesian coordinates can be set up (Fig. 18.1a). A mass point or particle A
on it moves on a straight-line trajectory. Its equation of motion is Newton's first
law (law of inertia), or its extension to special relativity satisfying Lorentz invariance,
$dp/d\tau = m\,dv/d\tau = 0$, where $\tau$ is proper time. When there is a mass B, it
deforms the space as shown in Fig. 18.1b. The space is no longer flat, but particle
A's motion still follows a geodesic line (i.e. a straight line in the curved space).
However, projected on Cartesian coordinates it looks like a curved line. Then it is
interpreted as particle A's motion being affected by B's force bending its trajectory.
Gravity as a dynamical phenomenon between two massive objects, in Newton's
concept, has been replaced by a geometry of space-time, in general relativity.

Figure 18.1 Geometrical interpretation of gravity. (a) Space-time is like a stretched
rubber sheet. If there is no matter, it is flat and Cartesian coordinates can be set up.
A point mass particle A moves on a straight trajectory obeying the law of inertia.
(b) When there is a big mass B, the space is deformed (curved) and no Cartesian
coordinates can be set up. A's trajectory is still a geodesic line, namely a straight
line in the curved space, but projected on Cartesian coordinates it is curved and is
interpreted as dynamical bending of the trajectory of A attracted by B's gravity.
concept, has been replaced by a geometry of space-time, in general relativity. 

To describe the equation of motion, we need 1) to know how space-time is de- 
formed by the presence of matter and 2) to define geodesic lines in a curved space. 

We will see that the gauge force can also be interpreted geometrically. We con- 
sider components of complex matter fields as vector components in a coordinate 
frame in an internal space and gauge transformation as a rotation (or more gener- 
ally coordinate transformation) in the internal space. Then the force can be inter- 
preted as the effect of a curved hyperspace, the space-time and the internal space 
combined. The source of the gauge force is the (electric) charge, whose presence 
deforms hyperspace just as a mass deforms space-time. We interpret the motion of 
the particle as following a geodesic in hyperspace, which is curved when projected 
on space-time. 

18.2.3 

Parallel Transport and Connection 

In order to construct mathematical formulae for geodesic lines in a curved space, 
we need to understand the notion of parallel transport of a vector. Parallel trans- 
port is the action of moving a vector from one place to another without changing 
its magnitude or direction. Then a geodesic line is defined as a trajectory that can be 
obtained by parallel transport of its tangential vector. In space-time, where its metric
(equivalent of length in Euclidean space) 1) is defined, it can be shown that the
geodesic line is also the shortest path connecting the two points. Therefore, it is an 
extension of the concept of a straight line in Euclidean space. Let us start by defin- 
ing a straight line locally. If a line is represented by its direction (tangential vector), 
the definition of a straight line is that when its tangential vector is parallel trans- 
ported by an infinitesimal distance from x = (x, y) to x + dx = (x + dx, y + dy), 
it does not change its direction or, equivalently, components of the vector do not 
change. Translated to a particle's motion, it can be described by Newton's first law, 
that is, the particle does not change its direction when it has moved an infinitesimal 
distance in time dt. Thus a straight line is defined by the statement that the time 
derivative of the particle's spatial coordinates (velocity) stays constant (Fig. 18.2a): 

Straight line in Cartesian coordinates: $\frac{d^2\boldsymbol{x}}{dt^2} = 0$ or $\frac{d\boldsymbol{v}}{dt} = 0$ (18.11)

However, if we use curvilinear coordinates, the mathematical expression
Eq. (18.11) has to be modified. In order to see this, let us view the same line
in polar coordinates. This is a frame where two base vectors $(e_r, e_\theta)$ to define its
local coordinates are directed in the $r$ (radial) and $\theta$ (orthogonal to radial) directions

1) In Minkowski space, $ds^2 = \eta_{\mu\nu}dx^\mu dx^\nu = (dx^0)^2 - dx^2 - dy^2 - dz^2$, and
$\eta_{\mu\nu}$ is referred to as the metric tensor. In general relativity, the metric
tensor is a function of the space-time point $x$ and is denoted as $g_{\mu\nu}(x)$.

Figure 18.2 (a) A tangential vector of a straight line in Cartesian coordinates
has the same components everywhere. (b) In polar coordinates, while the
direction of the vector stays unchanged, its direction relative to the local
coordinate frame changes, so that its components are functions of the space point.
(c) A vector $V_A(x)$ at point A = $x$ is parallel transported to a point
B = $x + dx$ without changing its magnitude and direction. It is denoted as
$V_{A\parallel}(x + dx)$.



(Fig. 18.2b). The directions of the coordinate frame set up at space point $x = (r, \theta)$
point away from the origin ($e_r$) or are rotated by $\pi/2$ ($e_\theta$), and are inclined relative
to the Cartesian setting. To define a straight line locally, the change of base vectors
has to be compensated.

Let us find an expression for vector parallel transport. A vector $V_A(x)$ shown in
Fig. 18.2c at point $x = (r, \theta)$ has components $V_A = (V_r, V_\theta) = (V\cos\varphi, V\sin\varphi)$
in polar coordinates as well as in Cartesian coordinates. Note that we have
deliberately set the point A at $\theta = 0$ to make their components identical. When it
is parallel transported to point B, $x + dx = (r + dr, \theta + d\theta)$, we denote it as
$V_B = V_{A\parallel}(x + dx)$. Here, the base vectors of the polar coordinates at $x + dx$ are
inclined by an angle $d\theta$ relative to those at $x$. Components of $V_{A\parallel}$ are the same
everywhere as those of $V_A(x)$ in Cartesian coordinates, but in the polar coordinate
frame correction terms to compensate the directional change of the base vectors
have to be included:

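$V_{A\parallel r}(x + dx) = V_{Br} = V\cos(\varphi - d\theta) \simeq V\cos\varphi + d\theta\, V\sin\varphi = V_r + d\theta\, V_\theta$ (18.12a)

$V_{A\parallel\theta}(x + dx) = V_{B\theta} = V\sin(\varphi - d\theta) \simeq V\sin\varphi - d\theta\, V\cos\varphi = V_\theta - d\theta\, V_r$ (18.12b)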
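A quick numerical check of Eq. (18.12) may be useful here. The following Python sketch compares the exact components of the unchanged vector, as seen from base vectors rotated by $d\theta$, with the first-order expressions. The function names and the sample values of $V$, $\varphi$ and $d\theta$ are illustrative assumptions, not part of the text.

```python
import math

def transported_exact(V, phi, dtheta):
    """Components of the unchanged vector seen from base vectors rotated by dtheta."""
    return V * math.cos(phi - dtheta), V * math.sin(phi - dtheta)

def transported_first_order(V, phi, dtheta):
    """First-order form of Eq. (18.12): (V_r + dtheta*V_theta, V_theta - dtheta*V_r)."""
    V_r, V_theta = V * math.cos(phi), V * math.sin(phi)
    return V_r + dtheta * V_theta, V_theta - dtheta * V_r

V, phi = 2.0, 0.6  # hypothetical magnitude and direction angle
for dtheta in (1e-1, 1e-2, 1e-3):
    exact = transported_exact(V, phi, dtheta)
    approx = transported_first_order(V, phi, dtheta)
    err = max(abs(exact[0] - approx[0]), abs(exact[1] - approx[1]))
    # The discrepancy is of second order in dtheta.
    print(f"dtheta = {dtheta:g}: max component error = {err:.2e}")
```

The magnitude of the exact transported vector stays equal to $V$; only its components relative to the local frame change.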

This example clearly shows that vector components in the polar coordinates change 
their values when parallel transported. In differential geometry, the transported 
vector is generically expressed as 



$V_\parallel(x + dx) = V(x) - \Gamma_\nu(x)V(x)dx^\nu$ (18.13)



The correction term is assumed to be a linear transformation of the original vector
and proportional to the displacement if it is small. The assumption is justified in
our example Eq. (18.12). $\Gamma_\nu$ introduced in Eq. (18.13) is referred to as the
"connection". Note that $\Gamma_\nu$ is a space-time vector, as indicated by its subscript $\nu$,
but it is also a matrix acting on an object to be parallel transported. Since in
Eq. (18.13) $V$ itself is a space vector, $\Gamma_\nu$ here is an $n$-component vector whose
components are $n \times n$ matrices, where $n$ is the dimension of the space-time.

Figure 18.3 A great circle connecting Tokyo and San Francisco is a geodesic on the
earth's surface (sphere). It looks curved when projected onto Cartesian coordinates
in a flat […] a local orthogonal coordinate system […] surface, an airplane flying
along the great circle is being parallel transported. However, its direction with
respect to local coordinates changes incessantly.

In differential geometry (and in general relativity), where a metric is given, the
connection is a derived quantity expressed in terms of the derivatives of the metric
tensor $g_{\mu\nu}$.2) In Cartesian coordinates, all the coordinate axes are globally aligned
in the same direction and components of the parallel transported vector are the same
everywhere, namely, the connection vanishes identically. The polar coordinates we
described above are set up on a flat plane, hence they can always be transformed
back to the original Cartesian coordinates. In other words, in a flat space there
always exists a coordinate transformation that makes the connection vanish
everywhere. Since Cartesian coordinates can only be defined in a
straight and flat space, the existence of Cartesian coordinates is equivalent to the
space being Euclidean.
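To illustrate a connection that is nonzero only because of the choice of coordinates, the sketch below integrates the geodesic equation in plane polar coordinates and checks that the result is a straight line in Cartesian coordinates. It assumes the standard coordinate-basis connection of the flat plane, $\Gamma^r_{\theta\theta} = -r$ and $\Gamma^\theta_{r\theta} = 1/r$ (not the orthonormal base vectors used in the example above); the function names, step count and initial conditions are illustrative choices.

```python
import math

def geodesic_rhs(state):
    """Geodesic equations in plane polar coordinates.

    Assumes Gamma^r_{theta theta} = -r and
    Gamma^theta_{r theta} = Gamma^theta_{theta r} = 1/r.
    """
    r, th, vr, vth = state
    return (vr, vth, r * vth * vth, -2.0 * vr * vth / r)

def rk4_step(state, h):
    """One classical Runge-Kutta step of size h."""
    def shifted(s, k, f):
        return tuple(si + f * ki for si, ki in zip(s, k))
    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(state, k1, 0.5 * h))
    k3 = geodesic_rhs(shifted(state, k2, 0.5 * h))
    k4 = geodesic_rhs(shifted(state, k3, h))
    return tuple(s + h * (a + 2 * b + 2 * c + d) / 6.0
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

def integrate_geodesic(state, t_end, steps=2000):
    h = t_end / steps
    for _ in range(steps):
        state = rk4_step(state, h)
    return state

# Start at Cartesian (1, 0), moving in the +y direction with unit speed.
r, th, _, _ = integrate_geodesic((1.0, 0.0, 0.0, 1.0), t_end=2.0)
x, y = r * math.cos(th), r * math.sin(th)
print(x, y)  # the point stays on the straight line x = 1, with y close to 2
```

Although $r$ and $\theta$ both vary along the path, the trajectory is the straight line $x = 1$, as expected in a flat space.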

2) It is defined as

$\Gamma^{\mu}_{\nu\rho} = \frac{1}{2} g^{\mu\lambda}(\partial_\rho g_{\lambda\nu} + \partial_\nu g_{\lambda\rho} - \partial_\lambda g_{\rho\nu})$ (18.14)

In differential geometry, this is referred to as the Christoffel […] of the space
(see Footnote 1). […] are defined. The Christoffel connection is […] but not
identical to one of them because in differential geometry the base vectors change
their magnitude as well as their direction.

Next, we consider the earth's surface, where the coordinates are expressed in
terms of two angles (longitude, latitude). They are orthogonal coordinates every- 
where and locally (i.e., in a limited region) are almost identical to Cartesian coor- 
dinates (with suitable rescaling), but globally they are not. A geodesic line (great 
circle) connecting Tokyo and San Francisco is depicted in Fig. 18.3. The line's di- 
rection changes continuously relative to local coordinates. Here, also, parallel trans- 
port can be defined using the connection. But this time, no Cartesian coordinates 
can be set up on the sphere. Therefore there is no transformation to convert the 
(longitude, latitude) system to Cartesian coordinates or, equivalently, there are no 
transformations to make the connection vanish everywhere. Obviously, the con- 
nection contains information on the space itself as well as the characteristics of 
the coordinate system. This is expected because the connection is derived from the 
metric tensor which defines the structure of the spacetime. 

18.2.4 

Rotation in Internal Space 

We learned in Chap. 9 that in order to describe physical phenomena we have to 
set up a coordinate system but that the phenomena themselves do not depend 
on which coordinates are adopted. An experiment carried out in Japan or in Ger- 
many should produce the same result, which is referred to as translational sym- 
metry. From it, Noether's theorem derives energy-momentum conservation. Sim- 
ilarly, whether the experimental setup is directed north or east, it should produce 
the same result. This is rotational symmetry, from which angular momentum con- 
servation is derived. Contrary to these examples, charge and isospin conservation 
are derived from gauge symmetry (a phase transformation invariance), which has 
nothing to do with real space-time. But the above logic suggests that a coordinate 
system can be set up in internal space, too, and the origin of the conservation law 
can be considered as a result of the physical phenomena being independent of how 
we choose the internal coordinate system. 

The simplest example is a charged particle that has two degrees of freedom,
corresponding to positive and negative charge. Let us denote the field components as
$\psi_1, \psi_2$; the field can be considered as a two-component vector in a certain internal
space. Denoting the orthonormal base vectors in this space as $e_1, e_2$, we can express
the field as

$\psi = \psi_1 e_1 + \psi_2 e_2$ (18.15)

If the physical phenomena do not depend on how we set up the coordinates, the
laws of physics do not change if we rotate the coordinates. Denoting the transformed
$\psi$ as $\psi'$, by rotating the coordinate axes by an angle $\alpha$ we can write

$\psi_1' = \psi_1\cos\alpha + \psi_2\sin\alpha$, $\psi_2' = -\psi_1\sin\alpha + \psi_2\cos\alpha$ (18.16)

Introducing the complex field by

$\psi = (\psi_1 + i\psi_2)/\sqrt{2}$, $\psi^\dagger = (\psi_1 - i\psi_2)/\sqrt{2}$ (18.17)

we have

$\psi' = e^{-i\alpha}\psi$, $\psi^{\dagger\prime} = e^{i\alpha}\psi^\dagger$ (18.18)

The rotation of a real field in two-dimensional space reduces to a phase
transformation in a one-dimensional complex space. Charge conservation results from the
symmetry of the phase transformation, which is referred to as $U(1)$ gauge
symmetry.
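The equivalence between a rotation of the two real components and a phase transformation of the complex field can be checked numerically. In this sketch the rotation sign convention, $\psi_1' = \psi_1\cos\alpha + \psi_2\sin\alpha$ and $\psi_2' = -\psi_1\sin\alpha + \psi_2\cos\alpha$, is the one that reproduces $\psi' = e^{-i\alpha}\psi$ of Eq. (18.18); the function names and sample values are assumptions made for illustration.

```python
import cmath
import math

def rotate(psi1, psi2, alpha):
    """Rotate the internal coordinate axes by alpha (assumed sign convention)."""
    c, s = math.cos(alpha), math.sin(alpha)
    return psi1 * c + psi2 * s, -psi1 * s + psi2 * c

def to_complex(psi1, psi2):
    """Complex field of Eq. (18.17)."""
    return (psi1 + 1j * psi2) / math.sqrt(2.0)

psi1, psi2, alpha = 0.8, -0.3, 0.7  # hypothetical sample values
lhs = to_complex(*rotate(psi1, psi2, alpha))           # rotate, then combine
rhs = cmath.exp(-1j * alpha) * to_complex(psi1, psi2)  # combine, then apply the phase
print(abs(lhs - rhs))  # zero up to rounding
```

The modulus of the complex field is unchanged by the transformation, which is the statement that the rotation preserves the length of the internal vector.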

Now we imagine a hyperspace by combining (four-dimensional) space-time and
internal space and consider the field $\psi$ a vector in the hyperspace. If the
coordinates in the hyperspace are equivalents of the curvilinear coordinates, their
direction changes locally. If the rotation angle in the internal space depends on
space-time coordinates, $\alpha = \alpha(x)$, it is a local gauge transformation. Then in analogy to
Eq. (18.13) a parallel transported field in the hyperspace can be expressed as

$\psi_\parallel(x + dx) = \psi(x) - iqA_\nu(x)\psi(x)dx^\nu$ (18.19)

which introduces $A_\nu(x)$ as the connection. Here $x$ is used to denote space-time
variables in Minkowski space. As we will soon see, $A_\nu(x)$ is nothing but the gauge
potential. We use the terms (gauge) potential for $A_\mu$ and field strength for its
derivative, i.e. $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$, which is the electromagnetic field in $U(1)$
gauge symmetry. When we refer to $\psi$ we call it the matter field to distinguish it
from the gauge field. We have introduced the gauge potential not as a force field
but as a "connection", which is a geometrical object relating vectors in different
places. The equation is identical to Eq. (18.13) if we regard $\Gamma_\nu = iqA_\nu$ as an
operator acting on a vector ($\psi$) in an internal space. Unlike the Christoffel connection,
which acts on a space-time vector (hence has two extra space-time indices), the
connection $A_\nu$ has only one subscript in space-time and is a Lorentz vector because
the field $\psi$ in Eq. (18.19) is a scalar in space-time. The extraction of the $q$ factor
is for later convenience and the reason for introducing the imaginary number $i$ is
because the rotation in the internal space is a phase transformation.

Perhaps a simple example may lead to an intuitive understanding of the
introduction of $i$. Consider the parallel transport depicted in Fig. 18.2c as that of a vector
defined in an internal space but moved from $x$ to $x + dx$ in space-time. Then we
can introduce a vector in an internal complex plane

$V(x) = V_r + iV_\theta$

$V_\parallel(x + dx) = V_{\parallel r} + iV_{\parallel\theta} \overset{(18.12)}{=} V - i\,d\theta\,V = [1 - iA_\nu(x)dx^\nu]V$ (18.20a)

The interpretation of the gauge potential as the connection in a hyperspace means
the gauge potential connects the locally rotating coordinates in the internal space 
and contains in itself the properties of the hyperspace. Note, however, no metric is 
defined in the internal space. Unlike the Christoffel connection, which is derived 
from the metric, A v as the connection is a starting point for the discussion that 
follows. The assumption is only justified by overall consistency. 



18.2.5

Curvature of a Space

Considering the role of the connection, we realize that it should contain informa- 
tion about the characteristics (or "curvedness") of a curved space. Let us consider 
how we can extract such information from the connection. Parallel transport of a 
vector in a flat space is easy to understand intuitively, but that in a curved space
takes some getting used to. Let us consider parallel transport of vectors on a sphere 
(in real space). How a vector moves on the sphere can be estimated by painting 
many parallel arrows on a flat plane and copying them on the sphere by rolling the 
sphere on it. Figure 18.4 illustrates two examples of vector transport. 



Figure 18.4 Deficit angle (panel labels: deficit angle $\alpha_1$, deficit angle $\alpha_2$).
(a) A vector denoted by outline type starting from the north pole (P) is parallel
transported along the lines P → Q → R → P, where Q, R are points on the equator
90° apart. When it comes back to where it started, its direction is $\pi/2$ different,
which is referred to as the deficit angle. The total curvature is defined as the
deficit angle in radians and the average curvature by dividing it by the area
enclosed by the loop, which is $(\pi/2)/(4\pi R^2/8) = 1/R^2$. Local curvature is
obtained by making the closed loop infinitesimally small.
ABCDE is a line along latitude = 30° (points A and E are identical). It is not a
great circle, and the […] vector, when transported along the latitude 30° line,
changes its direction relative to the local (longitude, latitude) coordinates as
depicted. When it circles around and comes back its direction is opposite to the
original direction. This can be explained by making a plane tangential to the line
ABCDE (b) and spreading it out on a plane, which is shown in (c). The vector's
direction is the same everywhere, but it is not the same relative to the line ABCDE.
As the area of the sphere above the latitude 30° is $\pi R^2$ and the deficit angle is
$\pi$, the curvature is again $1/R^2$.

A conspicuous characteristic of a curved space is that if a vector is parallel
transported along a closed loop, its final direction is different from what it was initially.
The difference is referred to as the "deficit angle". As the bending of the space 
grows, the deficit angle becomes larger. The total curvature of an area is the deficit 
angle expressed in radians and the average curvature is defined as that divided by 
the area. Local curvature is defined as the average curvature with the closed loop 
taken to be infinitesimal. 

As an example, let us consider a sphere with radius $R$. In Fig. 18.4a, a vector
denoted by a white (outline typeface) arrow at the north pole is parallel transported
to the equator (P → Q) along a longitude line, then moved along the equator to the
east as far as 90° in longitude (Q → R), and finally back to the north pole (R → P).
The vector direction is shifted by $\pi/2$ relative to the original. The area surrounded
by the closed line P → Q → R → P is $\pi R^2/2$. Therefore, the average curvature is
$(\pi/2)/(\pi R^2/2) = 1/R^2$.
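The octant loop can be reproduced with a few rigid rotations. The sketch below relies on the fact that parallel transport of a tangent vector along a great-circle arc of a sphere is the same as rotating it rigidly about the axis of that great circle. The choice of axes (P on the z axis, Q on the x axis, R on the y axis) and all function names are assumptions made for illustration.

```python
import math

def rot_x(v, a):
    x, y, z = v
    return (x, y * math.cos(a) - z * math.sin(a), y * math.sin(a) + z * math.cos(a))

def rot_y(v, a):
    x, y, z = v
    return (x * math.cos(a) + z * math.sin(a), y, -x * math.sin(a) + z * math.cos(a))

def rot_z(v, a):
    x, y, z = v
    return (x * math.cos(a) - y * math.sin(a), x * math.sin(a) + y * math.cos(a), z)

quarter = math.pi / 2.0
P = (0.0, 0.0, 1.0)   # north pole on a unit sphere
v0 = (1.0, 0.0, 0.0)  # tangent vector at P, pointing toward Q

# P -> Q along a meridian, Q -> R along the equator, R -> P along a meridian.
point, v = rot_y(P, quarter), rot_y(v0, quarter)
point, v = rot_z(point, quarter), rot_z(v, quarter)
point, v = rot_x(point, quarter), rot_x(v, quarter)

dot = sum(a * b for a, b in zip(v0, v))
deficit = math.acos(max(-1.0, min(1.0, dot)))  # angle between initial and final vector

def average_curvature(R):
    """Deficit angle divided by the octant area 4*pi*R^2/8."""
    return deficit / (4.0 * math.pi * R * R / 8.0)

print(deficit, average_curvature(2.0))  # pi/2 and 1/R^2 = 0.25
```

The point returns to P while the vector comes back rotated by $\pi/2$, and dividing by the octant area reproduces $1/R^2$.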

For the case of parallel transport along a line at latitude 30°, one can see the 
vector rotates relative to the local coordinates as it moves. How it moves can be 
understood by spreading the tangential plane on a flat plane. As the plane tangen- 
tial to the 30° latitude line is a cone, one can see that the parallel arrows on the 
plane rotate relative to the latitude line (Fig. 18.4c). Points A and E are identical. As 
is clear from the figure, the two vectors at A and E are parallel on the spread-out 
sheet, but in opposite directions on the sphere, hence the deficit angle is $\pi$.
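The transport around the 30° latitude circle can also be followed numerically. The sketch below integrates the parallel transport equations in components along the local south and east unit vectors, assuming the standard connection of a sphere in (colatitude, longitude) coordinates. The function names, the Runge-Kutta integrator and the step count are illustrative choices.

```python
import math

def transport_along_latitude(lat_deg, steps=2000):
    """Parallel transport a unit tangent vector once around a latitude circle.

    (a, b) are components along the local south and east unit vectors.
    Assumes the standard sphere connection, for which
    da/dphi = cos(theta0) * b and db/dphi = -cos(theta0) * a,
    with theta0 the colatitude and phi the longitude.
    """
    k = math.cos(math.radians(90.0 - lat_deg))

    def rhs(a, b):
        return (k * b, -k * a)

    a, b = 1.0, 0.0
    h = 2.0 * math.pi / steps
    for _ in range(steps):  # classical fourth-order Runge-Kutta
        k1 = rhs(a, b)
        k2 = rhs(a + 0.5 * h * k1[0], b + 0.5 * h * k1[1])
        k3 = rhs(a + 0.5 * h * k2[0], b + 0.5 * h * k2[1])
        k4 = rhs(a + h * k3[0], b + h * k3[1])
        a += h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        b += h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return a, b

def deficit_angle(lat_deg):
    """Rotation of the vector relative to the local frame after one loop (mod 2*pi)."""
    a, b = transport_along_latitude(lat_deg)
    return math.atan2(b, a) % (2.0 * math.pi)

def cap_area(lat_deg, R):
    """Area of the sphere above the given latitude."""
    return 2.0 * math.pi * R * R * (1.0 - math.sin(math.radians(lat_deg)))

print(transport_along_latitude(30.0))           # close to (-1, 0): direction reversed
print(deficit_angle(30.0) / cap_area(30.0, 1.0))  # average curvature, close to 1/R^2 = 1
```

For latitude 30° the vector returns pointing in the opposite direction, and the ratio of the deficit angle to the enclosed area is again $1/R^2$; the same ratio results for other latitudes, for example 60°.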

In a similar manner, we can define curvature in a hyperspace by parallel
transporting the field $\psi$ along a closed loop and by calculating the deficit angle. Let us
consider the case when the field is transformed $\psi \to \psi + \delta\psi$ by circulating along
a loop defined as the periphery of an infinitesimal surface element $dS = dx^\mu dx^\nu$
(Fig. 18.5):

$\delta\psi = -iq\oint A_\mu\psi\, dx^\mu = -iq\int_{dS} [\nabla\times A]_{n\perp dS}\,\psi\, dx^\mu dx^\nu \simeq -iqF_{\mu\nu}\psi\, dx^\mu dx^\nu$ (18.21a)

$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ (18.21b)

where Stokes' theorem was used to convert the loop integral into the surface
integral. The rotation angle $\Delta\alpha$ is given by

$e^{-i\Delta\alpha}\psi = \psi + \delta\psi \simeq (1 - i\Delta\alpha)\psi$, $\therefore\ \Delta\alpha = i\,\delta\psi/\psi$ (18.22)




Figure 18.5 Integral path for c[…]

Therefore the curvature in the hyperspace is given by

$\Delta\alpha/dS = qF_{\mu\nu}$ (18.23)

This shows that the electromagnetic field is nothing but the curvature of a
hyperspace. The Maxwell equation [see Eq. (3.49)]

$\partial_\mu F^{\mu\nu} = qj^\nu$ (18.24)

can be interpreted as the defining equation of the curvature.

The Electromagnetic Field is the Curvature of a Hyperspace 

The existence of the electric charge (and its current) deforms the hyperspace 
and its curvature is defined by the Maxwell equation. 
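The relation between the phase acquired around a small loop and the field strength can be checked with a simple line integral. In the sketch below the two-dimensional potential, the charge $q$ and the loop position are hypothetical choices made for illustration; the rotation angle is taken as $\Delta\alpha = q\oint A_\mu dx^\mu$, which follows from Eqs. (18.21a) and (18.22).

```python
def potential(x, y):
    """Hypothetical 2D gauge potential (A_x, A_y) with F_xy = 1 + 2x."""
    return (-0.5 * y, 0.5 * x + x * x)

def field_strength(x, y):
    """F_xy = d_x A_y - d_y A_x for the potential above (computed by hand)."""
    return 1.0 + 2.0 * x

def loop_phase(q, x0, y0, eps, n=200):
    """Rotation angle q * (closed line integral of A.dx), counter-clockwise
    around a square of side eps with lower-left corner (x0, y0)."""
    h = eps / n
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * h
        total += potential(x0 + s, y0)[0] * h          # bottom edge, +x direction
        total += potential(x0 + eps, y0 + s)[1] * h    # right edge, +y direction
        total -= potential(x0 + s, y0 + eps)[0] * h    # top edge, -x direction
        total -= potential(x0, y0 + s)[1] * h          # left edge, -y direction
    return q * total

q, x0, y0, eps = 1.5, 0.4, -0.2, 1e-3  # hypothetical charge and loop
curvature = loop_phase(q, x0, y0, eps) / eps**2
expected = q * field_strength(x0 + eps / 2.0, y0 + eps / 2.0)
print(curvature, expected)  # the two values agree
```

The deficit angle per unit area equals $q$ times the field strength at the loop, which is the sense in which the field strength acts as a curvature.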



Let us compare it with the equation of general relativity (Einstein's equation):

$R_{\mu\nu} - \frac{1}{2}Rg_{\mu\nu} = 8\pi G_N T_{\mu\nu}$ (18.25)

where $G_N$ is the gravitational constant, and $R_{\mu\nu}$ and $R$ $(= g^{\mu\nu}R_{\mu\nu})$ on the
left-hand side are referred to as Ricci's tensor and its trace. Ricci's tensor is derived
by contracting the Riemann tensor, the curvature of space-time.4) This, in turn, is
derived from the Christoffel connection by the same procedure as we used to derive
the electromagnetic field. We remind the reader that the exact mathematical formula
in differential geometry is not important; rather, tracing the logical flow and trying
to understand cause and result is. As $T_{\mu\nu}$ on the rhs is the energy-momentum
tensor, Einstein's equation says that the energy distribution in space-time determines
the curvature of space-time. One recognizes complete parallelism in the formalism of
the electromagnetic and gravity equations.

4) Riemann's and Ricci's curvature tensors are expressed as

$R^\rho{}_{\sigma\mu\nu} = -[\partial_\mu\Gamma_\nu - \partial_\nu\Gamma_\mu + (\Gamma_\mu\Gamma_\nu - \Gamma_\nu\Gamma_\mu)]^\rho{}_\sigma$

$R_{\mu\nu} = R^\rho{}_{\mu\rho\nu}$, $R^{\mu\nu} = g^{\mu\rho}g^{\nu\sigma}R_{\rho\sigma}$, $R = R^\mu{}_\mu = g^{\mu\nu}R_{\mu\nu}$ (18.26)

The third term in the square brackets, which is nonlinear in $\Gamma$, stems from the
nonabelian […] of the Christoffel connection. Notice it […] gauge field
[see Eq. (18.5?) in Sect. 18.4.3].

18.2.6

Covariant Derivative

A free-field equation is generally a second-order differential equation, which is
generically expressed as

$\Lambda(\partial_\mu)\psi(x) = 0$ (18.27)

where $\Lambda(\partial)$ is a polynomial of $\partial$ up to quadratic terms. We need to define the
differential operator in a general coordinate frame. When a field ip(x) is defined at 
all space-time points, its derivative is normally defined as 

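$\frac{\partial\psi(x)}{\partial x^\mu} = \lim_{\Delta x\to 0}\frac{\Delta\psi}{\Delta x^\mu} = \lim_{\Delta x\to 0}\frac{\psi(x + \Delta x) - \psi(x)}{\Delta x^\mu}$ (18.28)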

For a vector field, the above formula applies to each component. The difference
for a vector field as compared to a scalar field is that, when making the difference $\Delta\psi$
between $\psi(x + \Delta x)$ and $\psi(x)$, the feet of both vectors have to be at the same
space-time point. In the Cartesian coordinate system, one need not worry about
this, because the direction of a vector or equivalently all components of the vector
are the same anywhere in space. This is not true in curvilinear coordinates,
because the same vector has different components at different space points. In this
case, the vector at $x$ has to be parallel transported to $x + \Delta x$ without changing its
magnitude and direction (Fig. 18.6). We already know the expression for a vector
parallel transported from $x$ to $x + \Delta x$. It is given in Eq. (18.13). To differentiate $\psi$,
one has to make a difference $\Delta\psi$ by subtracting $\psi_\parallel(x + \Delta x)$ from $\psi(x + \Delta x)$ as
shown in Fig. 18.6.

$D_\mu\psi(x) = \lim_{\Delta x\to 0}\frac{\psi(x + \Delta x) - \psi_\parallel(x + \Delta x)}{\Delta x^\mu} = \partial_\mu\psi(x) + \Gamma_\mu(x)\psi(x)$ (18.29a)

which is referred to as the covariant derivative. Here, we treated $\psi$ as a vector in
space-time and used Eq. (18.13) to define the derivative as given in differential
geometry. Similarly, to differentiate $\psi$ in the curved hyperspace, we use Eq. (18.19).
Then its derivative can be defined as

$D_\mu\psi(x) = \lim_{\Delta x^\mu\to 0}\frac{\psi(x^\mu + \Delta x^\mu) - \psi_\parallel(x^\mu + \Delta x^\mu)}{\Delta x^\mu} = [\partial_\mu + iqA_\mu(x)]\psi(x)$ (18.29b)

which is precisely the "covariant derivative" we defined in Eq. (18.9). The
covariant derivative contains the connection, which contains the characteristics of the
coordinate system as well as the structure of the space.
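The reason for the name "covariant" can be seen in a one-dimensional numerical check. The sketch below assumes the convention $\psi \to e^{-iq\alpha(x)}\psi$ together with $A \to A + \partial\alpha$ (the sign and the placement of $q$ are assumptions chosen to be consistent with $D = \partial + iqA$); the sample functions for $\psi$, $A$ and $\alpha$ are hypothetical. Derivatives are taken by central differences.

```python
import cmath
import math

q = 1.0  # hypothetical charge

def psi(x):
    return cmath.exp(2.0j * x) * (1.0 + 0.3 * x)  # hypothetical matter field

def A(x):
    return 0.5 * math.sin(x)                      # hypothetical gauge potential

def alpha(x):
    return 0.7 * x * x                            # hypothetical local gauge function

def deriv(f, x, h=1e-5):
    """Central finite-difference derivative."""
    return (f(x + h) - f(x - h)) / (2.0 * h)

def cov_deriv(field, pot, x):
    """D psi = d psi/dx + i q A psi, the one-dimensional form of Eq. (18.29b)."""
    return deriv(field, x) + 1j * q * pot(x) * field(x)

def psi_gauged(x):
    return cmath.exp(-1j * q * alpha(x)) * psi(x)  # assumed transformation of psi

def A_gauged(x):
    return A(x) + deriv(alpha, x)                  # assumed transformation of A

x0 = 0.8
phase = cmath.exp(-1j * q * alpha(x0))
# The covariant derivative picks up only the local phase factor ...
cov_mismatch = abs(cov_deriv(psi_gauged, A_gauged, x0) - phase * cov_deriv(psi, A, x0))
# ... whereas the ordinary derivative acquires an extra term from d(alpha)/dx.
plain_mismatch = abs(deriv(psi_gauged, x0) - phase * deriv(psi, x0))
print(cov_mismatch, plain_mismatch)
```

The first number is zero up to discretization error, while the second is of order $q\,|\partial\alpha\,\psi|$, so an equation written with $D_\mu$ keeps its form under the local transformation.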

According to differential geometry, in curvilinear coordinates the same equation 
holds anywhere in the space if one uses covariant derivatives instead of ordinary 
differential operators. Phrased in a different way, an equation expressed in terms 
of covariant derivatives keeps the same form under any general coordinate trans- 
formation provided a suitable connection is used. This is covariance under general 
coordinate transformation or the principle of general relativity. The principle itself 
is a matter of geometry and has no physical substance. The physics lies in identify- 
ing the space-time nature has chosen and the equation of motion in that space. The 
former is realized by Einstein's equation and the equation of motion is obtained 
from the principle of equivalence. We already have the equivalent of the Einstein 
equation in the hyperspace. Therefore, our next task is to translate the equivalence 
principle to the language of the gauge theory.

Figure 18.6 Parallel transport and covariant derivative. Parallel transport of a
vector in curvilinear coordinates is defined by using the connection $\Gamma_\mu$. $\psi(x)$
parallel transported to $x + \Delta x$ is denoted as $\psi_\parallel(x + \Delta x)$. The derivative in
curvilinear coordinates is defined as
$\lim_{\Delta x\to 0}[\psi(x + \Delta x) - \psi_\parallel(x + \Delta x)]/\Delta x$.

18.2.7 

Principle of Equivalence 

The principle of equivalence identifies gravity as the inertial force in an
accelerating system. In an inertial frame, there is no gravity and the equation of motion
is described by Newton's law ($dp/dt = 0$) or its extension to special relativity
($dp/d\tau = 0$, $\tau$ = proper time). Since the equation expressed in terms of
covariant derivatives keeps the same form if transformed to an accelerating frame, the
equation of motion under the gravitational field is expressed as



Principle of Equivalence

Eq. of motion in inertial frame        Eq. of motion in curved space

$\frac{dp^\mu}{d\tau} = 0$ → $\frac{Dp^\mu}{d\tau} = 0$ (18.30)



This means that the trajectory that a point particle follows is a geodesic line. We 
have now presented the mathematical formula of what we described pictorially 
in Fig. 18.1. A particle A under the effect of another mass B follows the trajec- 
tory of a geodesic line. In order for the above equation to hold, one has to verify 
the existence of the inertial frame in an accelerating frame. Thinking physical- 
ly, it is always possible by getting in an elevator and cutting the suspension wire 
(Fig. 18.7a). The elevator goes into free fall and no gravity exists inside it. If one 
feels bad about getting in a falling object, a satellite would provide the same inertial 
frame. In mathematical language, there is always a transformation that makes the 
connection vanish at a point. The vanishing of the connection means the existence
of Cartesian (Minkowskian in the case of space-time) coordinates, which, in turn,
means an inertial frame. Notice that a local inertial frame means the vanishing of
the connection at a point mathematically, but physically it includes its immediate
neighborhood as long as the higher order terms in the Taylor expansion can be
neglected. If one considers a wide area such as that depicted in Fig. 18.7b, one sees
two mass points A and B come closer as time elapses, which means the area is not
an inertial frame.

Figure 18.7 Principle of equivalence. (a) In a gravity field, space-time is curved,
but there always exists a coordinate transformation where "the connection" vanishes
at a point. Physically, it can be realized by entering an elevator and cutting its
suspension wire. The equation of motion in the elevator is that of the inertial
frame, i.e. $dp/d\tau = 0$, where $\tau$ is proper time. Outside the elevator it is
obtained by replacing the derivative with its covariant version and transforming back
to the original accelerating frame. (b) An illustration of the fact that the inertial
frame is only local: in a wide area containing A and B attracted by the earth (a
spherical gravitational center), one sees two mass points are attracted to each other
without any force, which means the area is not an inertial frame.

What is meant by "local"? For radius < 10 m, a circular accelerator can be built 
assuming the earth is flat. But when the radius of the acceleration circle is on the 
order of kilometers, the curvature of the earth's surface can no longer be neglected 
in aligning magnets. Here, locality is limited only to ~ 100 m. Do the dynamics of 
galaxy formation or even the formation of a cluster of galaxies need to take into ac- 
count cosmic curvature effect? Here locality applies to a region as large as millions 
of light years. Locality changes depending on the kind of observables we are talking 
about. 
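A rough estimate shows why the scale of about 100 m is reasonable for magnet alignment. The sketch below assumes a spherical earth of radius 6371 km and uses the small-distance approximation $L^2/(2R)$ for the separation between the surface and a tangent plane at distance $L$ from the point of contact. The function name is an illustrative choice, and the numbers indicate orders of magnitude only; actual alignment tolerances depend on the machine.

```python
EARTH_RADIUS_M = 6.371e6  # assumed mean radius of the earth in meters

def tangent_drop(distance_m, radius_m=EARTH_RADIUS_M):
    """Approximate separation between a sphere and its tangent plane at a
    horizontal distance distance_m from the point of contact: L^2 / (2R)."""
    return distance_m ** 2 / (2.0 * radius_m)

for L in (10.0, 100.0, 1000.0):
    print(f"L = {L:6.0f} m  ->  deviation = {tangent_drop(L) * 1e3:.4f} mm")
```

Under these assumptions the deviation is a few micrometers at 10 m, somewhat below a millimeter at 100 m, and several centimeters at 1 km.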

Going back to particle field theories, in the absence of an interaction, the mat- 
ter field equation can be derived from free Lagrangians. We already know them, 
the Klein-Gordon, Dirac or Maxwell (or Proca if massive) equations for spin 0, 1/2 
or 1 fields. They are the equivalent of the law of inertia in gravity. General rela- 
tivity means covariance of the equation under general coordinate transformation. 
In gauge theory, it means covariance under local gauge transformation. In both 
cases it is achieved if the equation is written in terms of the covariant derivatives.
The principle of equivalence tells us that by replacing the derivatives in the free
Lagrangian by the covariant derivatives, the Lagrangian in curved space, namely, 
that of interacting fields, is obtained. This is precisely the process of gauging we 
mentioned before. 



Principle of Equivalence in Gauge Theory

Eq. of free field        Eq. of interacting field

$\Lambda(\partial_\mu)\psi(x) = 0$ → $\Lambda(D_\mu)\psi(x) = 0$ (18.31)



The existence of an inertial frame in general relativity is translated to gauge
theory as follows. No matter what internal coordinate [gauge = phase $\alpha(x)$] is used,
there always exists a coordinate (gauge) transformation such that the connection
(gauge potential) vanishes at at least one point. The requirement is expressed as

$A_\mu(x) \to A'_\mu(x) = A_\mu + \partial_\mu\alpha(x) = 0$ (18.32)

Let us see if there is a gauge transformation that makes the gauge potential vanish at
every space-time point. The condition that Eq. (18.32) can be solved is for $\alpha$ to be
integrable, namely

$\partial_\mu\partial_\nu\alpha(x) = \partial_\nu\partial_\mu\alpha(x)$ (18.33)

Combining this with Eq. (18.21b) gives

$F_{\mu\nu} = \partial_\mu A_\nu(x) - \partial_\nu A_\mu(x) = 0$ (18.34)

Namely, the nonexistence of the electromagnetic field is the condition for the
vanishing of the gauge potential everywhere. Since, in hyperspace, the electromagnetic
field represents its curvature, and the vanishing of the gauge potential everywhere
means a flat hyperspace, it is logical to require the electromagnetic field to vanish
identically.
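The condition can be illustrated with two hypothetical two-dimensional potentials: a pure-gauge one, $A_\mu = -\partial_\mu\alpha$, which the transformation of Eq. (18.32) removes everywhere, and one with constant field strength, which no gauge transformation can remove. The gauge function, the potentials and the function names below are assumptions made for illustration; the field strength is evaluated by central differences.

```python
import math

def grad_alpha(x, y):
    """Gradient of the hypothetical gauge function alpha = sin(x)*y + 0.5*x*y**2."""
    return (math.cos(x) * y + 0.5 * y * y, math.sin(x) + x * y)

def A_pure(x, y):
    """Pure-gauge potential A_mu = -d_mu alpha."""
    gx, gy = grad_alpha(x, y)
    return (-gx, -gy)

def A_uniform(x, y, B=1.0):
    """Potential with constant field strength F_xy = B."""
    return (-0.5 * B * y, 0.5 * B * x)

def field_strength(A, x, y, h=1e-4):
    """F_xy = d_x A_y - d_y A_x by central differences."""
    dAy_dx = (A(x + h, y)[1] - A(x - h, y)[1]) / (2.0 * h)
    dAx_dy = (A(x, y + h)[0] - A(x, y - h)[0]) / (2.0 * h)
    return dAy_dx - dAx_dy

def gauge_transformed(A, x, y):
    """A'_mu = A_mu + d_mu alpha, as in Eq. (18.32)."""
    ax, ay = A(x, y)
    gx, gy = grad_alpha(x, y)
    return (ax + gx, ay + gy)

x0, y0 = 0.3, -1.2
print(field_strength(A_pure, x0, y0))     # close to 0: the potential can be gauged away
print(gauge_transformed(A_pure, x0, y0))  # (0.0, 0.0)
print(field_strength(A_uniform, x0, y0))  # close to 1: no gauge choice removes it
```

Because the field strength is unchanged by adding a gradient, a nonzero value such as that of the second potential survives every gauge transformation.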



General Relativity and Gauge Theory 

Let us rephrase the theory of general relativity in the language of gauge theories. 

In an inertial frame, Lorentz (or more generally Poincare) invariance holds. 
The Lorentz transformation [Eq. (3.76)] has the form of a phase transformation,
and contains 6 (10) parameters, the velocity and rotation angle (and
displacement). They are constants and the transformation could be dubbed a
global gauge transformation. If the transformation is made local, i.e. if the
parameters are functions of space-time points, it is a general coordinate trans- 
formation. The principle of general relativity is rephrased as invariance under 
local Poincare transformation. It requires that the equation of motion does not 
change its form. The principle of general relativity is nothing but invariance 
under local gauge transformation. 



A big difference between gravitational theory and gauge theory is that, in gauge
theory, parallel transport is performed in an internal space and the connection has
only one subscript as far as space-time components are concerned. This is why
the gauge field is a Lorentz vector. Hence the gauge particle has spin s = 1. Even if the
hyperspace is curved, the Lorentz invariance is preserved. But in gravity, application
of a general coordinate transformation breaks Lorentz invariance, which means the
space-time is curved. Since the object to be parallel transported is a Lorentz vector,
the (Christoffel) connection has three indices for space-time components: one as
a vector in space-time, two as a matrix acting on the space-time vector to be parallel
transported. Since the connection for gravity is derived as derivatives of the metric
tensor of rank 2 that gives the gravitational potential, the graviton, the quantum of
the gravitational field, has spin 2. As a summary of what we have described so far,
we list correspondences between gauge and gravity fields in Table 18.1.

Table 18.1 Correspondence between gauge theory and general relativity.

| | Gauge Theory | General Relativity |
|---|---|---|
| Symmetry | | |
| frame | internal space | space-time |
| coordinate trf. | gauge trf. | Poincare trf. |
| invariants | charge | … |
| eq. motion | KG/Dirac eq. a): $\Lambda(\partial_\mu)\psi = 0$ | inertial law: $dp^\mu/d\tau = 0$ |
| Local symmetry (curved space) | | |
| coordinate trf. | local gauge trf. | local Poincare trf. or general coordinate trf. |
| connection | gauge potential $iqA_\mu(x)$ | Christoffel connection $\Gamma_\mu = \Gamma^\lambda{}_{\nu\mu}$ |
| parallel trpt. | $\psi - iqA_\mu\psi\,dx^\mu$ | $p^\lambda - \Gamma^\lambda{}_{\mu\nu}p^\mu dx^\nu$ |
| cov. derivative | $D_\mu = \partial_\mu + iqA_\mu$ | $D_\mu p^\lambda = \partial_\mu p^\lambda + \Gamma^\lambda{}_{\mu\nu}p^\nu$ |
| curvature | electromag. field $F_{\mu\nu}$ | Riemann's curvature $R^\lambda{}_{\mu\nu\rho}$ |
| origin of curv. | charge | … |
| field equation | Maxwell eq. $\partial_\mu F^{\mu\nu} = qj^\nu$ | Einstein eq. $R^{\mu\nu} - \frac{1}{2}Rg^{\mu\nu} = 8\pi G_N T^{\mu\nu}$ |
| eq. motion | $\Lambda(D_\mu)\psi = 0$ | $Dp^\mu/d\tau = 0$ |
| equivalence | potential can vanish locally | $\Gamma_\mu$ can vanish locally (existence of local inertial frame) |
| total curv. = 0 (flat space) | $A_\mu = 0$ globally | existence of Cartesian coordinates, $\Gamma_\mu = 0$ globally |

a) KG = Klein-Gordon



Are All the Fundamental Forces Geometrical? 

The geometrical interpretation of the gauge force we have presented is not 
rigorous but by no means accidental. In 1956, Utiyama proved that general 
relativity is a kind of gauge theory [376]. There is also Kaluza-Klein theory, 
which was proposed in 1921. Working in five-dimensional space-time, they 
showed that the electromagnetic force can be derived from general relativity 
as the force produced by the energy-momentum in the fifth dimension and 
projected in four-dimensional space-time. In their model, the fifth dimension 
is not infinitely long but curled to form a cylinder, making the space trans- 
lation in the fifth dimension periodic, which is essentially an act of gauge 
transformation. The electromagnetic potential appears as the metric tensor 
connecting the fifth dimension with ordinary four-dimensional space-time. 
The electric charge is proportional to the fifth component of the momentum. 
Just as all three forces - weak, electromagnetic and strong - are described 
by local gauge theories based on $SU(3)_{\rm color} \times SU(2)_L \times U(1)_Y$ symmetry, it is
known that quantum gravity can be obtained by gauging the supersymmetry 
that connects fermions and bosons. This is referred to as supergravity. How- 
ever, a self-consistent method of quantizing the gravitational field is not yet 
known. One idea that is considered promising is to regard elementary parti- 
cles not as points but as strings. Superstring theory tries to unify all four forces 
by applying the gauge principle to supersymmetric strings in ten-dimensional 
space-time. It is also a generalized Kaluza-Klein theory, extending general rel- 
ativity to larger dimensions. Whether the extra six-dimensional space is curled 
up as Kaluza and Klein suggested or whether we live in a brane world float- 
ing in a vast ten-dimensional space is a hot topic currently being discussed. A 
possible way of detecting the extra dimension has been suggested and could 
be tested by the Large Hadron Collider (LHC) experiment.

Aharonov-Bohm Effect



The gauge potential has a freedom of gauge transformation, which casts doubt on 
it being an observable. In classical electromagnetism, it is merely a mathematical 
entity, useful but of no physical significance. The electromagnetic field $F_{\mu\nu}$, on 
the other hand, is gauge invariant and physical. However, in quantum mechanics, 
in describing the electromagnetic interaction in the Schrödinger equation, we use 
$A^\mu = (\phi/c, \mathbf{A})$, which suggests the potential plays a fundamental role. In fact, in 
quantum mechanics, $A^\mu$ is an observable and plays a more fundamental role than 
the electromagnetic field. Its role was suggested in the geometrical interpretation 
of the gauge fields, but is most clearly demonstrated by the Aharonov-Bohm effect 
[8]. 5) It is the effect that the gauge potential causes in a well-known double slit 
experiment with electrons (see Fig. 18.8). 

We first describe the electron wave's behavior without an electromagnetic field. 
The wave function at point R can be obtained from that at P by the following pro- 
cedure. Starting from P, electrons reach R either through slit Q or through slit S. 
Denote the two paths as $C_1$, $C_2$, the two wave functions at R as $\psi_1$, $\psi_2$ and the 
phases that $\psi_1$, $\psi_2$ have acquired as $\alpha_1$, $\alpha_2$. The particle distribution at R is given 
by 

$$|\psi|^2 = |\psi_1 + \psi_2|^2 = |\psi_2|^2\,\bigl|1 + \exp\{-i(\alpha_1 - \alpha_2)\}\,|\psi_1/\psi_2|\bigr|^2 \tag{18.35}$$

From the expression, one sees that the absolute value of the phase is not an observ- 
able but its difference is. Next, we place a confined solenoidal magnetic field (per- 
pendicular to the plane) at a place far away from the electron path. If the solenoid 
is made slim and long enough that no magnetic field leaks out, the electrons never 
feel the field. Then classically the solenoid should have no effect on the electron 
path. However, observation shows that the interference pattern changes when the 
field is turned on. The effect is referred to as the Aharonov-Bohm effect. 

The reason is as follows. Even if the field strength vanishes, the potential need 
not. The condition is $\mathbf{B} = \nabla \times \mathbf{A}(\mathbf{x}) = 0$, which restricts the allowed functional 
form of the potential outside the solenoid as 

$$\mathbf{A}(\mathbf{x}) = -\nabla f(\mathbf{x}) \tag{18.36a}$$

Turning on the magnetic field produces the gauge potential, which transforms the 
wave function to 

$$\psi_i \to e^{-iqf_i(\mathbf{x})}\,\psi_i(\mathbf{x}) \tag{18.36b}$$

Here, we attached suffix i to differentiate phases picked up by electrons in the 
two paths. One should note that this is the requirement of the gauge principle. 

Figure 18.8 (a) Aharonov-Bohm effect. A double-slit experiment can demonstrate 
clearly the wave nature of the electron beam. If a confined solenoidal magnetic 
field is placed far away from the electron paths, the interference pattern changes. 
Although the magnetic field vanishes outside the solenoid, the potential does not. 
This is clear proof of the gauge potential affecting the electron's motion. 
(b) Magnetic flux quantization. When there is a superconducting ring current, the 
magnetic field is repelled from the superconductor and the flux inside the ring is 
quantized, showing another role of the gauge potential. 



Turning on the magnetic field means the gauge potential is produced by a gauge 
transformation that accompanies the corresponding phase transformation in the 
matter field. Compare Eqs. (18.6) and (18.7). Solving Eq. (18.36a) for $f(\mathbf{x})$ we can 
express $f_i$ as a line integral 

$$f_i(\mathbf{x}) = -\int_{C_i} \mathbf{A}\cdot d\mathbf{x}' \tag{18.36c}$$



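Substituting Eqs. (18.36b) and (18.36c) into Eq. (18.35), we have a modified electron 
distribution with the magnetic field on: 

$$|\psi|^2 = |\psi_2|^2\,\Bigl|1 + \exp\Bigl\{-iq\oint \mathbf{A}\cdot d\mathbf{x} - i(\alpha_1 - \alpha_2)\Bigr\}\,|\psi_1/\psi_2|\Bigr|^2 \tag{18.36d}$$

$$\oint \mathbf{A}\cdot d\mathbf{x} \equiv \int_{C_2}\mathbf{A}\cdot d\mathbf{x} - \int_{C_1}\mathbf{A}\cdot d\mathbf{x} \tag{18.36e}$$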



$\oint$ is a line integral along a closed loop made by $C_1$ and $C_2$. If there is no magnetic 
field inside the loop, the integral vanishes and the interference pattern does not 
change. But if there is a field somewhere inside the loop, the magnetic flux in the 
loop does not vanish. Namely, 

$$\Phi = \int \mathbf{B}\cdot\mathbf{n}\,dS = \oint \mathbf{A}\cdot d\mathbf{x} \neq 0 \tag{18.37}$$

where Stokes' theorem was used in going from the second to the third expression. 
Despite the fact that the electron never passes a region where the magnetic field 
exists, it perceives the field and the interference pattern changes. The prediction of 
Eq. (18.36d) has been experimentally confirmed [303, 306]. The Aharonov-Bohm 
effect with an electric field is also observed [377]. Since the electron never sees the 
magnetic field, it must be the gauge field that is acting on the electron. Thus the 
Aharonov-Bohm effect has clearly established that the gauge potential is real and 
physical and that the electromagnetic field $F_{\mu\nu}$ alone is not enough to describe 
the electromagnetic phenomena. Since $F_{\mu\nu}$ can be derived from the potential by 
differentiation, the potential is a physical object more fundamental than the field. 

Note that this shows that the gauge potential itself is not an observable, but its 
difference is. When it manifests itself as an observable, the gauge invariance is 
maintained. Notice that the line integral defined in Eq. (18.37) is gauge invariant even if the 
potential itself can be varied locally. 
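
As a numerical illustration of Eq. (18.37) and of the resulting fringe shift, the following sketch integrates the vector potential around two square loops, one enclosing an idealized solenoid and one not. It is only a sketch under stated assumptions: the exterior potential is taken to have the standard thin-solenoid form $|\mathbf{A}| = \Phi/(2\pi r)$ in the azimuthal direction, SI units are used so that the phase reads $q\Phi/\hbar$, the two paths carry equal amplitudes, and the flux $h/2e$ is chosen purely for illustration. All function names are hypothetical.

```python
import cmath
import math

HBAR = 1.054571817e-34      # reduced Planck constant in J s
E_CHARGE = 1.602176634e-19  # elementary charge in C


def vector_potential(x, y, flux):
    """Assumed exterior potential of a thin solenoid on the z axis.

    A points in the azimuthal direction with magnitude flux / (2 pi r),
    so its curl vanishes everywhere away from the axis.
    """
    r2 = x * x + y * y
    factor = flux / (2.0 * math.pi * r2)
    return -factor * y, factor * x


def loop_integral(corners, flux, steps=2000):
    """Midpoint-rule line integral of A . dx around a closed polygon."""
    total = 0.0
    for k in range(len(corners)):
        x0, y0 = corners[k]
        x1, y1 = corners[(k + 1) % len(corners)]
        dx, dy = (x1 - x0) / steps, (y1 - y0) / steps
        for s in range(steps):
            ax, ay = vector_potential(x0 + (s + 0.5) * dx,
                                      y0 + (s + 0.5) * dy, flux)
            total += ax * dx + ay * dy
    return total


def intensity(delta_alpha, ab_phase):
    """Relative intensity |1 + exp(-i(ab_phase + delta_alpha))|^2, equal amplitudes."""
    return abs(1.0 + cmath.exp(-1j * (ab_phase + delta_alpha))) ** 2


flux = math.pi * HBAR / E_CHARGE                  # h / 2e, an illustrative choice
enclosing = [(-1, -1), (1, -1), (1, 1), (-1, 1)]  # solenoid inside the loop
excluding = [(2, -1), (4, -1), (4, 1), (2, 1)]    # solenoid outside the loop

loop_in = loop_integral(enclosing, flux)
loop_out = loop_integral(excluding, flux)
phase_in = E_CHARGE * loop_in / HBAR              # Aharonov-Bohm phase q * flux / hbar

print("loop integral / flux (enclosing):", loop_in / flux)
print("loop integral / flux (excluding):", loop_out / flux)
print("central intensity, field off:", intensity(0.0, 0.0))
print("central intensity, field on :", intensity(0.0, phase_in))
```

Although the curl of this potential vanishes along both loops, only the loop that surrounds the solenoid picks up the flux, and with this particular flux the central fringe turns from a maximum into a minimum.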

Another interesting example related to the Aharonov-Bohm effect is the super- 
conducting ring current in the presence of a magnetic field [Fig. 18.8]. Owing to 
the Meissner effect, the magnetic field cannot exist in the superconductor. How- 
ever, the electron wave function acquires a phase given by Eq. (18.37) every time 
it goes around the ring. The requirement that the wave function be single valued 
forces the magnetic flux to be quantized: 

$$\Phi = 2\pi\frac{\hbar}{2e}\,n = 2.067833636\times 10^{-15}\ \mathrm{Wb}\times n, \qquad n = 0, \pm 1, \pm 2, \cdots \tag{18.38}$$

This is also experimentally confirmed [118, 218] and has wide applications, for 
example in the Josephson junction. The factor two in the denominator is related 
to the fact that the supercurrent is produced by condensed Cooper pairs, which 
carry charge $-2e$. 6) 
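
The size of the flux unit in Eq. (18.38) can be checked with a few lines of arithmetic. The sketch below assumes the exact values of $h$ and $e$ fixed by the 2019 SI; the function name and its parameter are hypothetical. The result, about $2.0678338\times10^{-15}$ Wb, agrees with the figure quoted in Eq. (18.38) to roughly one part in $10^7$; the remaining difference presumably reflects an earlier set of recommended constants.

```python
PLANCK_H = 6.62607015e-34   # Planck constant in J s (exact in the 2019 SI)
E_CHARGE = 1.602176634e-19  # elementary charge in C (exact in the 2019 SI)


def flux_quantum(carrier_charge_in_e=2):
    """Flux unit h / (N e) for carriers of charge N e; N = 2 for Cooper pairs."""
    return PLANCK_H / (carrier_charge_in_e * E_CHARGE)


phi0 = flux_quantum()          # superconducting flux quantum h / 2e
phi_single = flux_quantum(1)   # h / e, the unit single-electron carriers would give

print(f"h/2e = {phi0:.9e} Wb")
print(f"h/e  = {phi_single:.9e} Wb")
```

Comparing the two printed values makes the role of the factor two explicit: a measured unit of $h/2e$ rather than $h/e$ is what points to paired carriers.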

In the geometrical interpretation of the gauge potential, the Aharonov-Bohm ef- 
fect is easy to understand. The magnetic flux is the total curvature of the closed area. 
Consider a cone with its apex rounded (see Fig. 18.9a). The round area is where the 
magnetic field exists and the space (two-dimensional plane in the figure) is curved, 
but the other area is flat in the sense that the local curvature vanishes. This is eas- 
ily seen by cutting the cone and spreading it out on a flat plane (Fig. 18.9b). The 
side plane of the cone is tangential to the flat plane and local curvature vanishes 
everywhere. Both paths ABC and ADE are straight lines in the flat space locally and 
the direction keeps its original value. Namely, the electron on the path never experiences 
the magnetic field, but at point C (= E) where the two paths meet, there is 
a deficit angle. Geometrically, the local curvature always vanishes, but the total curvature 
does not if the loop surrounds the apex area. If the loop does not contain the 
apex area, the total curvature, hence the deficit angle, vanishes. The electron per- 
ceives the total curvature, which is equivalent to feeling the electromagnetic force, 
as the global behavior of the space geometry. 

Notice that the gauge potential is nonzero outside the solenoid. As we stated in 
Sect. 18.2.7, the potential can be made to vanish everywhere only when the 
electromagnetic field vanishes identically. If it exists somewhere, no matter how far 
separated, it deforms the space and no Cartesian coordinates can exist. The geodesic 
line in the curved space looks bent if projected on the Cartesian coordinates, 
meaning the electron perceives the electromagnetic force. 

6) The Cooper pair consists of two electrons with opposite momentum and spin. They 
attract each other through phonon (the quantum of lattice vibration) exchange and 
make a bound state, which is the Cooper pair. Normally, thermal motion destroys 
the bound states, but below a critical temperature a great number of Cooper pairs 
are produced (Bose-Einstein condensation) and act coherently, making a quantum fluid 
of electrons. It is an example of spontaneous symmetry breaking, which will be discussed 
later in this chapter. 

Figure 18.9 An example where the effect of the gauge potential $A_\mu$ is manifest. The 
electromagnetic field is interpreted as the curvature of a hyperspace. A particle passing where 
no magnetic field exists (a flat plane) feels its effect when the space is globally curved like 
the surface of a cone. The two particles' trajectories are ABC and ADE, and when they meet 
at C (= E), their direction is different. 

Considering the role of the gauge potential as the connection, which is locally 
defined but also contains global information about the entire space, the connec- 
tion's controlling power must prevail throughout the whole space. Therefore, its 
quantum must have vanishing mass, otherwise its control would not reach far be- 
yond its Compton wavelength. Thus, the geometrical interpretation of gauge theory 
provides a natural explanation of the vanishing mass of the force quantum. 



Problem 18.1 

Derive the vector potential outside a solenoid of radius a. 

An Experiment to Prove the Aharonov-Bohm Effect The Aharonov-Bohm effect 
touches fundamental issues in quantum mechanics. Experimental tests of the 
Aharonov-Bohm effect were carried out immediately after their proposal (for a 
review see [303]) and confirmed it. However, with the emergence of the gauge 
principle as a successful axiom underlying all the forces, many subtle theoretical 
as well as experimental questions were raised as to the validity of the effect. The- 
oretically, locality, reality of the gauge potential and the single valuedness of the 
wave function, etc. were at issue. A major complaint on the experimental side was 
that the effect can be explained by leakage of the magnetic field. 

Here, we introduce an experiment to prove the Aharonov-Bohm effect that re- 
sponded to the criticism [306, 367, 368]. A solenoid or magnetized whiskers, how- 
ever long and confined, have an outer field in order to close the flow of the magnet- 
ic flux. Consequently, the experiment was carried out on a toroidal magnet using 
electron holography. A schematic layout for the hologram formation and its 
reconstruction method are depicted in Fig. 18.10a,b. 

Figure 18.10 (a) Schematic diagram of the electron optical system for hologram 
formation. An image of the specimen is formed on a photographic film. A biprism is 
employed to cause the objective wave and reference wave to overlap and form a 
hologram. (b) Schematic diagram of hologram reconstruction. Collimated laser light 
is split into two beams and both illuminate the hologram. One (ABC path) reconstructs 
the image, which is superposed with the other transmitted through the ADC path 
(comparison beam) to produce an interference micrograph. Figures adapted from [306]. 

A specimen (toroidal magnet) 
was illuminated by an electron beam. Its image was magnified and recorded on 
a photographic film. Only half of the beam was used. The other half was used as 
a reference beam, which was overlapped with the image by a biprism to form the 
hologram. To reconstruct the original image from the hologram, two laser beams 
separated by a half-silvered mirror were used. One, which goes along the path ABC 
in the figure, reconstructs the image, focused on the observation plane. Note that 
the beam splits into three after going through the hologram, one is transmitted 
straight forward and the two others are the image beam and its conjugate emitted 
at angles tilted symmetrically to the transmitted one. The other beam (path ADC) 
was used as the comparison beam. It also splits into three, only the transmitted 
beam is superposed on the observation plane to form the interference images. Two 
types of interference images were used: a contour map and an interferogram. The 
former is obtained by setting the reconstructed and transmitted waves parallel, as 
illustrated in Fig. 18.10b. The latter, i.e. the interferogram, is obtained by tilting the 
two waves to each other (~ 10 -3 rad). 

Figure 18.11a is a contour map illustrating magnetic flux lines. There, a constant 
flux flow of $h/e$ lies between two adjacent lines. An interferogram showing wave 
fronts and a schematic illustration are given in Fig. 18.11b and c. The photograph 
reveals that phase differences exist between two electron beams that have passed 
through the inner and outer spaces of a toroidal magnet, where there are no magnetic 
fields. In addition, from (b), the phase displacement was counted as 5.52, 
agreeing with the theoretical value of 62 to within 20%. However, this interferogram 
is still taken with part of the electrons going through the magnetic field. 

Figure 18.11 (a-c) An electron beam illuminated the toroid from above. (a) A contour 
map showing magnetic flux lines. (b) Interference pattern showing the retarded phase 
of the electrons going through the hole. (c) Illustration of the wave front. 
(d) Interference pattern made with a Permalloy toroid wrapped with superconducting Nb 
to prevent magnetic flux leakage. The bridge stretching upward is a thermal contact to 
control the temperature of the toroid [367, 368]. 

The third picture was taken with a toroid completely shielded by superconducting 
material. The toroid is made of ferromagnetic Permalloy sandwiched between 
two SiO$_2$ substrates and surrounded by niobium (Nb). Below the transition 
temperature $T_c$, the niobium becomes superconducting and repels the magnetic field 
completely by the Meissner effect, thus eliminating it inside the toroid. Its 
presence, moreover, quantizes the magnetic flux $\Phi = n(h/2e)$, $n$ = integer. Shielding 
makes the interference stripes invisible on the toroid, and only relative 
displacements of wave fronts modulo the wavelength can be detected. For the 
superconducting phase ($T < T_c$), the phase shift is exactly 0 for $n$ even and $\pi$ for $n$ odd. The 
picture shows a case for $n$ odd ($\pi$). One can see the inside stripe is exactly in the 
middle of two outside stripes. 

Thus, the experiment has established the Aharonov-Bohm effect firmly and con- 
sequently the physical role of the gauge potential. If it is not accepted, nonlocality 
of the magnetic field has to be argued. The existence of quantized flux touches 
another fundamental issue in quantum mechanics, i.e., single valuedness of wave 
functions. 



18.4 

Nonabelian Gauge Theories 



Isospin Operator 

Up to now, we have regarded the internal space as a one-dimensional complex 
space. Our arguments in Sect. 18.2 can be extended directly to an n-dimensional 
complex space. The major difference is that for $n \geq 2$ two consecutive coordinate 
transformations ($C = AB$) are different if the order is reversed ($C = BA \neq AB$). 
Because of this, the field strength and its equation become nonlinear, making 
mathematical manipulations complicated. Extension of the formalism to $n = 2$ 
complex space is to introduce $SU(2)$ (isospin) symmetry. In Chap. 9, we introduced 
isospin of the strong interaction by considering $u \leftrightarrow d$ symmetry or, more precisely, 
by requiring the equation of motion to be invariant for unitary transformations: 



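$$\psi = \begin{pmatrix} u \\ d \end{pmatrix} \to \psi' = U\psi \tag{18.39}$$

Without loss of generality, the unitary matrix can be expressed as 

$$U = \exp\Bigl(-i\sum_a \alpha_a t_a\Bigr) \tag{18.40}$$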



which has the form of phase transformations. In general, $SU(n)$ transformation 
can be expressed in this form using $n^2 - 1$ traceless hermitian matrices called the 
generators of the group. When $t_a$ are noncommutative, the representation 
matrices $U$ are noncommutative, too, and the group is referred to as nonabelian. The 
group characteristics are essentially 7) determined by their commutation relation 
(Lie algebra), which is expressed as 

$$[t_a, t_b] = \sum_c i f_{abc}\,t_c \equiv i f_{abc}\,t_c \tag{18.41}$$

where $f_{abc}$ are completely antisymmetric constants referred to as structure 
constants of the group. For the case of $SU(2)$, $t_a = \tau_a/2$ holds, where $\tau_a$ are Pauli 
matrices and 

$$f_{abc} = \epsilon_{abc} \qquad SU(2) \tag{18.42}$$

7) Meaning those of local nature. Those of global nature, such as connectedness, may be different. 
For instance, $SU(2)$ and $SO(3)$ share the same structure constants but there is 1:2 correspondence 
between the two (see Sect. A.2 in Appendix A). 

In an N-dimensional representation space spanned by N-component vectors, $U$ 
are $N \times N$ unitary matrices. When $N = n$, the matrices are referred to as the 
fundamental representation, and if $N = n^2 - 1$ they are referred to as the adjoint 
representation. In the following, we limit our discussion to $SU(2)$, but conclusions 
are applicable to any $SU(n)$. 
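
The commutation relation (18.41) with the $SU(2)$ structure constants of Eq. (18.42) can be verified directly in the fundamental ($N = 2$) representation. The following is a minimal sketch in plain Python; the helper names are hypothetical, and the generators are taken to be $t_a = \tau_a/2$ with the usual Pauli matrices.

```python
PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)
# Generators in the fundamental (N = 2) representation, t_a = tau_a / 2.
T = tuple(tuple(tuple(x / 2 for x in row) for row in tau) for tau in PAULI)


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested sequences."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def commutator(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(2)] for i in range(2)]


def epsilon(a, b, c):
    """Levi-Civita symbol for indices 0, 1, 2."""
    return (a - b) * (b - c) * (c - a) // 2


def trace(a):
    return a[0][0] + a[1][1]


def max_algebra_violation():
    """Largest entry of [t_a, t_b] - i eps_abc t_c over all a, b."""
    worst = 0.0
    for a in range(3):
        for b in range(3):
            lhs = commutator(T[a], T[b])
            for i in range(2):
                for j in range(2):
                    rhs = sum(1j * epsilon(a, b, c) * T[c][i][j]
                              for c in range(3))
                    worst = max(worst, abs(lhs[i][j] - rhs))
    return worst


print("max violation of [t_a, t_b] = i eps_abc t_c:", max_algebra_violation())
print("traces of the generators:", [trace(t) for t in T])
```

The same check applies to any other representation once its $N \times N$ generators are supplied, since only the structure constants enter.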

If the Lagrangian is invariant under $SU(2)$ transformation of the form Eq. (18.40), 
where $\alpha_a$'s ($a = 1 \sim 3$) are constants, Noether's theorem predicts 
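a conserved isospin current for the matter field with isospin $I$: 

$$J^\mu_a(x) = \bar\psi(x)\gamma^\mu t_a\psi(x) = \sum_{i,j=1}^{N}\bar\psi_i\gamma^\mu [t_a]_{ij}\psi_j \tag{18.43}$$

where $N = 2I + 1$ and $\psi$ is a vector in isospin space. The generators $t_a$ are $N \times N$ 
matrices. The space integral of Eq. (18.43) 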

$$I_a = \int d^3x\, J^0_a(x) \tag{18.44}$$

defines a conserved operator, which is referred to as isospin. In the following, we 
sometimes refer to it as isospin charge, meaning it generates the force field just as 
the electric charge, which is a conserved observable, generates the electromagnetic 
force field. Note that a field having $I = 0$ does not generate a force field just as 
the neutral charge does not generate the electric field. The isospins are operators 
to generate isospin rotations ($SU(2)$ phase transformations), i.e. 

$$e^{i\boldsymbol\alpha\cdot\mathbf{I}}\,\psi\,e^{-i\boldsymbol\alpha\cdot\mathbf{I}} = \exp[-i\boldsymbol\alpha\cdot\mathbf{t}]\,\psi \tag{18.45}$$

which is the global gauge transformation of $SU(2)$. The formalism we developed in 
the previous section can be directly applied. Since S U(2) transformation is math- 
ematically almost identical to three-dimensional space rotation, our analogy of the 
gauge transformation to a rotation in the internal space is intuitively more appeal- 
ing. By gauging the S U(2) transformation, or equivalently replacing the derivatives 
with the covariant derivatives, we can obtain the Lagrangian of interacting fields. 



Gauge Potential 

From now on, we consider matter fields belonging to I = 1/2 or the fundamental 
representations of S U(2), as is the case for the weak interaction of quarks and lep- 
tons, though the following discussions are valid for any isospin value. The gauge 
potential of $SU(2)$ can be introduced by directly applying Eq. (18.19). Since there 
are three ($n^2 - 1$ in general) parameters in Eq. (18.40) for gauge transformation of 
the matter field, we need three gauge potentials to keep the local gauge 
transformation invariant. This requirement restricts the form of the gauge potential. All 
we have to do is to replace $A_\mu(x)$ by 

$$A_\mu(x) \to [A_\mu(x)]_{ij} = \sum_a A_{a\mu}(x)[t_a]_{ij} = \mathbf{A}_\mu\cdot[\mathbf{t}]_{ij} \tag{18.46}$$

which are $2 \times 2$ matrices acting on the two-component matter field $\psi$. The 
infinitesimal parallel transport of the matter field is defined as 

$$\psi_\parallel(x + dx) = \psi(x) - igA_\mu\,dx^\mu\,\psi = \psi - ig\,\mathbf{A}_\mu\cdot\mathbf{t}\,dx^\mu\,\psi$$
$$\text{or in components}\quad \psi_{\parallel i}(x + dx) = \psi_i(x) - ig\,\mathbf{A}_\mu(x)\cdot[\mathbf{t}]_{ij}\,\psi_j(x)\,dx^\mu \tag{18.47}$$

where g is the coupling constant corresponding to e in QED. The corresponding 
covariant derivative can be defined as 

$$[D_\mu]_{ij} = [\partial_\mu + igA_\mu]_{ij} = \delta_{ij}\partial_\mu + ig\,\mathbf{A}_\mu\cdot[\mathbf{t}]_{ij} \tag{18.48}$$

The gauge transformation of the potential can be obtained by the requirement that 
it keeps the equation of motion invariant when the matter field undergoes a gauge 
transformation 

$$\psi(x) \to \psi'(x) = U(x)\psi(x) = \exp[-ig\,\boldsymbol\alpha(x)\cdot\mathbf{t}]\,\psi(x) \tag{18.49}$$

where we have extracted the factor $g$ from the phase for later convenience. If we 
denote the gauge-transformed covariant derivative of $D_\mu$ as $D'_\mu$, the Lagrangian is 
kept invariant if 

$$(D_\mu\psi)' = D'_\mu\psi' = D'_\mu(U\psi) = U(D_\mu\psi) \qquad \therefore\quad D'_\mu = U D_\mu U^{-1} \tag{18.50}$$

which defines the transformation property of the gauge field: 

$$A'_\mu = U A_\mu U^{-1} - \frac{i}{g}\,U\,\partial_\mu U^{-1} \tag{18.51}$$

Setting $U$ as an infinitesimal transformation ($\alpha$ small), we have 

$$A'_{a\mu} = A_{a\mu} + g f_{abc}\,\alpha_b A_{c\mu} + \partial_\mu\alpha_a(x) + \cdots \tag{18.52}$$

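which does not depend on the representation matrices $t_a$. Adopting vector notation 
in the internal space, we can express the above relation also as 

$$\mathbf{A}'_\mu = \mathbf{A}_\mu + g\,\boldsymbol\alpha\times\mathbf{A}_\mu + \partial_\mu\boldsymbol\alpha(x) + \cdots, \qquad (\boldsymbol\alpha\times\mathbf{A}_\mu)_a = \sum_{b,c} f_{abc}\,\alpha_b A_{c\mu} \tag{18.53}$$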

Since, in quantized field theory, the gauge potential is a field on an equal footing 
with matter fields, we refer to the gauge potential as the gauge field here. The 
meaning of the equation is that the gauge field is rotated in the internal space, 
which means that the gauge field can be phase transformed by the isospin operator 
given in Eq. (18.44). This is possible only when the gauge field carries isospin, 
hence the gauge field itself is the source of the force. This feature did not exist 
in $U(1)$ gauge symmetry. In other words, while the photon is neutral and cannot 
generate the electromagnetic field by itself, the $SU(2)$ gauge field carries isospin 
charge and can be the source of the force field. The isospin of the gauge field is 
$I = 1$ since there are three components. As the action of the force is represented 
by coupling to the gauge field in the Lagrangian, the interaction of the $SU(2)$ gauge 
field is nonlinear, which is the conspicuous characteristic of nonabelian gauge theories. 
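
The statement that the gauge field is rotated in the internal space can be checked numerically. The sketch below takes a constant parameter $\boldsymbol\alpha$, so that the derivative term drops out, builds $U = \exp[-ig\,\boldsymbol\alpha\cdot\mathbf{t}]$ in closed form for the $I = 1/2$ representation, and compares the isospin components of $U A_\mu U^{-1}$ with $\mathbf{A}_\mu + g\,\boldsymbol\alpha\times\mathbf{A}_\mu$, the first-order result that follows from the transformation $A_\mu \to U A_\mu U^{-1}$. The numerical values and helper names are illustrative assumptions.

```python
import math

PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested sequences."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(a):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]


def dot_t(vec):
    """Matrix vec . t with generators t_a = tau_a / 2."""
    return [[sum(vec[a] * PAULI[a][i][j] for a in range(3)) / 2
             for j in range(2)] for i in range(2)]


def su2_exp(vec):
    """Closed form exp(-i vec . t) = cos(|vec|/2) - i sin(|vec|/2) n . tau."""
    theta = math.sqrt(sum(v * v for v in vec))
    if theta == 0.0:
        return [[1.0, 0.0], [0.0, 1.0]]
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[(c if i == j else 0.0)
             - 1j * s * sum(vec[a] / theta * PAULI[a][i][j] for a in range(3))
             for j in range(2)] for i in range(2)]


def components(m):
    """Coefficients m_a of m = m_a t_a, using Tr(t_a t_b) = delta_ab / 2."""
    out = []
    for a in range(3):
        prod = matmul(m, dot_t([1.0 if b == a else 0.0 for b in range(3)]))
        out.append((2 * (prod[0][0] + prod[1][1])).real)
    return out


def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


g = 0.65                       # illustrative coupling
alpha = [2e-4, -1e-4, 3e-4]    # small constant gauge parameters
A = [0.7, -0.2, 1.3]           # isospin components of one A_mu

U = su2_exp([g * a for a in alpha])   # U = exp(-i g alpha . t)
A_exact = components(matmul(matmul(U, dot_t(A)), dagger(U)))
A_first_order = [A[a] + g * cross(alpha, A)[a] for a in range(3)]

print("U A U^-1       :", A_exact)
print("A + g alpha x A:", A_first_order)
```

The two printed vectors differ only at second order in $\alpha$, which is the sense in which the infinitesimal transformation is a rotation of the isovector $\mathbf{A}_\mu$.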



Isospin Force Field and Equation of Motion 

The curvature of the isospin hyperspace can be obtained by the same procedure as 
for $F_{\mu\nu}$ in QED. But this time the gauge fields are matrices and noncommutative, 
which means the order of parallel transport has to be chosen carefully. Let us move 
the matter field in the $x = x^\mu$, $y = x^\nu$ plane and take the same infinitesimally 
small loop as was done for the electromagnetic field (Fig. 18.5). 
Omitting variables other than $x$, $y$, we have 

1. Parallel transport from P → Q: 

$$\psi_\parallel(x + dx, y) = [1 - igA_x(x, y)\,dx]\,\psi(x, y) \tag{18.54a}$$

2. Parallel transport from Q → R: 

$$\psi_\parallel(x + dx, y + dy) = [1 - igA_y(x + dx, y)\,dy]\,\psi_\parallel(x + dx, y) \tag{18.54b}$$

Denoting the parallel-transported $\psi$ via route P → Q → R as $\psi_{PQR}$, we find 

$$\begin{aligned}\psi_{PQR} &= [1 - igA_y(x + dx, y)\,dy]\,[1 - igA_x(x, y)\,dx]\,\psi(x, y)\\ &\simeq [1 - ig\{A_y(x, y)\,dy + A_x(x, y)\,dx + \partial_x A_y(x, y)\,dx\,dy\} - g^2 A_y(x, y)A_x(x, y)\,dx\,dy]\,\psi(x, y)\end{aligned} \tag{18.55}$$

The transport R → S → P can be obtained from the transport P → S → R by 
changing the sign of the path and the transport P → S → R by exchanging $x \leftrightarrow y$ 
in $\psi_{PQR}$ in Eq. (18.55). Namely, 

$$\psi_{PQRSP} = [1 - ig\{\partial_x A_y - \partial_y A_x + ig(A_x A_y - A_y A_x)\}\,dx\,dy]\,\psi(x, y) \tag{18.56}$$

Comparing this with the defining equation of the curvature Eq. (18.21a), we obtain 
the field strength 

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu + ig[A_\mu, A_\nu] = \frac{-i}{g}[D_\mu, D_\nu] \tag{18.57}$$



The second equality can be proved by actually performing the arithmetic of the 
commutator. It is an operator relation, but one sees that it does not contain any 
derivative after taking the commutator. For an abelian group $[A_\mu, A_\nu] = 0$ and 
Eq. (18.57) reduces to the electromagnetic field. 
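
The role of the commutator term can be made concrete by transporting a doublet around a small square with constant potentials, for which the derivative terms of Eq. (18.57) vanish and only $ig[A_x, A_y]$ survives. The sketch below assumes $t_a = \tau_a/2$, uses exact $SU(2)$ exponentials for each side of the square, and compares the result with $1 - igF_{xy}\,dx\,dy$. The values of $g$, the side length and the isospin components are arbitrary illustrative choices, and the helper names are hypothetical.

```python
import math

PAULI = (
    ((0, 1), (1, 0)),
    ((0, -1j), (1j, 0)),
    ((1, 0), (0, -1)),
)


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested sequences."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dot_t(vec):
    """Matrix vec . t with generators t_a = tau_a / 2."""
    return [[sum(vec[a] * PAULI[a][i][j] for a in range(3)) / 2
             for j in range(2)] for i in range(2)]


def su2_exp(vec):
    """Closed form exp(-i vec . t) = cos(|vec|/2) - i sin(|vec|/2) n . tau."""
    theta = math.sqrt(sum(v * v for v in vec))
    if theta == 0.0:
        return [[1.0, 0.0], [0.0, 1.0]]
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    return [[(c if i == j else 0.0)
             - 1j * s * sum(vec[a] / theta * PAULI[a][i][j] for a in range(3))
             for j in range(2)] for i in range(2)]


def cross(u, v):
    return [u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0]]


g = 0.7
d = 1e-3                 # side length of the square loop
a = [0.3, -1.1, 0.5]     # isospin components of a constant A_x
b = [0.8, 0.2, -0.4]     # isospin components of a constant A_y


def step(vec, length):
    """Finite parallel transport exp(-i g length vec . t) along one side."""
    return su2_exp([g * length * v for v in vec])


# P -> Q -> R -> S -> P: +x, +y, -x, -y; later steps multiply from the left.
loop = matmul(step(b, -d), matmul(step(a, -d), matmul(step(b, d), step(a, d))))

# Constant potentials: F_xy = ig[A_x, A_y] = -g (a x b) . t, hence
# 1 - i g F_xy d^2 = 1 + i g^2 d^2 (a x b) . t.
commutator_part = dot_t(cross(a, b))
predicted = [[(1.0 if i == j else 0.0) + 1j * g**2 * d**2 * commutator_part[i][j]
              for j in range(2)] for i in range(2)]

deviation = max(abs(loop[i][j] - predicted[i][j])
                for i in range(2) for j in range(2))
shift = max(abs(loop[i][j] - (1.0 if i == j else 0.0))
            for i in range(2) for j in range(2))

print("largest |loop - 1|              :", shift)
print("largest |loop - (1 - i g F d^2)|:", deviation)
```

If `a` and `b` are chosen parallel, the cross product vanishes and the loop transport returns to the identity, which is the abelian limit mentioned above.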

Problem 18.2 

Derive the second equality of Eq. (18.57). 

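$$[A_\mu, A_\nu] = [A_{a\mu}t_a, A_{b\nu}t_b] = i f_{abc}A_{a\mu}A_{b\nu}\,t_c = i(\mathbf{A}_\mu\times\mathbf{A}_\nu)\cdot\mathbf{t} \tag{18.58}$$

$F_{\mu\nu}$ can also be expanded in terms of $t_a$: 

$$\mathbf{F}_{\mu\nu} = \partial_\mu\mathbf{A}_\nu - \partial_\nu\mathbf{A}_\mu - g\,\mathbf{A}_\mu\times\mathbf{A}_\nu \tag{18.59a}$$

The gauge transformation of $F_{\mu\nu}$ can be obtained from Eq. (18.57) 

$$F'_{\mu\nu} = \frac{-i}{g}[D'_\mu, D'_\nu] = \frac{-i}{g}\,U[D_\mu, D_\nu]\,U^{-1} = U F_{\mu\nu} U^{-1} \tag{18.59b}$$

As $F_{\mu\nu}$ is a matrix, $F'_{\mu\nu} \neq F_{\mu\nu}$, i.e. the nonabelian field strength $F_{\mu\nu}$ is not gauge 
invariant. When one wants to construct a gauge-invariant Lagrangian, one can use 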



To obtain the derivative $D_\lambda F_{\mu\nu}$, one can choose the path depicted in Fig. 18.12. 
The result is 




Figure 18.12 Integral path for calculating cc 



Then the equivalent of the Maxwell equation becomes 

$$D_\mu F^{\mu\nu} = \partial_\mu F^{\mu\nu} + ig[A_\mu, F^{\mu\nu}] = gJ^\nu \tag{18.63}$$

or, separating the representation matrix, 

$$\partial_\mu\mathbf{F}^{\mu\nu} - g\,\mathbf{A}_\mu\times\mathbf{F}^{\mu\nu} = g\,\mathbf{J}^\nu \tag{18.64}$$

which can also be obtained from the Lagrangian 

$$\mathcal{L}_{SU(2)} = \bar\psi(i\gamma^\mu D_\mu - m)\psi - \frac{1}{4}\mathbf{F}_{\mu\nu}\cdot\mathbf{F}^{\mu\nu} \tag{18.65}$$

Problem 18.3 
Derive Eq. (18.62). 

Problem 18.4 

Derive Eq. (18.64) from the Lagrangian in Eq. (18.65). 

Substituting Eq. (18.62) in the Jacobi identity 

$$[D_\mu, [D_\nu, D_\lambda]] + [D_\nu, [D_\lambda, D_\mu]] + [D_\lambda, [D_\mu, D_\nu]] = 0 \tag{18.66}$$

we have the $SU(2)$ equivalent of the sourceless Maxwell equation (3.49b): 

$$D_\mu F_{\nu\lambda} + D_\nu F_{\lambda\mu} + D_\lambda F_{\mu\nu} = 0 \tag{18.67}$$

Thus, we have derived an $SU(2)$ gauge-invariant Lagrangian and the associated 
equations of motion. The Lagrangian Eq. (18.65) has exactly the same form as that 
of QED, except that the matter field $\psi$ is an $I = 1/2$ doublet and the gauge field 
$A_\mu$ is a set of $2 \times 2$ matrices acting on $\psi$ and an $I = 1$ triplet. The major difference lies in 
that $F_{\mu\nu}$ contains a nonlinear (quadratic) term and hence the Lagrangian contains 
up to quartic terms in the gauge fields. 

So far we have discussed only $SU(2)$ gauge theory. Extension to general $SU(n)$ 
is straightforward. The only difference is that there exist $n^2 - 1$ generators and 
the same number of gauge fields and that the structure constant $\epsilon_{abc}$ has to be 
replaced by $f_{abc}$. We summarize the equations of motion for the $I = 1/2$ matter 
field and the $I = 1$ gauge field, which respect $SU(2)$ gauge invariance. Expecting 
applications to the weak interaction, we denote the $SU(2)$ gauge field as $W_\mu$ and 
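its coupling strength as $g_W$. 

SU(2) Lagrangian 

$$\mathcal{L}_{SU(2)} = \bar\psi\Bigl[i\gamma^\mu\Bigl(\partial_\mu + ig_W\,\mathbf{W}_\mu\cdot\frac{\boldsymbol\tau}{2}\Bigr) - m\Bigr]\psi - \frac{1}{4}\mathbf{F}_{\mu\nu}\cdot\mathbf{F}^{\mu\nu}, \qquad \mathbf{W}^\mu = (W_1^\mu, W_2^\mu, W_3^\mu) \tag{18.68}$$

$$F^a_{\mu\nu} = \partial_\mu W^a_\nu - \partial_\nu W^a_\mu - g_W\,\epsilon_{abc}W^b_\mu W^c_\nu, \qquad \Bigl[\frac{\tau_a}{2}, \frac{\tau_b}{2}\Bigr] = i\epsilon_{abc}\frac{\tau_c}{2} \tag{18.69}$$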


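The interaction Lagrangian can alternatively be expressed in terms of charged 
and neutral bosons as 

$$-\mathcal{L}_{\rm int} = g_W\,\bar\psi\gamma^\mu\,\mathbf{W}_\mu\cdot\frac{\boldsymbol\tau}{2}\,\psi = \frac{g_W}{\sqrt{2}}\,\bar\psi\gamma^\mu\,[\tau_+ W^+_\mu + \tau_- W^-_\mu]\,\psi + g_W\,\bar\psi\gamma^\mu\,\frac{\tau_3}{2}W^3_\mu\,\psi \tag{18.70}$$

$$W^\pm_\mu = \frac{W^1_\mu \mp iW^2_\mu}{\sqrt{2}}, \qquad \tau_\pm = \frac{\tau_1 \pm i\tau_2}{2}$$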


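The equations of motion are 

$$(i\gamma^\mu\partial_\mu - m)\psi = g_W\,\gamma^\mu\,\mathbf{W}_\mu\cdot(\boldsymbol\tau/2)\,\psi \tag{18.71a}$$

$$\partial_\mu\mathbf{F}^{\mu\nu} - g_W\,\mathbf{W}_\mu\times\mathbf{F}^{\mu\nu} = g_W\,\bar\psi\gamma^\nu(\boldsymbol\tau/2)\psi \tag{18.71b}$$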



18.5 
QCD 

QCD is an abbreviation for quantum chromodynamics, meaning the dynamics of 
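color charge, and obeys the same mathematical framework as QED. The QCD 
Lagrangian can be obtained from that of $SU(2)$ in Eq. (18.68) by simple replacements: 

$$\psi \to \begin{pmatrix} R \\ G \\ B \end{pmatrix}, \qquad \mathbf{W}_\mu \to A_{a\mu}\ (a = 1, \cdots, 8), \qquad \frac{\tau_a}{2} \to \frac{\lambda_a}{2}, \qquad g_W \to g_s \tag{18.72}$$

where $R$, $G$, $B$ are Dirac fields having color R, G, B, $A_{a\mu}$ are eight gluon vector fields, $\lambda_a$ are 
Gell-Mann's $3 \times 3$ matrices and the structure constant $\epsilon_{ijk}$ is replaced by $f_{ijk}$ (see 
Appendix G). Corresponding to the eight generators, there are eight gauge fields 
$A_{a\mu}$ ($a = 1, \cdots, 8$) and the covariant derivative is defined as 

$$D_\mu = \partial_\mu + ig_s\,\mathbf{A}_\mu\cdot\mathbf{t} = \partial_\mu + ig_s\sum_a A_{a\mu}\frac{\lambda_a}{2} \tag{18.73}$$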

The QCD Lagrangian is given by Eq. (18.68) with the modifications described 
above. 

Who invented QCD is not clear. Many people suggested the existence of color de- 
grees of freedom, including Gell-Mann. In 1964-1965, dynamical models to ascribe 
the source of the strong force to color were proposed [185, 193], but their models 
had a somewhat complicated symmetry structure. A turning point was the discovery 
of asymptotic freedom in 1973 by Gross, Wilczek and Politzer [189, 190, 324]. 
Asymptotic freedom means that the effective quark-gluon coupling constant $\alpha_s = g_s^2/4\pi$ 
is a decreasing function of the momentum transfer $Q^2$, or more specifically 
behaves like $(\ln Q^2)^{-1}$. This means $\lim_{Q^2\to\infty}\alpha_s(Q^2) \to 0$, namely a perturbative 
approach can be used in the high-energy region. This provided a theoretical 
justification of the parton model and a means to improve it. As a result, QCD surfaced 
as the sole candidate for the strong interaction. Asymptotic freedom is a conse- 
quence of the nonlinear gluon coupling. Color confinement is considered another 
consequence. Investigations using lattice QCD 8) are suggestive of it. 

Since QCD is a field theory very similar to QED except for the color degree of 
freedom and nonlinear coupling of the gluon field, the calculation of perturbation 
series, i.e. that of Feynman diagrams in the lowest order, tree approximation, is 
almost identical to that of QED except for color counting. 9) When perturbative 
corrections are made to the parton model, such as emission of gluons from the 
quark, the structure functions develop as a function of $Q^2$, i.e. a shift of the parton 
model from scaling at large $Q^2$ [$> 20\ (\mathrm{GeV}/c)^2$] is apparent. Comparison of those 
predictions with precision data from electron, muon and neutrino deep inelastic 
scattering, as well as phenomena in $e^-e^+$, established the validity of QCD in the 
late 1970s. 

Hadron-hadron interactions can also be interpreted by the parton model 
[Eq. (17.150)]. QCD gave its justification and provided a recipe for how to 
calculate higher order corrections. A notable feature of quarks and gluons is that they 
are confined inside hadrons and at high energies only observed as jets. By observing 
jet phenomena, it became possible to treat the interactions between quarks and 
gluons at high energies quantitatively with high precision. 

8) Lattice QCD is a technique to calculate values of various dynamical variables 
numerically. Four-dimensional space-time is divided into a lattice of interval $a$, 
breaking Lorentz invariance but in such a way as to preserve the gauge invariance. 
In the limit $a \to 0$, the [...] power. Lattice QCD provides a powerful method to treat 
the nonperturbative aspect of QCD. 

9) If one formulates QCD in the covariant gauge, longitudinal as well as scalar gluons 
are quantized, too. They are unphysical and should not appear in the observables. 
In QED, they compensate each other internally and never appear as real photons. 
In QCD, because of its nonabelian nature, the longitudinal and scalar gluons do not 
compensate each other exactly. A ghost field, a scalar but obeying Fermi-Dirac statistics, 
[...] but an artifact arising from adopting the covariant gauge. If one works in the axial 
($A_3 = 0$) or temporal ($A_0 = 0$) gauge, one does not need the ghost, but the calculation 
is generally easier in the covariant gauge. See Sect. 11.7 for the existence of the ghost field. 

On the other hand, at small $Q^2 < 1\ (\mathrm{GeV}/c)^2$ asymptotic freedom does not apply 
and a nonperturbative treatment is necessary. Lattice QCD provides a way to cal- 
culate hadron masses, decay constants and other dynamical variables in which a 
nonperturbative treatment is necessary. Also chiral perturbation theory was devel- 
oped based on QCD; it handles many low-energy phenomena, including decays of 
heavy quarks (c, b, t). QCD is now a standard tool in nuclear physics, too. 

As a theory that can reproduce various aspects of hadronic phenomena and that 
can calculate strong force corrections to electroweak theory, QCD, in parallel with 
GWS electroweak theory, has established itself as the Standard Model of particle 
physics. It has also developed new phenomena of its own, including the instanton 
effect and the strong CP problem with its associated prediction of axions and their 
possible effect on observables. 

18.5.1 

Asymptotic Freedom 

In Sect. 8.1.5, we calculated the QED coupling constant as a function of $Q^2$ in Eq. (8.63), which is also referred to as the running coupling constant when its dependence on $Q^2$ is emphasized:

$$\alpha(Q^2) = \frac{\alpha_0}{1 + \Pi_\gamma(Q^2)}, \qquad \Pi_\gamma(Q^2) = \Pi_\gamma(0) + \hat\Pi_\gamma(Q^2)$$

$$\hat\Pi_\gamma(0) = 0, \qquad \hat\Pi_\gamma(Q^2) = -\frac{\alpha_0}{3\pi}\ln\frac{Q^2}{m_e^2}, \qquad Q^2 \gg m_e^2 \tag{18.74}$$

where the coupling $\alpha_0 = e_0^2/4\pi$, which is defined in terms of the bare charge $e_0$, is to be considered as a mere parameter. $Q^2 = -q^2$ is the 4-momentum squared of the virtual (space-like) photon and $m_e$ is the electron mass. $\Pi_\gamma(0)$ is a divergent quantity and has to be treated with the renormalization recipe. However, $\hat\Pi_\gamma(Q^2)$ is finite and physically meaningful. Expression (18.74) is obtained by summing a chain of vacuum polarization diagrams (a)-(e), which are repetitions of the one-loop contribution. Expressed in a somewhat different form

$$\frac{1}{\alpha(Q^2)} = \frac{1}{\alpha_0} - B(Q^2), \qquad B(Q^2) = -\frac{1}{\alpha_0}\left\{\Pi_\gamma(0) + \hat\Pi_\gamma(Q^2)\right\} \tag{18.75}$$

The renormalization procedure replaces the divergent term by an observable, the coupling constant $\alpha(0)$ measured at infinite distance ($Q^2 = 0$). Namely, we define

$$\frac{1}{\alpha(0)} = \frac{1}{\alpha_0} - B(0) \tag{18.76}$$

Figure 18.13 QED: Diagrams that are collected to calculate the running coupling constant: (a) is a symbolic expression for the sum of (b)-(e). (a) is called the leading log approximation (LLA).

Substituting Eq. (18.76) in Eq. (18.75), we have

$$\frac{1}{\alpha(Q^2)} = \frac{1}{\alpha(0)} - \left[B(Q^2) - B(0)\right] = \frac{1}{\alpha(0)} - \beta(Q^2), \qquad \beta(Q^2) = \frac{1}{3\pi}\ln\frac{Q^2}{m_e^2} \tag{18.77}$$

We now have a divergence-free (renormalized) expression for an effective coupling constant in terms of an observable quantity:

$$\alpha(Q^2) = \frac{\alpha(0)}{1 - \dfrac{\alpha(0)}{3\pi}\ln\dfrac{Q^2}{m_e^2}} \tag{18.78}$$

The above expression for the $\beta$ function was obtained as the first-order perturbation solution, picking up only the dominant $\ln Q^2$ term in the limit $Q^2 \gg m_e^2$. However, the coupling constant obtained from the above expression actually contains the series of higher order corrections depicted in Fig. 18.13, as can be seen by expanding Eq. (18.78) in powers of $\alpha(0)$, and it is called the leading log approximation (LLA).

In QCD, we cannot subtract the coupling constant at $Q^2 = 0$, as it is infinite because of confinement. Instead, we choose some reference value $Q^2 = \mu^2$ to define the coupling constant. Denoting the QCD coupling constant as $\alpha_s(Q^2)$, we can write

$$\frac{1}{\alpha_s(Q^2)} = \frac{1}{\alpha_s(\mu^2)} - \beta(Q^2) \tag{18.79}$$

Diagrams that contribute to the $\beta$ function in the lowest order, or leading log approximation, to $\alpha_s(Q^2)$ are given in Fig. 18.14. The $\beta$ function is expressed as 10)

$$\beta(Q^2) = -\frac{\beta_0}{4\pi}\ln\frac{Q^2}{\mu^2} \tag{18.80a}$$

$$\beta_0 = -\left(\frac{2}{3}n_f + 5 - 16\right) \tag{18.80b}$$

10) The contribution of each diagram is gauge dependent. The physical interpretation is most clear in 
the physical gauge, where only two transverse gluons are quantized and the longitudinal gluon 







Figure 18.14 QCD: Diagrams that contribute to the quark-gluon running coupling constant. (a) is common to QED and QCD. (b,c) Gluon self-coupling diagrams. (b) Transverse gluon pair. (c) Longitudinal gluon and transverse gluon pair. Repetitions of types (a)-(c) are all added in the LLA calculation.

The first term is the contribution of the fermion loop and $n_f$ is the number of quark flavors. 11) The factor (+5) in the second term in Eq. (18.80b) originates from pair creation of gluons having transverse polarizations. The third term (−16) is a contribution of one transverse gluon created together with a longitudinal gluon (Fig. 18.14c). The reason for both the first and second terms being positive is related to unitarity [Sect. (9.1.2)], i.e. $\mathrm{Im}(T_{ii}) \sim \sum_n |T_{ni}|^2$. The amplitudes of the diagrams in Fig. 18.14 are proportional to $\sigma(q\bar q \to f\bar f, gg)$ for $Q^2 = -q^2 > 0$, as can be seen by cutting them across the loop. Namely, fermion pairs and transverse gluon pairs are physical and can be produced if energy-momentum conservation is satisfied. However, the longitudinal gluons are unphysical and do not contribute to the cross section. Therefore the $\beta$ function can be negative, and in fact it is large and negative. One can see that as long as $(2/3)n_f < 11$, the $\beta$ function is negative ($\beta < 0$). The QCD running coupling constant is now expressed as

$$\alpha_s(Q^2) = \frac{\alpha_s(\mu^2)}{1 + \dfrac{\alpha_s(\mu^2)}{4\pi}\left(11 - \dfrac{2}{3}n_f\right)\ln\dfrac{Q^2}{\mu^2}} \tag{18.81}$$

which approaches zero as $Q^2 \to \infty$ (asymptotic freedom). Asymptotic freedom justifies the use of perturbation theory at large $Q^2$ (phenomenologically at $Q > 1$ GeV/c) and the parton model, which treats quarks as point particles. The difference between QED and QCD is illustrated in Fig. 18.15. It is due to the color degree of freedom and nonlinear gluon couplings in QCD. The vacuum polarization effect due to physical fermions and gluons serves as a screening effect (similar to dielectricity in electromagnetism), but the effect of the unphysical gluon loop is opposite and acts as antiscreening. Note that the QED coupling constant is running, too, and when $Q^2$ is sufficiently large, the denominator of Eq. (18.78) could vanish formally, but if one calculates the value of $Q$ necessary to achieve this, it is absurdly large and not attainable experimentally. In practice, the variation of $\alpha(Q^2)$ is very small in the range of $Q^2$ achievable today.
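The size of the QED effect can be checked numerically. The sketch below assumes the one-loop, electron-loop-only form $\alpha(Q^2) = \alpha(0)/[1 - (\alpha(0)/3\pi)\ln(Q^2/m_e^2)]$; the numerical inputs ($\alpha(0) = 1/137.036$, $m_e = 0.511$ MeV) and the function names are illustrative choices, not taken from the text. Because only the electron loop is included, the shift it gives at high $Q$ is smaller than the one obtained when all charged fermions are summed.

```python
import math

ALPHA_0 = 1.0 / 137.036   # assumed input: fine-structure constant at Q^2 = 0
M_E = 0.511e-3            # assumed input: electron mass in GeV


def alpha_qed(q, alpha0=ALPHA_0, m=M_E):
    """One-loop leading-log QED coupling with only the electron loop (valid for q >> m)."""
    return alpha0 / (1.0 - (alpha0 / (3.0 * math.pi)) * math.log(q**2 / m**2))


def log10_pole_scale(alpha0=ALPHA_0, m=M_E):
    """log10 of the scale Q (in GeV) where the denominator vanishes.

    The denominator is zero when ln(Q^2/m^2) = 3*pi/alpha0,
    i.e. Q = m * exp(3*pi / (2*alpha0)); the logarithm is returned
    because Q itself is an astronomically large number.
    """
    return math.log10(m) + (3.0 * math.pi / (2.0 * alpha0)) / math.log(10.0)


if __name__ == "__main__":
    print("1/alpha at Q = 91.19 GeV:", 1.0 / alpha_qed(91.19))
    print("log10(Q_pole / GeV):", log10_pole_scale())
```

The pole sits hundreds of orders of magnitude above any reachable energy, which is the sense in which the text calls it absurdly large.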

11) Note that in QED $n_f = 1$ and $\beta_0 = -2/3$, which agrees with Eq. (18.74) if we take into account the normalization difference of QCD, where what corresponds to $e$ in QCD is actually $g/\sqrt{2}$ for historical reasons. (See the second line of Eq. (18.70) for why $g/\sqrt{2}$ is used.) $n_f$ is the effective number of quark flavors (not always 6), whose mass satisfies the condition $m_f < Q$.
Figure 18.15 (a,b) Pictorial illustration of screening and antiscreening in QED/QCD. (a) The strength of the QED coupling constant seen by a test charge decreases as it goes away from the source (dielectric effect of the vacuum). (b) The strength of $\alpha_s$ seen by a test color charge (another quark) increases due to the antiscreening effect of gluon self-coupling and becomes the confining potential. (c) The QED coupling constant seen by a test charge is finite at $r = \infty$ ($Q^2 = 0$) and increases at shorter distances as the screening is weaker. $\alpha_s$ is meaningless at large distances $r > 10^{-13}$ cm because of confinement.



On the other hand, the variation of $\alpha_s(Q^2)$ within the technically achievable range is sizeable. Therefore, the treatment of the running coupling constant normalized at an arbitrary value of $\mu^2$ takes a bit of getting used to. The observable quantities should not depend on which $\mu$ one chooses. The requirement takes the form of a renormalization group equation and constrains the behavior of the coupling constant. Here, we only show that regardless of the choice of $\mu^2$ we can choose a universal constant that is the fundamental constant of QCD. Equation (18.79) with Eq. (18.80) substituted gives

$$\frac{1}{\alpha_s(Q^2)} - \frac{\beta_0}{4\pi}\ln Q^2 = \frac{1}{\alpha_s(\mu^2)} - \frac{\beta_0}{4\pi}\ln\mu^2 \tag{18.82}$$

The lhs is a function of $Q^2$ and the rhs of $\mu^2$ only. Consequently they are constant and we equate them with $-(\beta_0/4\pi)\ln\Lambda_{\rm QCD}^2$:

$$\frac{1}{\alpha_s(Q^2)} = \frac{\beta_0}{4\pi}\ln\frac{Q^2}{\Lambda_{\rm QCD}^2}, \qquad \alpha_s(Q^2) = \frac{12\pi}{(33 - 2n_f)\ln\left(Q^2/\Lambda_{\rm QCD}^2\right)} \tag{18.83}$$

Figure 18.16 Measured $\alpha_s$ [...]. $\mu$ corresponds to $Q$ in the text [311].

Figure 18.16 shows observed values of $\alpha_s$ at various $Q^2$ and indeed confirms the logarithmic dependence of the coupling constant. Note, however, that what is actually measured is $\alpha_s(Q^2)$ and $\Lambda_{\rm QCD}$ has to be calculated from it. The experimental value of $\Lambda_{\rm QCD}$ is 150-300 MeV. In principle, it should not depend on $Q^2$, but Eq. (18.83) is an approximation using one-loop diagrams. 12) There still is great uncertainty as to the true value of $\Lambda_{\rm QCD}$.
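The equivalence of the two one-loop forms, Eq. (18.81) normalized at a reference scale $\mu$ and Eq. (18.83) written in terms of $\Lambda_{\rm QCD}$, can be sketched numerically. The function names below are hypothetical, flavor thresholds are ignored (a fixed $n_f$ is assumed), and the reference input $\alpha_s(91.19\ \mathrm{GeV}) = 0.117$ is used only as an illustration.

```python
import math


def beta0(n_f):
    """One-loop coefficient 11 - (2/3) n_f appearing in Eqs. (18.81) and (18.83)."""
    return 11.0 - 2.0 * n_f / 3.0


def alpha_s_from_reference(q, mu, alpha_mu, n_f):
    """Eq. (18.81): coupling at scale q from its value alpha_mu at the reference scale mu."""
    log_term = math.log(q**2 / mu**2)
    return alpha_mu / (1.0 + alpha_mu / (4.0 * math.pi) * beta0(n_f) * log_term)


def lambda_from_reference(mu, alpha_mu, n_f):
    """Solve 1/alpha_mu = (beta0/4pi) ln(mu^2/Lambda^2) for Lambda (same units as mu)."""
    return mu * math.exp(-2.0 * math.pi / (beta0(n_f) * alpha_mu))


def alpha_s_from_lambda(q, lam, n_f):
    """Eq. (18.83): coupling at scale q in terms of Lambda, valid only for q >> lam."""
    return 4.0 * math.pi / (beta0(n_f) * math.log(q**2 / lam**2))


if __name__ == "__main__":
    lam = lambda_from_reference(91.19, 0.117, n_f=5)
    for q in (5.0, 10.0, 91.19, 500.0):
        print(q, alpha_s_from_reference(q, 91.19, 0.117, 5), alpha_s_from_lambda(q, lam, 5))
```

Both columns agree for every $Q$, which illustrates that the choice of $\mu$ drops out once $\Lambda_{\rm QCD}$ is introduced. The one-loop $\Lambda$ obtained this way depends on $n_f$ and on the order of the approximation, consistent with the remark that its value carries a sizeable uncertainty.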

It is interesting to note that Eq. (18.83) becomes infinite at $Q = \Lambda_{\rm QCD}$, which corresponds to a distance on the order of 0.5 fm = $0.5 \times 10^{-13}$ cm, roughly equal to half the hadron size. As the formula is valid only in the asymptotic region, $Q$ has to be at least larger than $\sim 1$ GeV/c, where $\alpha_s(1\ \mathrm{GeV}/c) \sim 0.5$, for the formula (18.83) to be valid. It is conventional to express experimental values of $\alpha_s$ converted to that at $Q = m_Z \simeq 91$ GeV/c. The experimental value is [311]

$$\alpha_s(m_Z) = 0.1170 \pm 0.0012 \tag{18.84}$$

12) For higher order expressions of $\alpha_s(Q^2)$ or $\Lambda_{\rm QCD}$ see [311].



18.5.2

Confinement

Confinement is also considered to be a result of nonlinear gluon coupling. It is not rigorously proven yet, although lattice QCD calculations are indicative of it. Qualitatively, the reasoning for confinement is well understood. The argument closely follows that of the Meissner effect in superconductivity, a perfect diamagnetism induced by a Bose-Einstein condensate of Cooper pairs. Cooper pairs are bound states of an electron pair with opposite momentum and spin orientation, attracted to each other through lattice vibrations (phonons). The attractive force, normally hidden by thermal motion, manifests itself at low temperature. When a magnetic field tries to penetrate any medium, eddy currents made by freely moving electrons are produced to compensate it, a familiar phenomenon in electromagnetism that is referred to as Lenz's law. Normally the effect is small, but in the superconductor, the compensation works maximally and shows perfect diamagnetism. In the language of field theory, the photon has acquired a mass and the force becomes short ranged.

The invasion of the magnetic flux into the superconductor is limited by the Compton wavelength of the massive photon. If the strength of the magnetic field exceeds some critical value $B_c$, the magnetic energy can destroy the Cooper pairs and melt the condensate, and the superconductor is converted to a normal conductor. This is referred to as a type I superconductor (Fig. 18.17a). In a type II superconductor, the magnetic field can partially break the superconductivity for the field range $B_{c1} < B < B_{c2}$, and flux tubes that penetrate the superconductor are created (Fig. 18.17b). Within the narrow confined region of the tube, $B > B_{c2}$ and the medium is normal conducting. Outside the tube, it is superconducting and the magnetic flux is repelled from it by supercurrent circulating around the tubes. A simple explanation of superconductivity by Ginzburg and Landau is given in the boxed paragraph on p. 769.

Now imagine a colored quark in a medium in a sort of superconducting state, but with the roles of the magnetic and electric fields exchanged. The color electric field is created around the quark. In its vicinity, the field strength is strong enough that it melts the superconductivity of the medium, and the flux expands isotropically (Fig. 18.17c). The flux density decreases in proportion to the inverse of the distance squared and the field strength is of Coulomb type, just like the electric field in QED. At long distances, the field is not strong enough to destroy the superconducting state. As the total flux must be conserved unless it is absorbed by (an)other anticolored quark(s), the requirement of minimum energy state constrains the flux to be confined in a narrow tube with constant cross-sectional area, keeping its field strength above the critical value. This penetrates the medium (Fig. 18.17d) just like the magnetic flux tube in the type II superconductor. As the flux per unit area in the tube is kept constant, the strength of the force is also kept constant. Its accumulated energy grows in proportion to the tube's length; in other words, it behaves like a stretched string with constant tension. The tube is terminated by another anticolor charge or the antiquark. Thus the potential energy between the quark and the antiquark is expressed by a combination of the Coulomb and linear potentials:

$$V(r) \sim -\frac{a}{r} + br \tag{18.85}$$

Figure 18.17 Meissner effect. (a) Type I superconductor. The magnetic flux is repelled from the superconductor for $B < B_c$. (b) Type II superconductor. For $B_{c1} < B < B_{c2}$ the magnetic flux is squeezed and flux tubes can penetrate the superconductor. Inside the tubes it is a normal conductor. (c) The color electric field flux expands uniformly in the vicinity of the color charge and the force is Coulomb-like. (d) For large $r$, the flux is squeezed to form a tube with constant cross section. The flux density and the force strength stay constant.

Figure 18.18 Confinement potential estimated by lattice QCD. The Cornell potential is given by $V(r) = \sigma r - e/r$ ($\sigma \sim 0.1$ GeV × 1[...] cm, $e \sim 0.7$ GeV/$10^{-13}$ cm), normalized to $V(r_0) = 0$. $\beta = 2N/g^2$ for SU(N) [39].

It qualitatively reproduces the confining potential introduced to explain charmoni- 
um levels in Eq. (14.60). Figure 18.18 shows an example of a lattice QCD calculation 
of the potential. 
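The two regimes of Eq. (18.85) can be made concrete with a short numerical sketch. The coefficient values used below ($a = 0.3$ GeV fm, $b = 1$ GeV/fm) are hypothetical round numbers chosen only to show the crossover; they are not the fitted values of the text or of any lattice calculation.

```python
def cornell_potential(r, a, b):
    """Coulomb-plus-linear potential V(r) = -a/r + b*r, as in Eq. (18.85)."""
    return -a / r + b * r


def cornell_force(r, a, b):
    """Radial force F = -dV/dr = -(a/r**2 + b); negative means attractive."""
    return -(a / r**2 + b)


if __name__ == "__main__":
    a, b = 0.3, 1.0   # hypothetical coefficients: GeV*fm and GeV/fm
    for r in (0.05, 0.2, 1.0, 5.0):   # separations in fm
        print(r, cornell_potential(r, a, b), cornell_force(r, a, b))
```

At small $r$ the $1/r^2$ term dominates the force (Coulomb-like behavior), while at large $r$ the force approaches the constant $-b$, the string tension of the flux tube, so the stored energy grows linearly with separation.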



Ginzburg-Landau Theory of Superconductivity 

Superconductivity is characterized by its permanent electric current and perfect diamagnetism (Meissner effect [273]). When we write $e^*\ (= 2e)$ and $m^*\ (= 2m)$ for the electric charge and mass, respectively, of the supercurrent carrier (i.e. Cooper pair), its behavior in the magnetic field is described by the Ginzburg-Landau free energy [170]

$$F = F_n + \int d^3x \left[\frac{1}{2}|\nabla\times A|^2 + \frac{1}{2m^*}\left|(-i\hbar\nabla - e^*A)\psi\right|^2 + \alpha|\psi|^2 + \frac{\beta}{2}|\psi|^4\right]$$

where $F_n$ is the free energy of the normal conductor and $\psi$ is the Cooper pair's wave function, commonly referred to as the order parameter. The equation of motion for $\psi$ and $A$ can be obtained by minimizing $F$, which gives

$$\frac{1}{2m^*}(-i\hbar\nabla - e^*A)^2\psi + \alpha\psi + \beta|\psi|^2\psi = 0 \tag{18.88a}$$

$$\nabla\times(\nabla\times A) = j = \frac{e^*}{2m^*}\left[\psi^*(-i\hbar\nabla\psi) + (i\hbar\nabla\psi^*)\psi\right] - \frac{e^{*2}}{m^*}|\psi|^2 A \tag{18.88b}$$

At $T > T_c$ the free energy has a minimum at $|\psi| = 0$. This can be seen by looking at the potential part of the free energy (i.e. the last two terms), but below the critical temperature, $T < T_c$, the minimum moves to $|\psi| = \psi_0 = \sqrt{-\alpha/\beta}$ (see Fig. 18.23 for the potential shape). This is a Bose-Einstein condensate state, because as $T \to 0$ all the Cooper pairs lie in the ground state with density given by $|\psi_0|^2 = n_s$. Writing $\psi(x) = \psi_0 + \varphi(x)$, we consider only the ground state, i.e. $\varphi(x) = 0$. Then the kinetic energy term in Eq. (18.88b) can be neglected. Taking a time derivative or rotation of Eq. (18.88b) and using $E = -\partial A/\partial t$, $B = \nabla\times A$, we obtain

$$\frac{\partial j}{\partial t} = \frac{1}{\lambda^2}E, \qquad \Delta B - \frac{1}{\lambda^2}B = 0, \qquad \lambda^2 = \frac{m^*}{e^{*2}n_s} \tag{18.89}$$



$\lambda$ is referred to as the (London) penetration depth.

The first equation gives the supercurrent with perfect conductivity, because any electromagnetic field produces an accelerating current, and the second the Meissner effect. Equations (18.89) mean that the photon has acquired a mass $\mu = \hbar/\lambda c$.

Consider a medium with boundary at $x = 0$ (see Fig. 18.19). The region ($x < 0$) is the vacuum in the magnetic field $B = (0, B, 0)$ and that ($x > 0$) is superconducting. Equations (18.89) have a solution $B(x > 0) = (0, Be^{-x/\lambda}, 0)$, which connects with $B(x < 0)$. One sees that the magnetic field penetrates only to the penetration depth $\sim \lambda$, which is typically hundreds of atomic distances, a microscopic size. At the boundary, the density of the Cooper pair does not rise abruptly, but reaches its bulk value over the coherence length defined by $\xi = \hbar/\sqrt{m^*|\alpha|}$. This can be seen from an approximate solution ($\psi \sim e^{-x/\xi}$) to Eq. (18.88a), which can be obtained by setting $A = 0$ and $\beta|\psi|^2 \ll |\alpha|$. The ratio $\kappa = \lambda/\xi$, referred to as the Ginzburg-Landau parameter, discriminates between type I and type II superconductors:

$$\kappa = \lambda/\xi, \qquad \text{type I: } \kappa < 1/\sqrt{2}, \qquad \text{type II: } \kappa > 1/\sqrt{2} \tag{18.90}$$
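The exponential screening of the field and the type I / type II criterion of Eq. (18.90) can be written as a few lines of code. This is only a sketch: the function names are hypothetical and the lengths may be given in any common unit.

```python
import math


def london_field(x, b0, lam):
    """Field inside the superconductor (x >= 0): B(x) = b0 * exp(-x / lam)."""
    if x < 0:
        return b0   # vacuum side: the applied field is unscreened
    return b0 * math.exp(-x / lam)


def ginzburg_landau_type(lam, xi):
    """Classify by kappa = lam / xi relative to 1/sqrt(2), following Eq. (18.90)."""
    kappa = lam / xi
    return "type I" if kappa < 1.0 / math.sqrt(2.0) else "type II"


if __name__ == "__main__":
    print(london_field(2.0, b0=1.0, lam=1.0))     # field two penetration depths inside
    print(ginzburg_landau_type(lam=1.0, xi=2.0))  # kappa = 0.5
    print(ginzburg_landau_type(lam=2.0, xi=1.0))  # kappa = 2.0
```

The boundary case $\kappa = 1/\sqrt{2}$ is assigned to type II here purely as a coding convention; Eq. (18.90) leaves it unspecified.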



[Figure 18.19: boundary between the vacuum and the superconductor; in the bulk of the superconductor $|\psi|^2 = n_s$.]




18.6

Unified Theory of the Electroweak Interaction

18.6.1

SU(2) × U(1) Gauge Theory



In Chap. 15, we learned that the weak interaction is of vector type, that it acts only on left-handed particles and that $W^\pm$ couple to isospin current, suggesting $W^\pm$ are members of an isospin triplet and the existence of a neutral boson $W^0$. All this points to a gauge theory of SU(2) symmetry. But there are two objections to be addressed. One is that the weak interaction shares part of the isospin current with the electromagnetic interaction and may not obey strict SU(2) symmetry. The other is the short-range nature of the interaction, or equivalently the finite mass of $W^\pm$, which obviously contradicts the notion of $W^\pm$ being gauge particles. Deferring the mass problem until later, we will first address the fusion of the weak force with the electromagnetic force.

Since the electromagnetic current consists of isospin and remaining current [see Eqs. (15.98)], we consider the photon as a mixture of two gauge fields, one generated by the isospin and the other by a yet to be determined symmetry. We assume it to be U(1) for simplicity. Then the electric charge is built from the isospin and the U(1) hypercharge, related by

$$Q = I_3 + \frac{Y}{2} \tag{18.91}$$

which defines the hypercharge of particles. The factor 1/2 is introduced to make the formula the same as the Nishijima-Gell-Mann formula. 13) A new ingredient that did not exist in the isospin of the strong interaction is that we need to discriminate left- and right-handed particles as having different isospin and hypercharge. It is necessary because the isospin operator, the source of the weak force, acts only on left-handed particles and the hypercharge has to make a necessary adjustment to keep the charge $Q$ independent of handedness. This means that we regard the left- and right-handed particles as different particles.

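Let us consider a doublet consisting of $(\nu_e, e^-)$ for concreteness, though the following arguments are valid for any isospin doublet $\Psi$, including the quarks. A charged current made of a left-handed doublet

$$j^{+\mu} = \bar\nu_L\gamma^\mu e_L = \bar\Psi_L\gamma^\mu\tau_+\Psi_L \tag{18.92a}$$

$$\Psi_L = \begin{bmatrix}\nu_L \\ e_L\end{bmatrix} \tag{18.92b}$$

couples with $W$, i.e. the weak interaction Lagrangian is assumed to be given by

$$-\mathcal{L}_W = g_W\bar\Psi_L\gamma^\mu\left[\frac{1}{\sqrt{2}}\left(\tau_+W^+_\mu + \tau_-W^-_\mu\right) + \frac{\tau_3}{2}W^0_\mu\right]\Psi_L \tag{18.93a}$$

$$W^\mp_\mu = \frac{1}{\sqrt{2}}\left(W^1_\mu \pm iW^2_\mu\right), \qquad W^0_\mu = W^3_\mu, \qquad \tau_\pm = \frac{1}{2}(\tau_1 \pm i\tau_2) \tag{18.93b}$$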


13) Note, the hypercharge thus defined is different from that of the strong interaction. In strong 
interactions, hypercharge is Y = (B + S + C + B--- )/2, where B is baryon number and S,C,--- 
are flavor numbers corresponding to strangeness, charmness, etc. and not sources of force fields. 
where we have included the hypothetical $W^0$ to make the interaction consistent with SU(2) gauge interaction [see Eq. (18.70)]. With the assumption that $\nu_R$ does not exist, 14) $e_R$ has no partner and constitutes a singlet. The hypercharge of $\nu_L$, $e_L$, $e_R$ is determined from Eq. (18.91)

$$Y(\nu_L) = Y(e_L) = -1, \qquad Y(e_R) = -2\ ^{15)} \tag{18.94}$$

Denoting the gauge field associated with the hypercharge as $B^\mu$, the Lagrangian satisfying independently SU(2) and U(1) symmetries [denoted as SU(2) × U(1)] can be written as

$$\mathcal{L} = \bar\Psi_L i\gamma^\mu D_\mu\Psi_L + \bar e_R i\gamma^\mu D_\mu e_R - \frac{1}{4}F_{\mu\nu}\cdot F^{\mu\nu} - \frac{1}{4}B_{\mu\nu}B^{\mu\nu} \tag{18.95a}$$

$$D_\mu = \partial_\mu + ig_W W_\mu\cdot t + i(g_B/2)B_\mu Y \tag{18.95b}$$

The factor 1/2 for $g_B$ is for later convenience and has no special meaning. Since $\Psi_L$ and $e_R$ have different isospin and hypercharge, the isospin operator $t$ and the hypercharge operator $Y$ have different eigenvalues depending on the operand. For instance, $t\,e_R = 0$, $Y\Psi_L = -\Psi_L$, $Ye_R = -2e_R$. Notice that the Lagrangian has no mass term. If it did, the left- and right-handed components would mix and no unique quantum numbers could be assigned independently (see arguments in Sect. 4.3.5). In other words, in a world where both left- and right-handed particles respect independently gauge invariance (chiral invariance), the fermion cannot have mass. Let us close our eyes to massless fermions for a moment and proceed.
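The hypercharge assignments follow mechanically from Eq. (18.91), $Y = 2(Q - I_3)$, once the charge and the third isospin component of each chiral field are given. The sketch below tabulates them with exact fractions; the dictionary layout and names are illustrative, and the quark entries reproduce the values quoted in footnote 15.

```python
from fractions import Fraction as Fr


def hypercharge(charge, isospin3):
    """Invert Q = I3 + Y/2 to get Y = 2 (Q - I3)."""
    return 2 * (charge - isospin3)


# (electric charge Q, third isospin component I3) for each chiral field.
# Left-handed fields sit in doublets (I3 = +-1/2); right-handed ones are singlets (I3 = 0).
FIELDS = {
    "nu_L": (Fr(0), Fr(1, 2)),
    "e_L": (Fr(-1), Fr(-1, 2)),
    "e_R": (Fr(-1), Fr(0)),
    "u_L": (Fr(2, 3), Fr(1, 2)),
    "d_L": (Fr(-1, 3), Fr(-1, 2)),
    "u_R": (Fr(2, 3), Fr(0)),
    "d_R": (Fr(-1, 3), Fr(0)),
}

HYPERCHARGE = {name: hypercharge(q, i3) for name, (q, i3) in FIELDS.items()}

if __name__ == "__main__":
    for name, y in HYPERCHARGE.items():
        print(f"Y({name}) = {y}")
```

Both members of a left-handed doublet come out with the same $Y$, as they must for $Y$ to commute with the SU(2) generators.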

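Extraction of the $W^\pm_\mu$ part of the Lagrangian Eq. (18.95a) reproduces Eq. (18.93a). Extraction of terms including $W^0_\mu$ and $B_\mu$ gives

$$\begin{aligned}
-\mathcal{L}_{NC} &= \frac{g_W}{2}\left(\bar\nu_L\gamma^\mu\nu_L - \bar e_L\gamma^\mu e_L\right)W^0_\mu \\
&\quad + \frac{g_B}{2}\left[(-1)\left(\bar\nu_L\gamma^\mu\nu_L + \bar e_L\gamma^\mu e_L\right) + (-2)\,\bar e_R\gamma^\mu e_R\right]B_\mu \\
&= \frac{1}{2}\left(g_WW^0_\mu - g_BB_\mu\right)\left(\bar\nu_L\gamma^\mu\nu_L\right) \\
&\quad - \frac{1}{2}\left(g_WW^0_\mu + g_BB_\mu\right)\left(\bar e_L\gamma^\mu e_L\right) - g_BB_\mu\,\bar e_R\gamma^\mu e_R
\end{aligned} \tag{18.96}$$

where (−1) and (−2) in front of $\bar\Psi_L\gamma^\mu\Psi_L$ and $\bar e_R\gamma^\mu e_R$ in the first expression reflect their hypercharge. Since both gauge fields $B$ and $W^0$ couple to the same current ($\bar\nu_L\gamma^\mu\nu_L$ and $\bar e_{L(R)}\gamma^\mu e_{L(R)}$), they mix through fermion loops (Fig. 18.20). As the neutrino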

14) Since the discovery of neutrino oscillation, this assumption is no longer true. However, the only necessary modification is to include $\nu_R$ in the Lagrangian, which is rather academic under the present circumstances. The assignment $I(e_R) = 0$ holds regardless of the

15) For quarks, $(u_L, d_L)$ couple with $W$, constituting an isodoublet, but $u_R$, $d_R$ [...] $Y(u_L) = Y(d_L) = 1/3$, $Y(u_R) = 4/3$, $Y(d_R) = -2/3$. The same rule applies to $(c, s)$ and $(t, b)$ quarks,
Figure 18.20 Mixing between $B$ and $W^0$ occurs when both fields couple to the same fermion pair.

should not couple to the electromagnetic field, we define the combination that couples to the neutrino as $Z_\mu$ and its orthogonal combination as the electromagnetic field $A_\mu$:

$$Z_\mu = \frac{1}{\sqrt{g_W^2 + g_B^2}}\left(g_WW^0_\mu - g_BB_\mu\right) = \cos\theta_W W^0_\mu - \sin\theta_W B_\mu \tag{18.97a}$$

$$A_\mu = \frac{1}{\sqrt{g_W^2 + g_B^2}}\left(g_BW^0_\mu + g_WB_\mu\right) = \sin\theta_W W^0_\mu + \cos\theta_W B_\mu \tag{18.97b}$$



which defines the Weinberg angle $\theta_W$. Inserting $Z_\mu$, $A_\mu$ defined in Eq. (18.97) back in Eq. (18.96), and rewriting their coefficients using $Q = I_3 + Y/2$, we obtain

$$\begin{aligned}
-\mathcal{L}_{NC} &= g_W\sin\theta_W\left(\bar e_L\gamma^\mu Qe_L + \bar e_R\gamma^\mu Qe_R\right)A_\mu \\
&\quad + \sqrt{g_W^2 + g_B^2}\left[I_3\left(\bar\nu_L\gamma^\mu\nu_L + \bar e_L\gamma^\mu e_L\right) - Q\sin^2\theta_W\left(\bar e_L\gamma^\mu e_L + \bar e_R\gamma^\mu e_R\right)\right]Z_\mu \\
&= Qe\,\bar\Psi\gamma^\mu A_\mu\Psi + g_Z\bar\Psi\gamma^\mu Z_\mu\left(I_{3L} - Q\sin^2\theta_W\right)\Psi
\end{aligned} \tag{18.98a}$$

$$e = g_W\sin\theta_W, \qquad g_Z = \sqrt{g_W^2 + g_B^2} \tag{18.98b}$$

where $I_{3L}$ means that it acts only on the left-handed part of the field. The first term is the electromagnetic interaction and the second a new weak neutral current interaction that couples to $Z_\mu$. Because of mixing between SU(2) and U(1), the current that couples to the weak neutral boson $Z_\mu$ is a mixture of the left-handed neutral current and the electromagnetic current. Hereafter, we restrict the use of the word neutral current to that which couples exclusively to $Z_\mu$ unless otherwise stated.
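The algebra that turns the $(W^0, B)$ couplings of Eq. (18.96) into the $(A, Z)$ couplings of Eq. (18.98a) can be checked numerically. The sketch below assumes that a chiral field with quantum numbers $(I_3, Y)$ couples as $g_W I_3 W^0_\mu + (g_B/2) Y B_\mu$, and it inverts Eq. (18.97) to express $W^0$ and $B$ through $A$ and $Z$. The coupling values 0.65 and 0.35 are arbitrary illustrative numbers, and the function name is hypothetical.

```python
import math


def neutral_couplings(i3, y, g_w, g_b):
    """Return (coupling to A, coupling to Z) for a chiral field with isospin i3, hypercharge y."""
    g_z = math.hypot(g_w, g_b)            # sqrt(g_w^2 + g_b^2)
    cos_w, sin_w = g_w / g_z, g_b / g_z   # Weinberg angle as defined by Eq. (18.97)
    c_w0 = g_w * i3                       # coefficient of W0 in -L_NC
    c_b = g_b * y / 2.0                   # coefficient of B in -L_NC
    # Inverse of Eq. (18.97): W0 = cos*Z + sin*A, B = -sin*Z + cos*A
    c_a = c_w0 * sin_w + c_b * cos_w
    c_z = c_w0 * cos_w - c_b * sin_w
    return c_a, c_z


if __name__ == "__main__":
    g_w, g_b = 0.65, 0.35                 # arbitrary illustrative couplings
    for name, i3, y in (("nu_L", 0.5, -1.0), ("e_L", -0.5, -1.0), ("e_R", 0.0, -2.0)):
        print(name, neutral_couplings(i3, y, g_w, g_b))
```

The neutrino's coupling to $A_\mu$ vanishes, $e_L$ and $e_R$ couple to $A_\mu$ with the same strength $Qe$, where $e = g_W\sin\theta_W$, and each $Z_\mu$ coupling equals $g_Z(I_3 - Q\sin^2\theta_W)$.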

We have, starting from the chiral world, where left- and right-handed particles are independent, succeeded in extracting a left-right symmetric current, which can be identified as the electromagnetic current. It is a gauge theory based on SU(2) × U(1) and contains the neutral weak gauge field $Z^0$ as well as the electromagnetic field. Note, however, that the above formalism is valid only for massless gauge fields as well as for massless fermions. Adding mass terms by hand makes the formalism phenomenologically usable in many applications today. Using the above Lagrangian and equating the $W$ exchange transition with the four-Fermi interaction of the weak interaction, we obtain

$$\frac{G_F}{\sqrt{2}} = \frac{g_W^2}{8m_W^2} = \frac{e^2}{8m_W^2\sin^2\theta_W} \tag{18.99}$$

$\theta_W$ is referred to as the Weinberg angle, and if $\sin^2\theta_W$ can be determined by experiments, the $W$ mass would also be determined. A shortcoming of the theory is that the mass is put in by hand, thereby destroying the gauge invariance. No reliable higher order calculations are possible either.
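Equation (18.99) can be turned around to estimate the $W$ mass once $\sin^2\theta_W$ is known. With $e^2 = 4\pi\alpha$ it gives $m_W^2 = \pi\alpha/(\sqrt{2}\,G_F\sin^2\theta_W)$. The sketch below uses assumed inputs that do not appear in the text ($\alpha = 1/137.036$, $G_F = 1.1664\times10^{-5}\ \mathrm{GeV}^{-2}$, $\sin^2\theta_W = 0.23$) and is a lowest-order estimate only.

```python
import math


def w_mass_tree_level(alpha, g_fermi, sin2_theta_w):
    """Solve G_F/sqrt(2) = e^2 / (8 m_W^2 sin^2 theta_W) for m_W, using e^2 = 4 pi alpha."""
    return math.sqrt(math.pi * alpha / (math.sqrt(2.0) * g_fermi * sin2_theta_w))


if __name__ == "__main__":
    # Assumed illustrative inputs (natural units, GeV):
    alpha = 1.0 / 137.036     # fine-structure constant at Q^2 = 0
    g_fermi = 1.1664e-5       # Fermi constant in GeV^-2
    sin2_theta_w = 0.23       # weak mixing angle
    print("m_W (lowest order) =", w_mass_tree_level(alpha, g_fermi, sin2_theta_w), "GeV")
```

The estimate comes out a few GeV below the measured $W$ mass; most of the gap is closed when the running of $\alpha$ up to the weak scale and other higher order corrections are included, which is the kind of calculation that requires a renormalizable theory.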



18.6.2

Spontaneous Symmetry Breaking

We have learned in the previous section that we could construct a satisfactory theo- 
ry, at least phenomenologically by putting the mass terms in by hand. But by doing 
so, the gauge invariance is broken, making it unrenormalizable, with resultant loss 
of predictive power. Here we demonstrate that despite apparent breakdown of the 
symmetry, there are cases in which the symmetry is preserved deep in the roots of 
the mathematical formalism. We might well say the symmetry is not broken but 
hidden. 

An equation with some symmetry generally has solutions that respect the same 
symmetry, but those solutions are not necessarily stable. Consider, for instance, 
a stick standing upright on a plane. Apply pressure straight down from above 
(Fig. 18.21). Within the elastic limit it is compressed, but beyond a certain pressure 
it is bent and stays that way. The solution is supposed to have rotational symmetry 
around an axis, but the realized stable solution obviously breaks it. This is a case 
in which the equation respects the symmetry but its solution does not. A remnant 
of the symmetry is there in that any direction can be chosen and the realized state 
has the same energy regardless of the chosen direction. Namely, the ground states 
are infinitely degenerate. 

Another example where field theory can be applied is to a ferromagnetic sub- 
stance. Ferromagnetism is produced as a result of electron spins all being aligned. 
At high temperature, the direction of the spins is random due to thermal motion, 
chaos in a sense, but since there is no special direction, the rotational symmetry is 
respected. When an external magnetic field is applied, all the spins are aligned and 
the rotational symmetry is broken. This is the case that the symmetry is broken by 
introducing a symmetry-breaking perturbation by hand in the system. The external 
field is not rotationally symmetric, which is the origin of the symmetry breaking 
of the realized state. However, even without a symmetry-breaking perturbation, the 
spins can be aligned. This happens below a critical point, the Curie temperature. 
Below it, spin-spin interaction, which by itself is rotationally symmetric, wins over 
thermal turbulence and the spins align themselves (Fig. 18.22a). The chosen di- 
rection is quite arbitrary, because the states are infinitely degenerate, but once a 
direction is chosen, all the neighboring spins follow the initiator's trigger and align 



Figure 18.21 A stick under pressure. The bent position exhibits a ground state that breaks the rotational symmetry and is infinitely degenerate.

Figure 18.22 (a) Magnetization pattern as a function of temperature. Below the Curie temperature ($T_C$), spins are ordered and magnetization appears. (b) Domain patterns observed on a {110} surface away from a crystal [...]. (c) Inferred domain magnetization directions. Figures taken from [309].



along the same direction; the chosen direction is frozen. No local perturbations can 
change it because it takes a vast amount of energy (binding energy per spin times 
the number of spins). Such processes are observed commonly in condensed matter 
physics. Common features of them are 



1. There is some critical temperature T c . 

2. A configuration with good symmetry becomes unstable below the critical tem- 
perature and changes to another stable state where the original symmetry is 
broken. 

3. Realized stable states (ground states) are continuously (infinitely) degenerate. 



When the process happens, it is called spontaneous symmetry breaking. The chosen direction need not be the same in one place as in another if they are far apart. Because of this, domains of differing spin direction are formed (Fig. 18.22b). A little man living in such a world would never imagine that the equation that governs the direction of spins respects rotational symmetry. Such a macroscopic metamorphosis at a critical temperature has been known for quite a while as a phase transition and its formalism is well developed in condensed matter physics. It was Nambu who introduced the idea of spontaneous symmetry breaking into particle physics. He investigated superconductivity from the viewpoint of gauge symmetry breakdown and applied it to particle physics [287, 289, 290].

The mathematical formalism of spontaneous symmetry breaking goes like this. Consider a self-interacting complex scalar field whose expectation value is taken to be the magnetization we talked about. Its Lagrangian has the form 16)

$$\mathcal{L}_G = \partial_\mu\phi^\dagger\partial^\mu\phi - V(\phi) \tag{18.100a}$$

$$V(\phi) = \mu^2(\phi^\dagger\phi) + \lambda(\phi^\dagger\phi)^2 + V(0) \tag{18.100b}$$

$$\lambda > 0, \qquad \mu^2 = a(T - T_c) \tag{18.100c}$$

16) The Hamiltonian that can be derived from Eq. (18.100) is referred to as [...]. $\phi$ is an order parameter in the terminology of condensed matter physics.

Figure 18.23 (a) At high temperature [...] 0. (b) At low temperature $T < T_c$, the $|\phi| = 0$ state is unstable, but the st[...] infinitely degenerate.

The constant term $V(0)$ can be chosen arbitrarily, because we are only dealing with the energy difference or excitations from the ground state. 17) The Lagrangian is invariant under gauge transformation

$$\phi \to \phi' = e^{-i\alpha}\phi, \qquad \phi^\dagger \to \phi^{\dagger\prime} = \phi^\dagger e^{i\alpha} \tag{18.101}$$



Note, when $|\phi|$ is to be interpreted as the magnetization, the rotational symmetry is in real space. Here we use rotational symmetry in the complex internal space because we want to discuss gauge symmetry. From now on we talk about the complex scalar field in particle physics and the field of magnetization interchangeably. When we talk about magnetization, the lowest energy state is referred to as the ground state.

At high temperature ($T > T_c$), the potential is of the form depicted in Fig. 18.23a and the lowest energy (the ground state) is realized with $|\phi| = 0$ or zero magnetization. At low temperature ($T < T_c$), however, $\mu^2 < 0$ and the shape of the potential is like a Mexican hat or the bottom of a wine bottle (Fig. 18.23b). $\phi = 0$ is an unstable solution, and the ground state is at $|\phi| = v/\sqrt{2} = \sqrt{-\mu^2/2\lambda}$; the magnetization is finite. The ground state can be anywhere that satisfies $|\phi| = v/\sqrt{2}$ along a circle on the bottom of the wine bottle. Putting $\phi = (\phi_1 + i\phi_2)/\sqrt{2}$ we choose the ground state at

$$\phi_1 = v, \qquad \phi_2 = 0 \tag{18.102}$$

17) $V(0)$ is considered as the va[...]ergy. Its absolute value becomes important if gravity [...]

As the ground state is the vacuum in field theory, the field $\phi$ has a finite vacuum expectation value:

$$\langle\phi\rangle = \langle 0|\phi|0\rangle = v/\sqrt{2} \tag{18.103}$$

Since we always measure excited energy (particle creation) from the ground state (vacuum), and the new vacuum is at $\phi_1 = v$, we define a new field

$$\phi_1' = \phi_1 - v \tag{18.104}$$

with vacuum expectation value

$$\langle\phi_1'\rangle = \langle\phi_2\rangle = 0 \tag{18.105}$$

The field described in terms of $\phi_1'$, $\phi_2$ does not respect the original gauge (rotational) symmetry. This is a case of spontaneous symmetry breaking. In field theory, once the vacuum has been chosen, it is fixed, because there are infinitely many
degrees of freedom and no matter how small the microscopic energy needed to 
change the configuration, if multiplied by infinity, it is infinite. In the case of con- 
densed matter physics, the number of degrees of freedom is large but finite (of the 
order of the Avogadro number) and the ground state can be changed by heating or 
cooling the bulk of the matter. The spontaneously broken symmetry in field theory 
is due to the vacuum that surrounds us, and we are the little men in the ferro- 
magnet we talked about. We cannot change our vacuum unless we heat the whole 
universe! We are in one of the domains. Our whole universe is most likely in one domain. 18)

Note that spontaneous symmetry breaking does not break the symmetry mathematically. Observed phenomena were simply reexpressed in terms of $\phi_1'$ and $\phi_2$.
Mathematically, no symmetry-breaking terms are introduced. All the phenomena 
can equally be described in terms of original variables that respect the symmetry, 
though in this case physical interpretation is much harder. 

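Nambu-Goldstone Boson What does the new field represent? In terms of $\phi_1'$, $\phi_2$, which we rewrite as $\phi_1$, $\phi_2$, the Lagrangian is expressed as

$$\mathcal{L}_G = \frac{1}{2}\left[\partial_\mu\phi_1\partial^\mu\phi_1 - (-2\mu^2)\phi_1^2\right] + \frac{1}{2}\left[\partial_\mu\phi_2\partial^\mu\phi_2\right] - \left[\lambda v\,\phi_1\left(\phi_1^2 + \phi_2^2\right) + \frac{\lambda}{4}\left(\phi_1^2 + \phi_2^2\right)^2\right] \tag{18.106}$$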

The first term represents a massive scalar field with mass •sj—lfi 2 , or the magne- 
tization we talked about, the second is a massless scalar field and the third is the 

18) The boundaries of domains are known as "defects" and they are commonly observed in condensed 
matter physics. There are conjectures that defects may have been formed at the big bang and their 
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interaction between the two. A massless scalar boson appears if a continuous sym- 
metry is spontaneously broken with infinitely degenerate ground (vacuum) states, 
a generalization of Nambu's proposal, which is referred to as the Goldstone theo- 
rem [180]. The massless boson is referred to as the Nambu-Goldstone boson. Does 
such a particle exist in a ferromagnet? The answer is yes, and it is called a magnon. 
Although changing the chosen state to another is difficult, it is possible to give 
a small perturbation locally. Then the spin-spin interaction, which was the cause 
of the spontaneously broken state, transmits the perturbation to neighboring spins 
and the perturbation propagates, making a wave (Fig. 18.24). The magnon is a spin
wave. That its long-wavelength limit recovers the static ground state, i.e. $\omega \to 0$
as $\lambda \to \infty$ or $k \to 0$, means the mass of the quantum is zero. Why does such
an extra boson appear? Originally there were two fields to represent a continuous
symmetry. After condensation, all particles (the field's quanta) are ordered and occupy
the same state. There is no need of two fields to describe an ordered state;
one is enough. Its quantum has to be massive to provide some stability to the state.
The residual field is not constrained and its quantum is massless. One sees in
Fig. 18.23b that in the $\phi_1$ direction it is at the minimum of a potential and takes
some effort to move it (excite the field, i.e. create a particle), but $\phi_2$ lies on a flat
circle, and needs no force to move it.

Figure 18.24 When the magnetization is aligned, a spin wave, a deviation from the ordered
state, appears. In the long-wavelength limit, the ordered ground state is recovered. This
means that its quantum, called the magnon, is a massless particle commonly referred to as
the Nambu-Goldstone boson.
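The statement that one direction costs energy while the other is flat can be checked numerically. The following is a minimal sketch, assuming the potential $V = \mu^2|\phi|^2 + \lambda|\phi|^4$ with $\phi = (\phi_1 + i\phi_2)/\sqrt{2}$; the parameter values `mu2 = -1.0` and `lam = 0.5` are arbitrary illustrative choices, and the function names are hypothetical. It estimates the curvature of $V$ at the vacuum $(\phi_1, \phi_2) = (v, 0)$ by finite differences.

```python
import math

def potential(phi1, phi2, mu2=-1.0, lam=0.5):
    """V = mu^2 |phi|^2 + lam |phi|^4 with phi = (phi1 + i*phi2)/sqrt(2)."""
    mod2 = 0.5 * (phi1 ** 2 + phi2 ** 2)
    return mu2 * mod2 + lam * mod2 ** 2

def curvatures_at_vacuum(mu2=-1.0, lam=0.5, h=1e-4):
    """Second derivatives of V at the vacuum (phi1, phi2) = (v, 0).

    Returns (v, curvature along phi1, curvature along phi2), using
    central finite differences with step h.
    """
    v = math.sqrt(-mu2 / lam)  # position of the minimum, requires mu2 < 0
    V = lambda a, b: potential(a, b, mu2, lam)
    d2_radial = (V(v + h, 0.0) - 2.0 * V(v, 0.0) + V(v - h, 0.0)) / h ** 2
    d2_angular = (V(v, h) - 2.0 * V(v, 0.0) + V(v, -h)) / h ** 2
    return v, d2_radial, d2_angular

v, m2_radial, m2_angular = curvatures_at_vacuum()
print("v          =", v)
print("d2V/dphi1^2 =", m2_radial, "(compare with -2*mu^2 = 2)")
print("d2V/dphi2^2 =", m2_angular, "(flat direction)")
```

The curvature along $\phi_1$ reproduces the mass squared $-2\mu^2$ of the massive mode, while the curvature along $\phi_2$ vanishes up to numerical error, which is the massless mode.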

The question is: do we have Goldstone bosons as an indication of symmetry 
breaking? 

18.6.3 

Higgs Mechanism 

In nature, no massless Goldstone bosons are known and there was difficulty in 
accepting the notion of spontaneously broken symmetry. However, there is a loop- 
hole in Goldstone's theorem and the Goldstone boson does not appear when the 
symmetry is associated with a long-range force like that created by the gauge field. 
In the absence of a long-range force, the wave in the Goldstone mode can prop- 
agate but the force's action does not reach beyond a few neighboring particles. If 
the force is of long range, particles far away are affected simultaneously, and they 
act coherently, constituting a longitudinal wave. In a plasma, as a result of the 
Coulomb force, a longitudinal wave called plasma oscillation is formed. Its action
screens the Coulomb force, converting it to a short-range force. In the language of
field theory, the quantum of the plasma oscillations has mass, namely the photon 
has acquired mass and has become a plasmon. Higgs and others [131, 191, 205] 
showed that when the symmetry is broken in association with the gauge field, the 
Goldstone boson does not appear but instead the gauge boson acquires mass. We 
will see how. 

Mass of the Gauge Boson When a gauge field is introduced, the Lagrangian con- 
taining the self-interacting scalar field (hereafter referred to as the Higgs field) and 
the gauge field is expressed as 

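$$\mathcal{L}_H = (D_\mu\phi)^\dagger(D^\mu\phi) - V(\phi) - \frac{1}{4}F_{\mu\nu}F^{\mu\nu} \tag{18.107a}$$

$$D_\mu = \partial_\mu + iqA_\mu, \qquad V(\phi) = \mu^2(\phi^\dagger\phi) + \lambda(\phi^\dagger\phi)^2 \tag{18.107b}$$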

Let us see what happens after the gauge symmetry is spontaneously broken. It is
convenient to decompose $\phi$ not linearly like $\phi = (\phi_1 + i\phi_2)/\sqrt{2}$ but exponentially:

$$\phi = \frac{1}{\sqrt{2}}\,(v + \phi_1')\,e^{i\phi_2/v} \tag{18.108}$$


Exponential Decomposition

Mathematically it is always possible to transform into this form as long as
the number of degrees of freedom does not change. Then the difference of
$\phi_2$ in the two expressions is a matter of physical interpretation. $\phi_2$ in this
form is called a phase field. If expanded in powers of $\phi_i/v$, it is the same as
the expansion $\phi = (v + \phi_1 + i\phi_2)/\sqrt{2}$ if higher order terms are neglected.
Namely, as long as the excitation of the field is small, where the particle concept
of the field is valid anyway, the difference is small. Remember, the field
quantization is applied to harmonic oscillators, that is, using up to quadratic terms
in the Taylor expansion of the potential around the minimum, and the rest
is treated as interaction terms. The difference appears as a global feature of
the field, which reflects the collective effects of the interaction. For instance,
a local plane wave could globally be part of a spherical wave at a large distance or
a circulating wave whose radius is large. In spontaneous symmetry breaking,
we are discussing what will happen if we extend our approximation up
to the quartic power of the Taylor expansion. In Fig. 18.23b $\phi_2$ is constrained
to lie on a circle and has periodicity, hence its name. But locally, as long as it is in
the neighborhood of $\phi_2 = 0$, the linear treatment is equally valid.

Problem 18.5

Show that for $\phi_1', \phi_2 \ll v$, Eq. (18.108) is equivalent to

$$\phi \simeq \frac{1}{\sqrt{2}}\,(v + \phi_1' + i\phi_2)$$


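If we rename $\phi_1'$ as $\phi_1$ and replace the gauge field by

$$B_\mu = A_\mu + \frac{1}{qv}\,\partial_\mu\phi_2$$

the Lagrangian takes the form

$$\begin{aligned}
\mathcal{L}_H &= \frac{1}{2}\left[\partial_\mu\phi_1\partial^\mu\phi_1 - (-2\mu^2)\phi_1^2\right] - \lambda v\,\phi_1^3 - \frac{\lambda}{4}\phi_1^4 \\
&\quad - \frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \frac{1}{2}(qv)^2 B_\mu B^\mu \\
&\quad + \frac{1}{2}q^2 B_\mu B^\mu\,(\phi_1^2 + 2v\phi_1)
\end{aligned} \tag{18.111}$$

$$F_{\mu\nu} = \partial_\mu B_\nu - \partial_\nu B_\mu$$
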

Problem 18.6 
Derive Eq. (18.111). 

In this expression, there is no Goldstone boson. Notice that the second line represents
a massive vector field [see Eq. (5.22)]. $\phi_2$ is absorbed as the third (longitudinal)
component of the gauge field, which has mass $m_B = qv$. Note that the
number of degrees of freedom is the same as before. Originally, there were one
complex scalar field (or two real scalar fields) and a massless gauge vector boson,
which has two degrees of freedom. After symmetry breaking, there are one real
scalar field and a massive vector boson, which has three degrees of freedom. It is
often said that the gauge boson became fat by eating a ghost. This process, in which
a component of the Higgs field is converted to the longitudinal component of
the gauge field to make it massive, is called the Higgs mechanism.
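
The absorption of $\phi_2$ into the gauge field can be traced in a few lines of algebra. The following is a sketch rather than the book's own derivation: it assumes that the exponential decomposition has the form $\phi = (v+\phi_1)\,e^{i\phi_2/v}/\sqrt{2}$ and uses the covariant derivative of Eq. (18.107b).

```latex
\begin{align*}
D_\mu\phi &= (\partial_\mu + iqA_\mu)\,\frac{v+\phi_1}{\sqrt{2}}\,e^{i\phi_2/v} \\
          &= \frac{e^{i\phi_2/v}}{\sqrt{2}}
             \left[\partial_\mu\phi_1
             + iq\,(v+\phi_1)\left(A_\mu + \frac{1}{qv}\,\partial_\mu\phi_2\right)\right], \\
(D_\mu\phi)^\dagger(D^\mu\phi)
          &= \frac{1}{2}\,\partial_\mu\phi_1\partial^\mu\phi_1
             + \frac{1}{2}\,q^2\,(v+\phi_1)^2\,B_\mu B^\mu ,
\qquad B_\mu \equiv A_\mu + \frac{1}{qv}\,\partial_\mu\phi_2 .
\end{align*}
```

Under these assumptions the piece $\frac{1}{2}(qv)^2 B_\mu B^\mu$ is a vector mass term with $m_B = qv$, and $\phi_2$ no longer appears as an independent field: two scalar plus two vector degrees of freedom before, one scalar plus three vector degrees of freedom after.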

In order to choose a vacuum from an infinite number of degenerate states,
we have used the freedom of gauge transformation Eq. (18.101) as expressed in
Eq. (18.108)

$$\phi \to e^{-i\alpha}\phi \iff \phi \to \phi' = e^{-i(\phi_2/v)}\phi \tag{18.112}$$

In other words, the vacuum has been fixed by a special kind of gauge transforma- 
tion. The gauge thus chosen is commonly referred to as the unitary gauge. 

A Hamiltonian that can be extracted from a nonrelativistic version of Eq. (18.107) 
is a well-known formula in condensed matter physics and is referred to as the 
Ginzburg-Landau free energy for superconductivity [see the boxed paragraph on
confinement, p. 769, and Eq. (18.85)]. $\phi$ denotes the Cooper pair wave function.
Below the critical temperature, Cooper pairs are produced en masse (condensation) and
matter is transformed to the superconducting phase. A superconductive medium
repels the magnetic field by producing eddy currents that compensate the intruding
magnetic field perfectly (perfect diamagnetism, referred to as the Meissner effect) except
in a thin skin depth. This is due to condensed Cooper pairs acting coherently as
a macroscopic quantum fluid. In the language of field theory, the photon (gauge 
field) has acquired mass in the superconductor and the electromagnetic force has 
changed to a short-range (~ skin depth) force with the range given by the Comp- 
ton wavelength of its mass. In the superconductor, the gauge invariance is broken 
spontaneously. Adopting the Higgs mechanism is to consider that the vacuum that 
surrounds us is in the superconductive phase. The temperature of the phase tran- 
sition is of the same order as the vacuum expectation value v. 

What is the meaning of the vacuum expectation value? In magnetism, it means a 
finite magnetization in the ground state where all the spins are aligned, a situation 
in which a great number of particles occupy the same state, which is referred to 
as (Bose-Einstein) condensation in condensed matter physics. When the field has 
a finite vacuum expectation value, it means condensation to the ground state. All 
the particles act coherently, or as a macroscopic quantum state. In this state the 
characteristic field behavior is more that of a wave than that of particles. In the 
Higgs mechanism, the condensate is the Higgs field, an equivalent of a sea of 
Cooper pairs acting as a superfluid, which screens the force, converting it to a 
short-range force. 
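
To put a number on "short range", one can evaluate the reduced Compton wavelength $\hbar/(mc)$ of the mediating boson. The snippet below is a rough sketch: the conversion constant $\hbar c \approx 0.1973$ GeV fm is an assumed input not given in the text, the function name is hypothetical, and the W and Z masses are the values quoted in this chapter.

```python
import math

HBAR_C_GEV_FM = 0.1973269804  # hbar * c in GeV * fm (assumed input)

def force_range_fm(mediator_mass_gev):
    """Range ~ hbar / (m c), the reduced Compton wavelength, in femtometers.

    A massless mediator gives an infinite range (Coulomb-like force).
    """
    if mediator_mass_gev == 0.0:
        return math.inf
    return HBAR_C_GEV_FM / mediator_mass_gev

# Masses in GeV; 80.4 and 91.19 are the W and Z values quoted in the text.
for name, mass in [("photon", 0.0), ("W", 80.4), ("Z", 91.19)]:
    print(f"{name:6s} range = {force_range_fm(mass):.4g} fm")
```

With these inputs the weak force reaches only a few thousandths of a femtometer, far less than the size of a nucleon, whereas the massless photon gives an unlimited range.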

Fermion Mass We have explained the finite mass of the gauge boson as a result 
of spontaneous breakdown of the gauge symmetry. What about the fermion mass? 
The vanishing of the fermion mass was the result of chiral invariance. We shall 
show that the fermion mass can also be generated as a result of spontaneous sym- 
metry breaking. Let us assume that the Higgs field has an interaction with the 
fermion field. Adopting a Yukawa-type interaction and denoting the fermion field
as $\psi$, we can write the Lagrangian in Lorentz-invariant form

$$-\mathcal{L}_{\rm Yukawa} = f\,\bar{\psi}_L\psi_R\,\phi + \text{h.c.} \tag{18.113}$$

The chiral invariance can be maintained by assigning suitable hypercharge to $\psi_L$, $\psi_R$
and $\phi$, namely to satisfy $-Y_{\psi_L} + Y_{\psi_R} + Y_\phi = 0$. When the symmetry is broken
spontaneously, the Higgs field acquires a vacuum expectation value, i.e. $\phi \to (v + \phi_1)/\sqrt{2}$,

$$-\mathcal{L}_{\rm Yukawa} = \left(\frac{fv}{\sqrt{2}} + \frac{f}{\sqrt{2}}\,\phi_1\right)\left(\bar{\psi}_R\psi_L + \bar{\psi}_L\psi_R\right) = m\,\bar{\psi}\psi + \frac{m}{v}\,\bar{\psi}\psi\,\phi_1, \qquad m = \frac{fv}{\sqrt{2}} \tag{18.114}$$

Thus we have created the fermion mass term, but as a by-product we have also
induced a fermion-Higgs interaction. There are no principles or guides to constrain
fermion-Higgs couplings, hence the fermion mass is an additional arbitrary
parameter. This applies also to the mass of the Higgs particle itself ($m_H^2 = -2\mu^2 = 2\lambda v^2$),
which is proportional to the strength of the self-coupling constant $\lambda$ and has
to be determined experimentally.

What we have learned so far is summarized as follows. It is possible to generate 
mass terms in the gauge-invariant Lagrangian consisting of massless fermions and 
gauge bosons (referred to as the gauge sector). The mass term was generated by 
the interaction of the Higgs field with the gauge field as a result of spontaneous 
symmetry breaking. Its Lagrangian (Higgs sector) is also gauge invariant, as is seen 
from Eq. (18.107). Extracting the mass term only from the Higgs sector and adding 
it to the gauge sector violates the gauge invariance. But if we take into account the 
whole Higgs sector as well as the mass term, the gauge invariance is preserved. If 
the Higgs mechanism is right, we have to include the Higgs field in the calculation 
of higher order terms of gauge particle interactions. Above all, the Higgs particle 
itself has to exist. 



18.6.4

Glashow-Weinberg-Salam Electroweak Theory

We are now ready to formulate a gauge-invariant electroweak theory. We discuss
only $(\nu_e, e^-)$ doublets as before. Other doublets can be treated similarly.

Higgs Doublet We have succeeded in constructing a gauge-invariant theory with
finite mass of the gauge boson conceptually. What remains to be done is to incorporate
the Higgs mechanism into the weak interaction framework based on
$SU(2) \times U(1)$. In order to do this, we need at least four Higgs fields. Three would
disappear, giving mass to $W^\pm$ and $Z$, and the remaining one would become a
massive Higgs field. There are many ways to incorporate four Higgs fields, but the
simplest one, which Weinberg and Salam (WS) adopted, is to assume the existence
of an isospin doublet of a complex scalar field and its charge conjugate. We express
them as

$$\Phi = \begin{bmatrix}\phi^+ \\ \phi^0\end{bmatrix}, \qquad \Phi^c = -i\tau_2\Phi^* = \begin{bmatrix}-\bar{\phi}^0 \\ \phi^-\end{bmatrix} \tag{18.115}$$

Then using the Nishijima-Gell-Mann law, we have $Y(\Phi), Y(\Phi^c) = 1, -1$. When
they acquire vacuum expectation values, we write them as

The Lorentz-invariant Yukawa interaction respecting $SU(2) \times U(1)$ symmetry is
given by

$$-\mathcal{L}_{\rm Yukawa} = f_e\left[\bar{e}_R(\Phi^\dagger\Psi_L) + (\bar{\Psi}_L\Phi)\,e_R\right]$$

As the neutrino mass is assumed to be zero, we do not include its mass term here,
but if one wants to give mass to the $I_3 = +1/2$ member of the doublets, one uses
$\Phi^c$ instead of $\Phi$.


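$SU(2) \times U(1)$ The total Lagrangian for $(\nu_e, e^-)$ and $W^\pm$, $Z$, $\gamma$ and the Higgs
respecting $SU(2) \times U(1)$ symmetry can be expressed as

$$\begin{aligned}
\mathcal{L} &= \bar{\Psi}_L\, i\gamma^\mu D_\mu \Psi_L + \bar{e}_R\, i\gamma^\mu D_\mu e_R - \frac{1}{4}\boldsymbol{F}_{\mu\nu}\cdot\boldsymbol{F}^{\mu\nu} - \frac{1}{4}B_{\mu\nu}B^{\mu\nu} \\
&\quad + (D_\mu\Phi)^\dagger(D^\mu\Phi) - V(\Phi) - f_e\left[\bar{e}_R(\Phi^\dagger\Psi_L) + (\bar{\Psi}_L\Phi)\,e_R\right] \\
\boldsymbol{F}_{\mu\nu} &= \partial_\mu\boldsymbol{W}_\nu - \partial_\nu\boldsymbol{W}_\mu - g_W\,\boldsymbol{W}_\mu\times\boldsymbol{W}_\nu \\
B_{\mu\nu} &= \partial_\mu B_\nu - \partial_\nu B_\mu \\
V(\Phi) &= \lambda\left(|\Phi|^2 + \frac{\mu^2}{2\lambda}\right)^2
\end{aligned} \tag{18.118}$$
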



The gauge sector is the first line of the Lagrangian, and the Higgs sector the second
line. We need to rewrite $W^0_\mu, B_\mu$ in terms of $Z_\mu, A_\mu$. When the Higgs field acquires
a vacuum expectation value, one can obtain the Lagrangian of GWS theory. It is
equivalent to Eq. (18.95) derived previously plus the mass term and the Higgs field.

Mass of W and Z The mass of W is already given in Eq. (18.99), but to derive
the mass of Z, we need to know the Weinberg mixing angle and the Higgs field
vacuum expectation value. They can be obtained by substituting



into the kinetic energy term (the first term in the second line) of the Higgs
sector in Eq. (18.118) to give


The mass of W is also related to the Fermi coupling constant by Eq. (18.99). From
this, if one can obtain the value of $\sin\theta_W$ experimentally, $m_W$, $m_Z$, $v$ can be
calculated. Using the experimentally observed value $\sin^2\theta_W = 0.2231$ [311]

$$m_W^2 = \frac{\sqrt{2}\,e^2}{8G_F\sin^2\theta_W} = \left(\frac{37.3\ \text{GeV}}{\sin\theta_W}\right)^2 = (80.4\ \text{GeV})^2 \tag{18.121a}$$

$$m_Z = \frac{m_W}{\cos\theta_W} = 91.19\ \text{GeV} \tag{18.121b}$$

$$v = \frac{1}{\sqrt{\sqrt{2}\,G_F}} = 250\ \text{GeV} \tag{18.121c}$$

Problem 18.7 

Derive $m_W$, $m_Z$ from Eq. (18.118).
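
The numerical chain in Eq. (18.121) can be retraced with a few lines of arithmetic. This is a sketch under stated assumptions: the values of the fine-structure constant and the Fermi constant are inputs assumed here (they are not given in the text), the function name is hypothetical, and only the lowest-order (tree-level) relations are used. With the low-energy value of $\alpha$, the tree-level W and Z masses come out a few percent below the quoted 80.4 GeV and 91.19 GeV; that gap is commonly attributed to higher-order (radiative) corrections, which this sketch ignores.

```python
import math

# Assumed inputs in natural units (not taken from the text).
ALPHA = 1.0 / 137.036   # fine-structure constant at zero momentum transfer
G_F = 1.1663787e-5      # Fermi coupling constant in GeV^-2
SIN2_THETA_W = 0.2231   # sin^2(theta_W), the value quoted in the text

def tree_level_masses(alpha=ALPHA, g_f=G_F, sin2=SIN2_THETA_W):
    """Lowest-order relations between G_F, alpha, sin^2(theta_W) and masses.

    Returns (a0, m_w, m_z, v) in GeV, where a0 = sqrt(pi*alpha/(sqrt(2)*G_F))
    is the constant that appears as 37.3 GeV in Eq. (18.121a).
    """
    a0 = math.sqrt(math.pi * alpha / (math.sqrt(2.0) * g_f))
    m_w = a0 / math.sqrt(sin2)              # m_W = a0 / sin(theta_W)
    m_z = m_w / math.sqrt(1.0 - sin2)       # m_Z = m_W / cos(theta_W)
    v = 1.0 / math.sqrt(math.sqrt(2.0) * g_f)
    return a0, m_w, m_z, v

a0, m_w, m_z, v = tree_level_masses()
print(f"a0  = {a0:.2f} GeV")
print(f"m_W = {m_w:.2f} GeV (tree level)")
print(f"m_Z = {m_z:.2f} GeV (tree level)")
print(f"v   = {v:.1f} GeV")
```

The constant `a0` reproduces the 37.3 GeV of Eq. (18.121a), and `v` comes out close to 246 GeV, which the text rounds to 250 GeV.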



18.6.5 

Summary of GWS Theory 

We have reached an amazing conclusion: that the mass is not an inherent property 
of a particle but one that has been acquired as a result of environmental change. 
The GWS theory is constructed on the gauge principle based on $SU(2) \times U(1)$
symmetry that is spontaneously broken. Requiring the existence of the Higgs field,
which is responsible for the symmetry breaking, allowed the finite mass of the
gauge bosons as well as that of fermions to be incorporated in the theory without
breaking the gauge invariance. It is a unified theory of the electromagnetic and
weak interactions 19) and hence is referred to as the electroweak theory.
Renormalizability of the theory was proved by 't Hooft in 1971 [359, 360]. Therefore, it is
mathematically closed, self-consistent and has strong predictive power. All dynam- 
ical phenomena caused by the electroweak force are calculable to arbitrary accuracy, 
at least in principle. The theory succeeded in reproducing all phenomena known 
at that time and so its proof resided in the prediction of new phenomena. 



When Was the GWS Model Accepted as the Standard Model? 

It is interesting to note that such a revolutionary theory was barely noticed 
upon its publication. According to citation records, the number of citations 
was 0 in 1967, 0 in 1968, 0 in 1969, 1 in 1970, 4 in 1971, 64 in 1972, 162
in 1973 and 226 in 1974 [102]. This includes Weinberg's own citations. Why 
was such an outstanding paper not noticed? Because it was uncertain if the 
theory was renormalizable. Weinberg and Salam guessed that it would be if 
the symmetry was broken spontaneously but left it unsolved. It was clear that 
the sudden increase of the citation is due to 't Hooft's paper [359, 360], which
provided a proof of renormalizability. Since then, Weinberg's seminal paper
on the subject [383] ranks at the top in citations to this day (6815 as of July 2009),
surpassed only by the Review of Particle Physics, which is a compilation of all the
world data in high-energy physics (see [348]). The first experimental evidence
supporting the GWS model was the discovery of neutral current reactions in
1973 [199, 200], but what really established its validity beyond doubt was the
discovery of W and Z in 1983 with exactly the mass predicted [372-375]. But
why did it take as long as 16 years to prove the model? I would argue that the
year the model was accepted by the community was 1978. By then, the majority
already believed it, but there was one piece of counter evidence against
it. Some atomic parity-violation experiments did not confirm the prediction,
an episode that is now history. Convincing evidence to show that the GWS model, not the
atomic experiment, was right was presented at ICHEP 78 (International Conference
for High Energy Physics) by the SLAC group. It was the detection of
parity violation in polarized electron scattering off a deuteron target [326, 327].
In addition, a compilation of measured Weinberg angles from a variety of
experiments was shown to agree with each other [41] and singled the Weinberg-Salam
model out from others. Ample evidence for QCD, including precision
experiments on deep inelastic scattering, was also presented at the conference.
The next year, Glashow, Weinberg and Salam were awarded the Nobel Prize.

19) It is not unified in the strict sense because $SU(2)$ and $U(1)$ are not subgroups of a bigger group.
The two coupling constants of $SU(2)$ and $U(1)$ are connected through Weinberg mixing, but there
are still two coupling constants. It is unified in the sense that we can formulate two forces in a
unified way.

The first experimental evidence for GWS theory was the discovery of neutral current
reactions [199, 200]. The fact that all the neutral current reactions can be reproduced
with one universal parameter, the Weinberg angle, convinced people of the
rightness of the theory. But the definitive evidence was provided by the discovery
of W and Z [372-375] having exactly the right mass as predicted. Since then many
experiments, for example, SPEAR at SLAC (7.4 GeV, 1972-1990), PETRA at DESY
(40 GeV, 1978-1990) and TRISTAN at KEK (60 GeV, 1987-1996) confirmed the
predictions of GWS theory. Above all, LEP at CERN (100-200 GeV, 1989-2000),
in which the center of mass (CM) energy was set at $\sqrt{s} = m_Z \simeq 91$ GeV, made a
number of high-precision measurements confirming the higher order predictions
of the electroweak theory. They could be compared with the Lamb shift experiment
in QED in establishing the validity of the Standard Model to high precision (see for
instance [184, 365]).

Among hadron colliders, the $Sp\bar{p}S$ (Super Proton Synchrotron, $\sqrt{s} = 540 \to 630$ GeV,
1979-1990) discovered $W^\pm$ and $Z$ in 1983. The Tevatron, which operated
1983-2009, discovered the top, the last quark in the Standard Model. Both of
them provided a variety of hadron jet data confirming many predictions of QCD.
B factories (PEPII at SLAC, $E = 3.1 + 9$ GeV, 1998-2008, and KEKB at KEK,
$E = 3.5 + 8$ GeV, 1999 onwards) confirmed that CP-violation data fit well with
the Kobayashi-Maskawa framework. The only major remaining issue in the Standard
Model is the discovery of the Higgs particle, which is expected to be found
soon when the LHC starts operation. 1)

All the known data so far, mainly obtained at electron-positron and hadron col- 
liders, have been consistent with the Standard Model. The only evidence pointing 
beyond the Standard Model is neutrino oscillation, but whether this phenomenon 
is outside of GWS theory is debatable and depends on the exact definition of the 
Standard Model. This only shows how perfect the Standard Model is, but at the 
same time it is a disappointment in the sense that progress towards new physics 
has been very slow. Therefore we will close this book by making a few comments 
on what lies beyond the Standard Model. 

1) The LHC is a p-p colliding accelerator at $\sqrt{s} = 14$ TeV and luminosity $L$ of $10^{33}$-$10^{34}$ cm$^{-2}$ s$^{-1}$. It
began operation in 2009. More discoveries mentioned later in this chapter are also expected.

Completing the Picture



Generation Puzzle and Mass Hierarchy Despite its great success, one caveat of
the Standard Model is that it is powerless in explaining the generation (or family)
structure. The three generations $(u, d, \nu_e, e^-)$, $(c, s, \nu_\mu, \mu^-)$, $(t, b, \nu_\tau, \tau^-)$
seem to share the same properties except their mass. The Standard Model cannot
tell why there are three and only three generations. Their mass is also a mystery.
Their mass hierarchical structure shown in Fig. 19.1 suggests a certain pattern,
but in the Standard Model all the mass values are parameters to be adjusted. The
mass is generated by the Higgs field and is expressed as $m_i = f_i v/\sqrt{2}$, where
$v = 1/\sqrt{\sqrt{2}\,G_F} = 250$ GeV is the vacuum expectation value of the Higgs field
and $f_i$ is its coupling strength [see Eqs. (18.114) and (18.121c)]. There are no criteria to
determine or constrain the strength of the Higgs coupling, hence they are arbitrary.
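
How arbitrary the couplings are becomes apparent when the relation $m_i = f_i v/\sqrt{2}$ is inverted for a few fermions. The following is an illustrative sketch: the Fermi constant and the electron and muon masses are assumed inputs not given in the text, the top mass is the value quoted in this chapter, and the function names are hypothetical.

```python
import math

G_F = 1.1663787e-5  # Fermi constant in GeV^-2 (assumed input, not from the text)

def higgs_vev(g_f=G_F):
    """Vacuum expectation value v = 1 / sqrt(sqrt(2) * G_F), in GeV."""
    return 1.0 / math.sqrt(math.sqrt(2.0) * g_f)

def yukawa_coupling(mass_gev, v=None):
    """Invert m = f * v / sqrt(2) for the dimensionless coupling f."""
    if v is None:
        v = higgs_vev()
    return math.sqrt(2.0) * mass_gev / v

# Masses in GeV. The top value is the one quoted in the text; the electron
# and muon values are rounded numbers assumed for this example.
masses = {"electron": 0.000511, "muon": 0.1057, "top": 171.0}
couplings = {name: yukawa_coupling(m) for name, m in masses.items()}

for name, f in couplings.items():
    print(f"{name:9s} m = {masses[name]:.6g} GeV  ->  f = {f:.3e}")
```

Under these inputs the top coupling is of order one while the electron coupling is of order $10^{-6}$; nothing in the model relates the two numbers.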



Higgs Mechanism Another caveat is that the Standard Model does not have much
predictive power for phenomena involving mass scales above $\sim 1$ TeV. This is because
it is essentially a low energy phenomenological theory valid below the electroweak
scale characterized by $v = 250$ GeV. The Higgs particle with its associated
mass generating mechanism is at the core of the Standard Model, but we
have no idea about the dynamical properties of the Higgs particle. Its Lagrangian
was chosen for its simplicity and for theoretical convenience with no observational
foundations. Unless we discover the Higgs and investigate its properties, the
detailed dynamical structure of the Higgs condensation cannot be clarified. Without
this knowledge we cannot predict interactions above the critical temperature
where the phase transition and electroweak splitting take place. In terms of
superconductivity, with which the Higgs mechanism can be compared, we have
the Ginzburg-Landau theory, but not the BCS (Bardeen-Cooper-Schrieffer) theory,
which clarified the dynamical mechanism of superconductivity.

To produce the Higgs particle and understand its dynamics is the most urgent
theme in high-energy particle physics. Hopefully, the LHC (Large Hadron Collider
at CERN) will give us hints on them.



19.2 

Beyond the Standard Model 

19.2.1 

Neutrino Oscillation 

So far the only firm experimental evidence beyond the Standard Model is neutrino 
oscillation, discovered in 1998. It is a phenomenon analogous to strangeness oscillation,
where a neutrino of one flavor is converted to another flavor ($\nu_e \to \nu_\mu$,
$\nu_\mu \to \nu_\tau$, etc.) (for a recent review see [42, 87]). It happens if the neutrinos have
nonzero mass and mix with each other. It also means there is a right-handed neu- 



Two-Body Oscillation The neutrino is distinguished by its flavor ($\nu_\alpha = \nu_e, \nu_\mu, \nu_\tau$).
It is identified by its production in association with the corresponding charged
lepton. Namely, the flavor states are the weak force eigenstates identified at the
time of production. In general, they are not necessarily the mass eigenstates ($\nu_i$,
$i = 1$-$3$). If they are different and mix with each other, the flavor eigenstates are
superpositions of mass eigenstates:

$$|\nu_\alpha\rangle = \sum_j U_{\alpha j}\,|\nu_j\rangle \tag{19.1}$$

Then as each eigenstate has a different time dependence, the mixing rate changes
as time passes, inducing the appearance of different flavor eigenstates. Assuming that
the neutrino is stable, and considering the smallness of the neutrino mass, we can
express the time development as

$$|\nu_\alpha(t)\rangle = \sum_j U_{\alpha j}\,|\nu_j\rangle\, e^{-iE_j t}, \qquad E_j = \sqrt{p^2 + m_j^2} \simeq p + \frac{m_j^2}{2E} \tag{19.2}$$

For simplicity, we treat the oscillation between two flavors. Then there is only one
independent mixing matrix element. In terms of a mixing angle $\theta$, the flavor
eigenstate can be written as

$$|\nu_e\rangle = \cos\theta\,|\nu_1\rangle + \sin\theta\,|\nu_2\rangle$$

These equations show that oscillation 00
given by

$$P(\nu_e \to \nu_e; t) = 1 - P(\nu_e \to \nu_\mu; t) \tag{19.5}$$

An optimum condition for the oscillation to be observed is given by $\Delta m^2 L/E \sim 1$.
By careful choice of the value of L/E, a wide range of the mass region can be 
explored. The three-flavor-mixing formulae are different, but so far observed values 
do not change appreciably if analyzed in either way. 

Two kinds of oscillation have been observed. One is disappearance of the atmospheric
neutrino [$\nu_\mu(\bar{\nu}_\mu) \to \nu_\tau(\bar{\nu}_\tau)$: $\nu_\tau$ not observed], which was detected by
comparing downward-going neutrinos with upward-going neutrinos [357]. The energy
of the neutrino $E$ is 1-10 GeV, and the oscillation length $L \sim 10\,000$ km (the
earth's diameter). The other is that of the solar neutrino ($\nu_e \to \nu_\mu, \nu_\tau$), where $E$
is 1-10 MeV and $L \sim 10^8$ km [349]. The observed mass-squared difference and the
mixing angle are given by

$$\Delta m^2_{\rm atm} = (2.5 \pm 0.5) \times 10^{-3}\ \text{eV}^2, \qquad \sin^2 2\theta_{\rm atm} \sim 1.0 \tag{19.6a}$$

$$\Delta m^2_{\odot} = (8.0 \pm 0.3) \times 10^{-5}\ \text{eV}^2, \qquad \sin^2 2\theta_{\odot} \sim 0.86 \pm 0.04 \tag{19.6b}$$

Note that if the three masses are hierarchical ($m_3 \gg m_2 \gg m_1$), the mass difference
is a measure of the neutrino mass itself and $m_\nu < 0.05$ eV is expected.
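
The interplay of $\Delta m^2$, $L$ and $E$ can be explored numerically. The sketch below uses the standard two-flavor vacuum formula $P = \sin^2 2\theta\,\sin^2(1.267\,\Delta m^2 L/E)$ with $\Delta m^2$ in eV$^2$, $L$ in km and $E$ in GeV. That formula is an assumption brought in from the general literature (it is not displayed in the text as it stands here), and the function names are hypothetical. The atmospheric parameters are those of Eq. (19.6a).

```python
import math

def appearance_probability(sin2_2theta, dm2_ev2, length_km, energy_gev):
    """Two-flavor vacuum appearance probability.

    P = sin^2(2 theta) * sin^2(1.267 * dm2 * L / E),
    with dm2 in eV^2, L in km and E in GeV (assumed standard formula).
    """
    phase = 1.267 * dm2_ev2 * length_km / energy_gev
    return sin2_2theta * math.sin(phase) ** 2

def survival_probability(sin2_2theta, dm2_ev2, length_km, energy_gev):
    """Survival probability, i.e. one minus the appearance probability."""
    return 1.0 - appearance_probability(sin2_2theta, dm2_ev2, length_km, energy_gev)

def first_maximum_km(dm2_ev2, energy_gev):
    """Baseline at which the oscillation phase first reaches pi/2."""
    return (math.pi / 2.0) * energy_gev / (1.267 * dm2_ev2)

# Atmospheric parameters from Eq. (19.6a).
SIN2_2THETA_ATM = 1.0
DM2_ATM = 2.5e-3  # eV^2

for energy in (1.0, 10.0):  # GeV
    l_max = first_maximum_km(DM2_ATM, energy)
    print(f"E = {energy:4.1f} GeV: first oscillation maximum at L = {l_max:.0f} km")
```

For energies of 1-10 GeV the first maximum falls at baselines of several hundred to several thousand kilometers, so downward-going neutrinos (tens of kilometers) have hardly oscillated while upward-going ones (up to the earth's diameter) have, which is the comparison described above.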

Implication As the mass of a particle in the Standard Model is generated by its 
coupling with the Higgs, the difference of the mass is ascribed to the difference in 
the Higgs coupling constant. The observed mass ranges from $\sim 171$ GeV ($m_{\rm top}$)
down to 0.01 eV ($m_\nu$). It is not likely that a single constant/mechanism can generate
masses differing by a factor of order $10^{13}$ on the energy scale. Many attempts are
under way to solve the mass hierarchy problem. The hierarchy is vertical (i.e. within
a generation) as well as horizontal (i.e. across the generations). The vertical symmetry
is treated in the grand unified theories (GUTs), to be described soon. Some
seek horizontal symmetry. Yet the neutrino mass problem is special because of
its tiny mass as is demonstrated in Fig. 19.1. A promising scenario to explain it
is the see-saw mechanism [163, 395], in which the existence of a very heavy right- 
handed Majorana neutrino makes the mass of the left-handed neutrino light. The 
model requires the neutrino to be a Majorana particle, whose antiparticle is indis- 
tinguishable from itself, and that the underlying physics is of high-energy origin. 
This naturally leads to GUTs. 

After the discovery of neutrino oscillation, the importance of flavor mixing in 
the lepton sector and CP violation in the PMNS (Pontecorvo-Maki-Nakagawa- 
Sakata) matrix, which has the same meaning as the CKM (Cabibbo-Kobayashi- 
Maskawa) matrix in the quark sector, was recognized and at present various neutri- 
no experiments are being planned in many laboratories across the world [42]. They 
are expected to provide complementary knowledge to what can be obtained at the 
high-energy frontier. 

19.2.2 

GUTs: Grand Unified Theories 

GUTs were discussed long before the discovery of the neutrino oscillation. The suc- 
cess of the Standard Model immediately prompted the idea that the same recipe 
that had worked in unifying electroweak forces might also work if the strong force 
is included. There was another motivation to work on unified theories. The Stan- 
dard Model has over twenty arbitrary parameters, including the three coupling con- 
stants of the electroweak and strong forces, fourteen masses of the weak bosons, 
quarks and leptons, three mixing parameters and one CP violation phase of the 
Kobayashi-Maskawa matrix (to be extended to the MNS matrix, which includes six 
parameters, including two extra Majorana phases), the number of generations, etc. 
This is to be contrasted with general relativity, which includes only one parameter, 
Newton's gravitational constant. A unified theory should not have so many unde- 
cided parameters. 

Another motivation is theoretical, i.e. the absence of the quantum anomaly. We 
have learned that there are two ways to break the symmetry that exists in the La- 
grangian: explicit and spontaneous. In the former the symmetry-breaking term is 
put in by hand, for example by applying a magnetic field in an otherwise rotation- 
ally symmetric environment. The latter is realized when the ground state of the 
Lagrangian breaks the symmetry. A third kind appears only in quantum mechan- 
ics. When symmetry is conserved at the classical Lagrangian level, but broken when 
quantized, we talk about a quantum anomaly. When the anomaly exists in gauge 
theory, which happens in the Standard Model, the gauge symmetry is broken and 
the theory becomes nonrenormalizable. This is serious, but when the anomalies 
are added over all the members of a family in the Standard Model, they miraculously
disappear. The vanishing of the anomaly is guaranteed by the fact that the
sum of the electric charge in a family adds up to zero:

$$\sum_i q_i = (q_u + q_d) \times 3\,(\text{color}) + q_e + q_{\nu_e} = \left\{\frac{2}{3} + \left(-\frac{1}{3}\right)\right\} \times 3 + (-1) + (0) = 0$$



This fact suggests a strong connection between quarks and leptons. GUT models 
can naturally incorporate the mechanism of anomaly cancellation. 
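
The charge sum is simple enough to verify with exact fractions. This small sketch (the function name is hypothetical) also shows that the cancellation depends on the number of colors being three.

```python
from fractions import Fraction

def family_charge_sum(q_up, q_down, q_lepton, q_neutrino, n_colors=3):
    """Sum of electric charges over one family, counting each quark n_colors times."""
    return (q_up + q_down) * n_colors + q_lepton + q_neutrino

# First family (u, d, e, nu_e) in units of the proton charge.
total = family_charge_sum(Fraction(2, 3), Fraction(-1, 3), Fraction(-1), Fraction(0))
print("sum of charges with 3 colors:", total)

# With a different number of colors the sum no longer vanishes.
for n in (1, 2, 4):
    other = family_charge_sum(Fraction(2, 3), Fraction(-1, 3), Fraction(-1), Fraction(0), n)
    print(f"sum of charges with {n} colors:", other)
```

The quark contribution is $+1/3$ per color, so exactly three colors are needed to offset the lepton charge of $-1$, which illustrates the quark-lepton connection mentioned above.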

Since the electroweak theory is a gauge theory based on $SU(2) \times U(1)$ and QCD
on $SU(3)$, it is natural to think of a larger group that contains $SU(3) \times SU(2) \times U(1)$.
The first GUT theory was based on $SU(5)$ [166], a minimal extension of the group
that contains the Standard Model. Extrapolating the running coupling constant of
QCD and others, it is generally agreed that GUT unification is only possible at
extremely high energies $\sim 10^{14}$-$10^{16}$ GeV, commonly referred to as the GUT (energy)
scale. Major predictions of GUT were the instability of the proton and the
existence of the very heavy magnetic monopole at the GUT scale. The failure to
discover the monopole prompted the inflation model, which later became part of
standard cosmological theories. Naive GUT models like the one based on simple
$SU(5)$ are probably excluded by proton decay experiments (see for instance [252]),
which did not observe the decay mode $p \to e^+\pi^0$ and set the lower limit of the
lifetime as $\tau/\text{BR}(p \to e^+\pi^0) > 10^{33\text{-}34}$ years. But GUTs that take into account
supersymmetry predict that the dominant decay mode of the proton is $p \to K^+\bar{\nu}$ and
are still viable models.

19.2.3 
Supersymmetry 

Supersymmetry (SUSY for short) is a symmetry that connects fermions with 
bosons and considers them as one and the same particle but belonging to differ- 
ent states. If the symmetry exists, all bosons (or fermions) should have fermionic 
(bosonic) partners with the same mass, constituting super-multiplets. We know 
there are no such particles. Obviously supersymmetry is broken if it exists. Al- 
though no hints of supersymmetry are around experimentally, there are at least 
two strong theoretical motivations for its existence. 

One is the naturalness problem in GUTs. GUT models consider the strong and 
electroweak forces as united with a common coupling strength. At the GUT energy
($\sim 10^{16}$ GeV), the first spontaneous symmetry breaking occurs and the strong
and electroweak forces are split. The second symmetry breaking occurs at the
electroweak scale ($\sim 1/\sqrt{G_F} \sim$ a few hundred GeV), where the weak gauge bosons
acquire mass and are separated from the electromagnetic force. GUT models contain
a variety of extra gauge and Higgs bosons (hereafter referred to as GUT particles) 
associated with the overarching symmetry, some of which have GUT scale mass in- 
duced by spontaneous symmetry breaking. When one wants to calculate quantum 
corrections of some observables, one has to deal with the two energy scales at every
order of the perturbation series, which is referred to as the hierarchy problem.
For the electroweak observables, quantum corrections due to GUT particles with
energy scale $\sim 10^{16}$ GeV have to be adjusted to the electroweak scale $\sim 100$ GeV.
Since quantum corrections to bosonic mass like that of Higgs appear in quadratic
form, one has to make a fine tuning of order $(10^{16}/10^{2})^2 \sim 10^{28}$ at every order of
the perturbation calculations. This is considered unnatural and referred to as the
"naturalness problem".

Three types of recipes for its remedy have been proposed. One is to consider the 
electroweak gauge bosons W , Z as composites of more fundamental fermions; 
among these the most notable was the technicolor model. The binding energy plays 
the role of a built-in cutoff. Form factors describing the penetration effect of gauge 
bosons at GUT scale can be safely assumed to vanish. However, despite many ef- 
forts, no viable models have been built and the so-called technicolor models have 
been largely abandoned. 

The second is the addition of extra dimensions, which will be discussed later. The 
third is supersymmetry. SUSY requires that for every bosonic loop (i.e. higher or- 
der) correction, there exists a corresponding fermionic loop and they exactly cancel 
each other. If the symmetry is broken, it is still under control provided the break- 
ing scale is of the electroweak scale. All the corrections due to GUT particles are 
canceled away. There is one piece of circumstantial evidence in support of
supersymmetry. The three coupling constants $\alpha$ (QED), $\alpha_W$ [$SU(2)$] and $\alpha_s$ (QCD) are
running if quantum corrections are taken into account. The quantum corrections 
are calculated by including known particles in the loops. Their mass determines at 
which energy scale they have to be included. It was shown that the observed cou- 
pling constants, if extrapolated to GUT scale, do not meet together, in other words, 
the three forces do not unify. On the other hand, if supersymmetry is included 
somewhere above the electroweak scale, the number of particles contributing to 
the quantum corrections doubles and it was shown that all three coupling constants
meet together [20, 253]. SusyGUTs (see Fig. 19.2), i.e. GUTs with built-in
supersymmetry, are favored over non-SusyGUTs.
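
The extrapolation of the three couplings can be illustrated numerically. The following is only a rough sketch, assuming one-loop running, approximate input values at the Z mass (with the hypercharge coupling in the usual GUT normalization), the textbook one-loop coefficients for the Standard Model and its minimal supersymmetric extension, and superpartners switched on already at the Z mass for simplicity. All names and numbers in the code are illustrative assumptions, not values taken from [20, 253].

```python
import math

M_Z = 91.19  # GeV, reference scale

# Approximate inverse couplings at M_Z (assumed inputs; alpha_1 in GUT normalization)
ALPHA_INV_MZ = (59.0, 29.6, 8.5)

# One-loop coefficients (b1, b2, b3): Standard Model and minimal SUSY extension
B_SM = (41 / 10, -19 / 6, -7.0)
B_MSSM = (33 / 5, 1.0, -3.0)


def alpha_inv(i, mu, b):
    """One-loop inverse coupling alpha_i^{-1} at scale mu (GeV)."""
    return ALPHA_INV_MZ[i] - b[i] / (2 * math.pi) * math.log(mu / M_Z)


def crossing_scale(i, j, b):
    """Scale (GeV) at which alpha_i^{-1} and alpha_j^{-1} become equal."""
    t = 2 * math.pi * (ALPHA_INV_MZ[i] - ALPHA_INV_MZ[j]) / (b[i] - b[j])
    return M_Z * math.exp(t)


def mismatch(b):
    """|alpha_3^{-1} - alpha_2^{-1}| at the scale where alpha_1 and alpha_2 meet."""
    mu = crossing_scale(0, 1, b)
    return abs(alpha_inv(2, mu, b) - alpha_inv(1, mu, b))


for name, b in (("SM", B_SM), ("SUSY", B_MSSM)):
    mu = crossing_scale(0, 1, b)
    print(f"{name}: alpha_1 = alpha_2 at {mu:.2e} GeV, mismatch with alpha_3 = {mismatch(b):.2f}")
```

With these assumed inputs the mismatch is several units of $\alpha^{-1}$ without supersymmetry and a small fraction of a unit with it, which is the qualitative statement made in the text.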

The second reason for invoking supersymmetry is the desire to include gravity 
in unified theories. The symmetry operator changes fermions to bosons or vice
versa and hence is a spinor. Amazingly, its anticommutator is identified as the energy-
momentum operator. If it is gauged, i.e. if the operators are made local and include
the corresponding gauge fields, they can accommodate general coordinate trans- 
formations, which means gravity is included. The quantized gauge theory of super- 
symmetry is referred to as supergravity. However, supergravity as a mathematical 
framework for unification has not been very successful. There is an inherent diver- 
gence problem associated with gravity being a tensor force. The problem may be 
avoided if one considers the particle not as a point but a string. This hope triggered 
the idea of superstring theory. 

If supersymmetry exists, every known particle should have its SUSY partner with 
the same mass but with spin differing by 1/2 (see Table 19.1). Since no such par- 
ticles are observed, supersymmetry is broken. But if the symmetry is broken at 
~ 1 TeV level, we expect the symmetry to recover at higher energy and that SUSY 
particles will be discovered with mass in the few hundred GeV region. Since SUSY
requires the superpartner to belong to the same multiplet as the known particle,
its coupling strength is the same as that of its partner. Therefore, reaction topologies
and their reaction rates are calculable if the mass is known. The design of the
detectors at the LHC takes this into account.

Table 19.1 Superparticles.

In the simplest supersymmetric model, each ordinary particle makes a doublet
with its superpartner with spin differing by 1/2. In many models, SUSY particles
are considered to carry R-parity to distinguish them from the ordinary particles.
The left-handed and right-handed quarks and leptons separately make doublets
with their superpartners having spin 0, because conversion of one to the other is
mutual and one cannot convert one SUSY particle with $s = 0$ to $q_L$ and $q_R$ at
the same time. Because of R-parity the lightest SUSY particle is stable; it is
usually assigned to the neutralino or gravitino. It is a prime candidate for dark
matter, which is discussed later.

19.2.4 
Superstring Model 

See [186]. The string as an elementary particle was first proposed as a model for 
the hadron. We have learned that the intra-quark forces share many characteristics 
with the string, Regge trajectories among them (recall discussions in Sects. 13.7.2 
and 14.5.3). Above all, it was pointed out that the duality of the Veneziano model 
was well explained by the string model. However, the hadronic string model had 
too many unwanted particles and it could not explain asymptotic freedom, so it was 
abandoned eventually. 

When the string model was reconsidered at the Planck energy scale ($\sim 10^{19}$ GeV),
the unwanted particles at the GeV-TeV scale turned out to include gravitons, and 
the model was revived as a candidate for the unified theory of all forces, including 
gravity. The string model considers the fundamental particle not as a point par- 
ticle but a one-dimensional string (or even higher dimensional brane) and each 
particle species is one of its vibrational modes. It is a quantized gauge theory 
with supersymmetry incorporated in it, and also an extended general relativity in 
ten-dimensional space-time a la Kaluza-Klein. 

The reason for thinking that the string resides in ten-dimensional space-time 
is the absence of the quantum anomaly. As we live in four-dimensional space- 
time, the extra six-dimensional space is either curled up to a small size or our 
four-dimensional brane world floats in a ten-dimensional bulk. The string model 
even has the possibility of determining the space-time structure of the universe 
uniquely. Theorists became really taken with it and it is often dubbed the theory 
of everything (TOE). Many fascinating ideas have been proposed but are unfortu- 
nately untestable, because the model works essentially at the Planck energy scale. 
There are very few subjects if any that can be related to low-energy phenomena 
for experimental tests. If the only guiding principle is the beauty of the theory, 
there are too many possibilities and the research field runs the potential danger of 
becoming more like mathematics or philosophy than physics. We certainly need 
some observational guidance. 



19.2.5 

Extra Dimensions 

The idea of extra dimensions has already been mentioned in Sect. 18.2.8 in con- 
nection with Kaluza-Klein theory. All the superstring models have built-in extra 
dimensions, but here a different perspective on the extra dimensions is presented. 
Gravity is different from other forces in many respects. The graviton has spin 2, 
its strength is extremely weak, and it is closely related to the space-time structure. 
If we want to treat gravity on an equal footing with other forces, we have to go to 
the Planck energy, which is some 17 orders of magnitude apart in scale. The idea
of extra dimensions [29] offers the possibility of solving this hierarchy problem. 
If space-time is $4 + D$ ($D = 1, 2, \ldots$) dimensional, gravity spreads in
$(3 + D)$-dimensional space. Then Gauss's law tells us that the gravitational force flux (hence
its strength) decreases as $\sim r^{-(2+D)}$. Let us denote the gravitational constant in
$(4 + D)$-dimensional space-time as $G_D = M_D^{-(2+D)}$. $G_D$ is the fundamental gravitational
constant in $(3 + D)$-dimensional space just as $G_N = 1/M_{Pl}^2$ is in three-dimensional
space. We want to have $M_D$ not too different from the electroweak scale, which we
conveniently set at 1 TeV. The gravity under consideration should not contradict
Newton's inverse square law. Therefore, we assume the extra $D$-dimensional space
is curled up at a scale $R$ that is large compared with $M_D^{-1}$. Then the distance in the
extra dimensional direction saturates at $r \sim R$ and the power of the force propagation
decreases to $\sim r^{-2}$, recovering the usual Newton's law. The dimensional argument
constrains the force as

$$ f \sim \frac{1}{M_D^{2}}\,\frac{1}{(M_D R)^{D}}\,\frac{m_1 m_2}{r^{2}} = \frac{1}{M_{Pl}^{2}}\,\frac{m_1 m_2}{r^{2}} $$

$$ \therefore\quad R = \frac{1}{M_D}\left(\frac{M_{Pl}^{2}}{M_D^{2}}\right)^{1/D} = \frac{1}{M_{Pl}}\left(\frac{M_{Pl}}{M_D}\right)^{(D+2)/D} $$

where $m_1$, $m_2$ are masses of the particles introduced for convenience and which
disappear from the final expression. Since for $r \ll R$, $f \sim r^{-(2+D)}$, and for $r \gg R$,
$f \sim r^{-2}$, $D = 1$ is ruled out (for $M_D \sim 1$ TeV the required $R$ would be of
solar-system size). However, Newton's law has never been tested at or
below the scale of 1 mm, hence $D \ge 2$ is allowed. In other words, the fundamental
gravity energy scale being $\sim 1$ TeV is not excluded if one allows extra dimensions.
Then the hierarchy problem may not exist.
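
The size of the curled-up dimensions implied by this argument can be estimated numerically. The sketch below assumes the relation $R = M_D^{-1}(M_{Pl}/M_D)^{2/D}$, a fundamental scale $M_D = 1$ TeV, and rounded values for $\hbar c$ and the Planck mass; the function names are hypothetical and the results are order-of-magnitude estimates only.

```python
HBAR_C_GEV_M = 1.973e-16   # hbar*c in GeV*m (rounded)
M_PLANCK_GEV = 1.22e19     # Planck mass in GeV (rounded)


def compactification_radius(n_extra, m_d_gev=1.0e3):
    """R = (hbar c / M_D) * (M_Pl / M_D)**(2/D), returned in metres."""
    return HBAR_C_GEV_M / m_d_gev * (M_PLANCK_GEV / m_d_gev) ** (2.0 / n_extra)


def kk_level_spacing_ev(n_extra, m_d_gev=1.0e3):
    """Spacing hbar c / R of the Kaluza-Klein levels, returned in eV."""
    return HBAR_C_GEV_M * 1.0e9 / compactification_radius(n_extra, m_d_gev)


for d in (1, 2, 3, 6):
    r = compactification_radius(d)
    print(f"D = {d}: R ~ {r:.1e} m, level spacing ~ {kk_level_spacing_ev(d):.1e} eV")
```

Under these assumptions $D = 1$ gives a radius far larger than the Earth-Sun distance, while $D = 2$ gives a radius of the order of a millimetre, in line with the statement that only $D \ge 2$ survives.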

The idea of extra dimensions triggered many models and tests of them have 
been proposed. The values listed above are somewhat more limited now, but the 
basic idea remains valid. One may refer to [311], p. 1272 for recent developments. 
In some models like one we described above, only the graviton lives in the extra 
dimensions and ordinary particles are confined to the four-dimensional Minkowski
space. In other models, gauge particles or all the Standard Model particles reside
in the extra dimensions, too. Gravitons (denoted as $G$) are produced in
$e^- + e^+ \to \gamma\,(W^\pm, Z) + G$ or in hadronic colliders ($q + \bar{q} \to$ gluon $+\,G$). Since they are emitted
in the extra-dimensional space as well as in real space, the signal in the hadron
collider is a monojet + missing energy.
The momentum of the emitted graviton in the extra dimensions appears in the
four-dimensional world as mass,

$$ P^2 + \sum_i p_i^2 + m^2 = P^2 + m_{KK}^2 \tag{19.9} $$

which is referred to as the KK (Kaluza-Klein) tower. Its spectrum is discrete because
of the finite size of the extra dimensions, but the spacing is small
($\Delta E \sim \hbar c/R \sim 10^{-4}$ eV for $D = 2$) and is almost continuous. Therefore, although
the production rate is small for one KK graviton, the phase space volume
for emitting the KK gravitons is large and the total production rate is at a detectable 
level. The detection of virtual graviton effects and mini black holes has also been 
discussed. 

19.2.6 
Dark Matter 

The existence of dark matter has been clarified by astrophysical observations. It is 
so named because it does not radiate any electromagnetic waves and its existence 
is only detected by its gravitational force. 

The existence of dark matter was first suggested by Zwicky in 1933 [401] to ex- 
plain the violent random motion of galaxies in the Coma Cluster and revived by 
Rubin and others in their observations of rotational curves of spiral galaxies [337] 
in the 1970s. The motion of neutral hydrogens detected as 21.1 cm radio waves (see 
Fig. 4.1) showed that the rotational velocity of spiral galaxies, including the Milky 
Way, stays constant far beyond the boundary of shining stars and unambiguous- 
ly proved the existence of dark matter (Fig. 19.3). Since then dark matter has also 
been observed in many other places, including gravitational lensing. On the the- 
oretical side, the properties of dark matter have been investigated in conjunction 
with cosmic expansion to produce the observed large-scale structure of the galaxy 
distribution and cosmic microwave background radiation. 

Problem 19.1 

Show that the rotational velocity goes down as $\sim \sqrt{M/r}$ if the mass is concentrated
at the center and that to make the rotational velocity constant the mass of a galaxy
has to grow as $\sim r$.
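
A quick numerical illustration of the two scalings in Problem 19.1, assuming circular orbits under Newtonian gravity so that $v = \sqrt{G_N M(r)/r}$, where $M(r)$ is the mass enclosed within radius $r$. The function name and the sample mass and radius are arbitrary illustrative choices.

```python
import math

G_N = 6.674e-11  # Newton's constant in m^3 kg^-1 s^-2


def circular_velocity(enclosed_mass_kg, radius_m):
    """Speed of a circular orbit around the enclosed mass."""
    return math.sqrt(G_N * enclosed_mass_kg / radius_m)


mass, radius = 2.0e41, 3.0e20  # arbitrary sample values
v0 = circular_velocity(mass, radius)
# Central mass only: quadrupling the radius halves the velocity.
print(circular_velocity(mass, 4 * radius) / v0)
# Enclosed mass growing in proportion to r: the velocity stays the same.
print(circular_velocity(4 * mass, 4 * radius) / v0)
```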

What are the properties of dark matter? It is electrically as well as color-wise neu- 
tral, which means that the interaction with ordinary matter is weak. Its properties 
do not fit with known particles in the Standard Model. The best candidate for dark
matter is the stable lightest neutral SUSY particle, known as the neutralino, which
is a mixture of the superpartners of the photon, Z and Higgs particle. Another
candidate is the axion, a hypothetical very light (pseudo-)scalar particle introduced to
explain the so-called strong CP problem in QCD [314, 315]. 2) From cosmological
arguments it is known that the dark matter is cold, i.e. its temperature is low, or it
exists as a mass cluster that is not moving appreciably.

Many experiments to search for dark matter are under way. The idea is to catch 
it in the halo of the Milky Way galaxy. The solar system, including the earth, is 
orbiting around the center of the galaxy, which gives rise to a wind of dark matter. The
signal is a tiny ionization loss liberated by collision of slow dark matter with the 
detector material. There is also a hope that it may be created directly in the LHC. 



19.2.7 
Dark Energy 



The existence of the cosmological constant in the cosmic evolution equation was 
suspected for a while by analysis of large scale structure formations. The first direct 
evidence was given by observation of far away type Ia supernovae, which exhibited
the accelerating expansion of the universe [320, 331] (Fig. 19.4). 

Just as the Standard Model of particle physics was established in the 1970s, the
cosmological Standard Model (referred to as the concordance model) was
established in 2003 when the precision data of WMAP 3) were analyzed and a consistent
view of the dynamical history of the universe emerged [242, 351]. According
to the concordance model, the energy content of our universe is dominated by dark
energy and dark matter (see Fig. 19.5). The age of the universe was determined
to be 13.69 ± 0.13 billion years [125]. Another outcome was that the space-time
curvature of the universe was found to be consistent with zero.

What is the dark energy? How do we know that it is there? It can only be proved 
indirectly by detecting the acceleration of the expanding universe and analysis of 
cosmic evolution. We will elaborate a bit on why we believe that dark energy exists 
by considering the cosmic expansion. It is governed by the Friedman equation

$$ H^2 \equiv \left(\frac{\dot a}{a}\right)^2 = \frac{8\pi G_N}{3c^2}\,\rho - \frac{kc^2}{a^2} + \frac{\Lambda}{3} \tag{19.10} $$

where $H = \dot a/a$ is the expansion rate of the universe, $\rho = \rho_{matter} + \rho_{radiation} + \rho_{vacuum}$
is the energy density, $k$ is the curvature of space-time and $\Lambda$ is the cosmological
constant. The value of $H$ at the present time (denoted as $H_0$) is referred to as the Hubble
constant. $a$ is a scale factor and can be any spatial distance, but conveniently one
might think of it as the size of the universe or the average distance between galaxy clusters.

3) The Wilkinson Microwave Anisotropy Probe, which was launched in 2001. It has mapped the 
cosmic microwave background (CMB) radiation (the oldest light in the universe) and produced the 
first fine-resolution (0.2°) full-sky map of the microwave sky. Details can be obtained from [292].

Figure 19.5 WMAP data reveal that the cosmic energy content includes only 4.6% atoms,
the building blocks of stars and planets. Dark matter comprises 23% of the universe. This
matter, different from atoms, does not emit or absorb light. It has only been detected
indirectly by its gravity. 72% of the universe is composed of "dark energy", which produces
antigravity. This energy, distinct from dark matter, is responsible for the present-day
acceleration of the universal expansion. Credit: NASA / WMAP Science Team [292].



The equation is derived from Einstein's equation of general relativity. However, it 
can be made to appeal to intuitive understanding (albeit vaguely) if one derives it 
using only a knowledge of undergraduate mechanics and thermodynamics. Apply 
Newton's familiar equation of motion in a gravitational field

$$ \frac{1}{2} m v^2 - G_N \frac{mM}{r} = E \tag{19.11} $$

to a test particle (mass $m$) at the periphery ($r$) of a sphere, speeding away from
the center of the sphere, anywhere in the universe. The test particle, which can be
imagined as a galaxy, feels the gravity from the mass inside the sphere but not from
outside. Convert $v \to \dot r$, $M \to (4\pi/3)(\rho/c^2)\,r^3$, enlarge the notion of mass density
to include radiation as well as vacuum energy density, set $2E/mc^2 \to -k$, and one
gets the Friedman equation.
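
The substitutions just described can be written out step by step. This is a sketch only, assuming the scale factor is identified with the radius of the sphere ($a \equiv r$) and writing Newton's constant as $G_N$; a cosmological term has no Newtonian counterpart and would have to be added by hand.

```latex
\begin{align*}
\tfrac{1}{2}\, m \dot r^{2} - G_N \frac{m}{r}\cdot\frac{4\pi}{3}\frac{\rho}{c^{2}}\, r^{3} &= E \\
\dot r^{2} &= \frac{8\pi G_N}{3c^{2}}\,\rho\, r^{2} + \frac{2E}{m} \\
\left(\frac{\dot r}{r}\right)^{2} &= \frac{8\pi G_N}{3c^{2}}\,\rho - \frac{k c^{2}}{r^{2}},
\qquad \text{with } \frac{2E}{mc^{2}} \to -k .
\end{align*}
```

The sign of $E$ decides the fate of the test particle: $E < 0$ ($k > 0$) is a bound orbit that eventually falls back, while $E \ge 0$ ($k \le 0$) escapes, which is the Newtonian picture behind eternal expansion versus eventual collapse.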

Problem 19.2 

Show that the critical density of the universe is given by

$$ \rho_c = \frac{3 c^2 H_0^2}{8\pi G_N} \tag{19.12} $$

and confirm the value using the observed value of the present expansion rate
$H_0 = 73.3\ \mathrm{km\,s^{-1}\,Mpc^{-1}}$, 1 Mpc = $10^6$ pc, 1 pc = 3.262 light years. The critical energy
density is defined as that which gives $k = 0$ in the absence of the cosmological constant.
It is also the density that separates eternal expansion from eventual collapse of the
universe without $\Lambda$.

The first term in the parentheses is familiar: Newton's universal attractive force.
The pressure ($P$) in the second term gives the difference acquired by going to general
relativity. The cosmological constant $\Lambda$, if positive, gives the repulsive force.
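
For the critical density of Problem 19.2, a numerical check can be sketched as follows. It assumes the form $\rho_c = 3c^2H_0^2/(8\pi G_N)$ for the energy density (equivalently $3H_0^2/(8\pi G_N)$ as a mass density), the unit conversions quoted in the problem, and rounded SI constants; the function names are hypothetical.

```python
import math

G_N = 6.674e-11               # m^3 kg^-1 s^-2
C = 2.998e8                   # m/s
LIGHT_YEAR_M = 9.461e15       # metres per light year (rounded)
PC_M = 3.262 * LIGHT_YEAR_M   # 1 pc = 3.262 light years, as quoted in the problem
MPC_M = 1.0e6 * PC_M
J_PER_GEV = 1.602e-10


def hubble_rate_si(h0_km_s_mpc):
    """Convert H0 from km/s/Mpc to 1/s."""
    return h0_km_s_mpc * 1.0e3 / MPC_M


def critical_mass_density(h0_km_s_mpc):
    """3 H0^2 / (8 pi G_N) in kg/m^3."""
    return 3.0 * hubble_rate_si(h0_km_s_mpc) ** 2 / (8.0 * math.pi * G_N)


def critical_energy_density_gev_m3(h0_km_s_mpc):
    """Critical energy density in GeV per cubic metre."""
    return critical_mass_density(h0_km_s_mpc) * C ** 2 / J_PER_GEV


print(f"{critical_mass_density(73.3):.2e} kg/m^3")
print(f"{critical_energy_density_gev_m3(73.3):.2f} GeV/m^3")
```

With these inputs the result is of order $10^{-26}$ kg m$^{-3}$, i.e. a few proton masses per cubic metre.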

Einstein's original motivation for introducing the cosmological constant was to 
make the universe stationary because he did not know the universe was expand- 
ing. The original equation without the cosmological constant was unstable, the 
universe perpetually expanding or shrinking. To make the universe static, he need- 
ed a counter term to cancel the attractive force. Later, when he heard of Hubble's 
discovery of the expanding universe, he abandoned it, saying it was his biggest 
blunder in his life. He found no reason to keep it in the equation. It is interesting 
to note that today's physicists take a different attitude toward the unknown. People 
are beginning to believe that what is mathematically possible is likely to happen, as 
is exemplified by Gell-Mann's words: "Everything which is not forbidden is com- 
pulsory". 

The pressure term could give a repulsive force if it is negative and dominates 
the first term in Eq. (19.14). Indeed, by inspecting Eqs. (19.10) and (19.14) and 
recalling that the vacuum has negative pressure $P = -\rho$ (see arguments in boxed
paragraph of Sect. 5.5.1), one can easily see that the cosmological constant can be
identified as the constant vacuum energy density given by $\rho_{vac} = \Lambda c^2/(8\pi G_N)$.
Now we have understood that the accelerating expansion ($\ddot a > 0$) means that the
negative pressure permeates all over the universe. Note that, unlike a cluster of 
mass, which can exert force ~ M/r 2 , the vacuum energy is not clustered, and 
as can be seen from Eq. (19.14) the strength of the force increases linearly with 
distance. This is why repulsive gravity is not observed. Only at cosmic distances is 
sufficient vacuum energy accumulated to be perceptible. The cosmic expansion is 
the only phenomenon (so far) where repulsive gravity can be seen in operation. 

Problem 19.3 

Derive Eq. (19.14) using Eqs. (19.10) and (19.13). 
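
One possible route through Problem 19.3 is sketched below. It is a reconstruction under stated assumptions: Eq. (19.10) is taken in the form $H^2 = (8\pi G_N/3c^2)\rho - kc^2/a^2 + \Lambda/3$, and Eq. (19.13) is assumed to be the energy conservation law $d(\rho a^3) = -P\,d(a^3)$; the normalization may differ from the one used elsewhere in the book.

```latex
\begin{align*}
\dot a^{2} &= \frac{8\pi G_N}{3c^{2}}\,\rho\,a^{2} - kc^{2} + \frac{\Lambda}{3}\,a^{2}
  && \text{(Friedman equation times } a^{2}\text{)} \\
\dot\rho &= -3\,(\rho + P)\,\frac{\dot a}{a}
  && \text{(from } d(\rho a^{3}) = -P\,d(a^{3})\text{)} \\
2\,\dot a\,\ddot a &= \frac{8\pi G_N}{3c^{2}}\left(\dot\rho\,a^{2} + 2\rho\,a\,\dot a\right) + \frac{2\Lambda}{3}\,a\,\dot a
  && \text{(time derivative of the first line)} \\
\frac{\ddot a}{a} &= -\frac{4\pi G_N}{3c^{2}}\,(\rho + 3P) + \frac{\Lambda}{3} .
\end{align*}
```

In this form the first term in the parentheses is the Newtonian attraction, the pressure enters with weight 3, and a vacuum component with $P = -\rho$ contributes $+(8\pi G_N/3c^2)\rho_{vac}$, the same as a cosmological term with $\Lambda = 8\pi G_N \rho_{vac}/c^2$.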

As is easily derived from Eq. (19.10), the constant and dominant energy density 
forces the universe to expand exponentially. It is believed that the universe went 
through such an epoch, known as inflation, before the thermal hot universe developed.
Since we really do not know if the acceleration is due to the cosmological constant,
which stays constant as its name suggests, or some dynamical object, which
can generate time varying negative pressure, the name "dark energy" is adopted to
include all possible candidates for the cause of acceleration.

4) Assumption of the uniform universe restricts net thermal flow to vanish in any sampled volume.
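
The exponential expansion under a constant, dominant energy density can be made explicit. The sketch assumes $k = 0$, the Friedman equation in the form $H^2 = (8\pi G_N/3c^2)\rho$, and a constant vacuum energy density $\rho_{vac}$ with any cosmological term absorbed into it.

```latex
\begin{align*}
H^{2} = \left(\frac{\dot a}{a}\right)^{2} &= \frac{8\pi G_N}{3c^{2}}\,\rho_{vac} = \text{const} \\
\dot a = H a \quad &\Longrightarrow \quad a(t) = a(t_0)\, e^{H (t - t_0)} .
\end{align*}
```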

Note, in spontaneous symmetry breaking we learned in Sect. 18.6.2 that the vacuum
moves from $\phi = 0$ to $|\phi| = v/\sqrt{2}$. Then the difference in the potential
energy $V(0) - V(v/\sqrt{2})$ is liberated as vacuum energy. Namely, a scalar field that
can break symmetry spontaneously has the capability of creating vacuum energy. 
Cosmic inflation is believed to have been produced by such a scalar field, referred 
to as the inflaton. As the concordance model says that at present the dark energy 
dominates over all other types of energy, we are, sort of, in the second inflationary 
stage. The dark energy could be static vacuum energy (equivalent to the cosmo- 
logical constant) or of dynamical origin. If there is a self-interacting scalar field 
whose potential energy overwhelms its kinetic energy, it delivers vacuum energy, 
which varies with time. This is referred to as quintessence, which means the fifth 
element after the ancient Greek four elements of earth, wind, fire and water, or 
perhaps modern $u$, $d$, $\nu$ and $e$? Whether the dark energy will eventually die away,
as in some of the quintessence models, or stay constant, as is the case for the cos- 
mological constant, determines the fate of the universe. 

From a particle physicist's point of view, dark matter and dark energy are new 
challenges. What have we been doing? After all the long journey of the quest for 
the fundamental constituents of matter, what we have found occupies no more 
than 5% of the total cosmic energy budget. 



Appendix A 

Spinor Representation 



Definition of a Group

A group G is a set of elements with a definite operation that satisfy the following 
conditions. 

1. Closure: For any two elements A, B, a product AB exists and also belongs to 
the group. 

2. Associativity: For any three elements, (AB) C = A(BC). 

3. Identity element: A unit element 1 which does not cause anything to happen 
exists. For all elements A, 1A = Al = A. 

4. Inverse element: For any element A, an inverse $A^{-1}$ exists and $A^{-1}A = AA^{-1} = 1$.

If AB = BA, the set is called a commutative or abelian group.

Example A.1
Integers Z with addition "+" as the operation and zero as the identity element.
The inverse of an integer $n$ is $-n$.

Example A.2
Rational numbers Q without zero, with multiplication "×" as the operation and the
number 1 as the identity element. Zero has to be excluded, because it does not have
an inverse element. Other examples are parallel movement or rotation in two-
dimensional space (a plane).

If AB ≠ BA, the set is called a noncommutative or nonabelian group.

Example A.3
Space rotation $O(3)$, which is a member of the more general group $O(N)$ that keeps
the length $r = \sqrt{\sum_{i=1}^{N} x_i^2}$ in the space of $N$ real variables invariant.

Group Representation The operations of a group are abstract. By using the group
representation they are transformed to a format that is calculable. A familiar
representation is a collection of matrices $U(g)$ that act on a vector space $|\psi\rangle \in V$, such
that for all group elements $g \in G$, there exists a corresponding matrix $U(g)$ that
satisfies

$$ U(A)U(B) = U(AB), \qquad U(A^{-1}) = U^{-1}(A), \qquad U(1) = 1 \ \text{(unit matrix)} \tag{A.1} $$

The collection of operand vectors is called the representation space, and the
dimensionality $n$ of the vectors is called the dimension of the space.
Note, hermitian matrices do not form a group unless they commute with each other.

Example A.4

$U(N)$ is a transformation that keeps the metric $l^2 = \sum_{i=1}^{N} |u_i|^2$ of the space of
$N$ complex variables invariant. If we express the $u_i$'s as a vector
$\psi^T = (u_1, u_2, \cdots, u_N)$, then $N \times N$ unitary matrices are the representation of the $U(N)$
group elements. When $\det U = 1$, it is referred to as the special unitary group
$SU(N)$.



Subgroup When a subset of a group makes a group by itself under the same mul- 
tiplication (transformation) law, it is called subgroup. 

Subgroup of SU(N): Any $SU(N)$ group has an abelian discrete subgroup $H$, whose
elements are

$$ h \in H, \qquad h = e^{i 2\pi k/N}, \qquad k = 0, 1, 2, \cdots, N - 1 \tag{A.2} $$

ordinarily designated as $Z_N$. It is the "center" of the group, whose elements commute
with all other elements of the group.

A.1.1

Lie Group

When the group element is an analytical function of continuous parameters, it is
called a Lie group. Any element of the Lie group can be expressed as

$$ U(\theta_1, \cdots, \theta_n) = \exp\left(-i \sum_{i=1}^{n} \theta_i F_i\right) \tag{A.3} $$

where $n = N^2 - 1$ for the $SU(N)$ group. The $F_i$'s satisfy commutation relations called
the Lie algebra,

$$ [F_i, F_j] = i f_{ijk} F_k \tag{A.4} $$

The $F_i$'s are called generators of the group and the $f_{ijk}$ structure constants. Since all the
representation matrices are analytic functions smoothly connected to the unit matrix
in the limit $\theta_i \to 0$, the Lie algebra Eq. (A.4) completely determines the
local structure of the group. 1) $f_{ijk}$ can be made totally antisymmetric in the indices
$ijk$. Determination of the generators is not unique, because different $F_i'$ can
be made from any independent linear combination of the $F_i$. 2) We are concerned with
a compact Lie group in which the range of parameters is finite and closed within
a finite volume in the space. For example, for the rotation in a two-dimensional plane,
$0 \le \theta \le 2\pi$ and the group is compact. In the Lorentz group, $-1 < v < 1$, which is finite but not
closed. If another parameter, the rapidity, is used, $-\infty < \eta < \infty$, and hence the group
is not compact.



A.2

SU(2)

We define a spinor as a base vector of the $SU(2)$ group representation in a two-
dimensional complex variable space,

$$ \xi = \begin{pmatrix} \xi_1 \\ \xi_2 \end{pmatrix} \tag{A.6} $$

Then representation matrices are expressed by $2 \times 2$ unitary matrices with unit
determinant. $\xi$ transforms under $SU(2)$ as

$$ \xi \to \xi' = U\xi \tag{A.7} $$

The condition for $U$ is

$$ U^{-1} = U^\dagger, \quad \text{and} \quad \det U = 1 \tag{A.8} $$

which constrains the $U$ matrix to be of the form

$$ U = \begin{pmatrix} a & b \\ -b^* & a^* \end{pmatrix}, \qquad |a|^2 + |b|^2 = 1 \tag{A.9} $$

Therefore, the number of independent parameters of $SU(2)$ is three. Equation
(A.9) means

$$ \xi_1' = a\,\xi_1 + b\,\xi_2, \qquad \xi_2' = -b^*\xi_1 + a^*\xi_2 \tag{A.10} $$

1) An example of global structure is "connectedness". For example $SU(2)$ and $O(3)$ have the
same structure constants, but $SU(2)$ is simply connected and $O(3)$ is doubly connected and
there is a one-to-two correspondence between the two.

2) For instance, instead of angular momentum $J_i$, we can use $J_\pm = J_x \pm iJ_y$, which satisfy
$[J_+, J_-] = 2J_z$, $[J_z, J_\pm] = \pm J_\pm$ (A.5)

Taking the complex conjugate of the equations and rearranging, we have

$$ (-\xi_2^*)' = a\,(-\xi_2^*) + b\,(\xi_1^*), \qquad (\xi_1^*)' = -b^*(-\xi_2^*) + a^*(\xi_1^*) \tag{A.11} $$

Therefore

$$ \begin{pmatrix} -\xi_2^* \\ \xi_1^* \end{pmatrix} = -i\sigma_2\,\xi^* \tag{A.12} $$

transforms in the same way as $\xi$.

As there are three independent traceless hermitian $2 \times 2$ matrices, which are taken as the
Pauli matrices, a general form of the unitary matrix with unit determinant takes the
form

$$ U = \exp\left[-i\,\frac{\boldsymbol\sigma}{2}\cdot\mathbf n\,\theta\right] = \cos\frac{\theta}{2} - i\sin\frac{\theta}{2}\,\boldsymbol\sigma\cdot\mathbf n \tag{A.13} $$

$$ U^\dagger = \exp\left[i\,\frac{\boldsymbol\sigma}{2}\cdot\mathbf n\,\theta\right] = \cos\frac{\theta}{2} + i\sin\frac{\theta}{2}\,\boldsymbol\sigma\cdot\mathbf n \tag{A.14} $$

$s_i = \sigma_i/2$ are generators of the $SU(2)$ group and satisfy

$$ [s_i, s_j] = i\varepsilon_{ijk}\,s_k \tag{A.15} $$

which is the same commutation relation as that of angular momentum. To see
connections with the $O(3)$ (three-dimensional rotation) group, let us take a look
at the transformation properties of a product of $\xi$'s, $H_{ij} = \xi_i\xi_j^*$. Taking $H_{ij}$ as
components of a hermitian matrix, it can be expanded in terms of Pauli matrices.

$$ H = a\,1 + \mathbf b\cdot\boldsymbol\sigma = a\,1 + \begin{pmatrix} b_z & b_x - ib_y \\ b_x + ib_y & -b_z \end{pmatrix} \equiv A + B \tag{A.16} $$

The matrix $H$ transforms under $SU(2)$

$$ H \to H' = U H U^\dagger = A' + B' = a'\,1 + \mathbf b'\cdot\boldsymbol\sigma \tag{A.17} $$

$$ A' = a'\,1 = U A U^\dagger = U a 1 U^\dagger = a\,1 \quad \therefore \ a' = a \tag{A.18} $$

It shows the coefficient of the unit matrix is invariant. To extract $a$, we take the trace of
both sides of Eq. (A.18) and obtain

$$ 2a = \mathrm{Tr}[H] = \xi_1\xi_1^* + \xi_2\xi_2^* = |\xi_1|^2 + |\xi_2|^2 = \xi^\dagger\xi \tag{A.19} $$

We see that $SU(2)$ keeps the norm of the spinor invariant. Considering that the
unitary transformation keeps the value of the determinant invariant,

$$ \det B' = \det U B U^\dagger = \det B \quad \therefore \ b_x'^2 + b_y'^2 + b_z'^2 = b_x^2 + b_y^2 + b_z^2 \tag{A.20} $$


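This means the set of three parameters $\mathbf b = (b_x, b_y, b_z)$ defined in Eq. (A.17) behaves
as a vector in three-dimensional real space, i.e. a vector in $O(3)$. We shall
show explicitly that components of $\mathbf b$ exactly follow the relation Eq. (3.3).

To extract $b_x$ from Eq. (A.17), we multiply both sides of the equation by $\sigma_x$ and
take the trace. Using $\mathrm{Tr}[\sigma_x^2] = 2$, $\mathrm{Tr}[\sigma_x] = \mathrm{Tr}[\sigma_x\sigma_y] = \mathrm{Tr}[\sigma_x\sigma_z] = 0$, we obtain

$$ b_x = \frac{1}{2}\mathrm{Tr}[\sigma_x H] = \frac{1}{2}(\sigma_x)_{ij}\,\xi_j\xi_i^* = \frac{1}{2}\,\xi_i^*(\sigma_x)_{ij}\,\xi_j = \frac{1}{2}\,\xi^\dagger\sigma_x\xi \tag{A.21a} $$

similarly

$$ b_y = \frac{1}{2}\,\xi^\dagger\sigma_y\xi, \qquad b_z = \frac{1}{2}\,\xi^\dagger\sigma_z\xi \tag{A.21b} $$

Therefore we can write

$$ \mathbf b = \frac{1}{2}\,\xi^\dagger\boldsymbol\sigma\,\xi \tag{A.22} $$

Let us show that $\mathbf b$ indeed behaves as a vector in three-dimensional real space.
Operation of $SU(2)$ transforms $\xi \to \xi' = U\xi$. Rotation about the $z$ axis by $\theta$ transforms

$$ b_x \to b_x' = \frac{1}{2}\,\xi'^\dagger\sigma_x\xi' = \frac{1}{2}\,\xi^\dagger U^\dagger\sigma_x U\xi \tag{A.23} $$

$$ \phantom{b_x \to b_x'} = \frac{1}{2}\,\xi^\dagger\left(\cos\frac{\theta}{2} + i\sin\frac{\theta}{2}\,\sigma_z\right)\sigma_x\left(\cos\frac{\theta}{2} - i\sin\frac{\theta}{2}\,\sigma_z\right)\xi \tag{A.24} $$

$$ b_x' = \cos\theta\, b_x - \sin\theta\, b_y \tag{A.25a} $$

Similarly

$$ b_y' = \sin\theta\, b_x + \cos\theta\, b_y \tag{A.25b} $$

$$ b_z' = b_z \tag{A.25c} $$

Thus, $\mathbf b$ behaves exactly as a vector in $O(3)$. We may define a spinor as an object whose bilinear
form (A.22) behaves like a vector. Loosely speaking, it is like a square root of the
vector.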
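Composites of Spinors Let us consider products of two independent spinors $\xi$ and
$\eta$, which may be thought of as representing two particles of spin 1/2,

$$ \xi = \begin{pmatrix} \xi_1 \\ \xi_2 \end{pmatrix}, \qquad \eta = \begin{pmatrix} \eta_1 \\ \eta_2 \end{pmatrix} $$

$$ s = \frac{1}{\sqrt{2}}(\xi_1\eta_2 - \xi_2\eta_1) \tag{A.26a} $$

$$ v_x = -\frac{1}{\sqrt{2}}(\xi_1\eta_1 - \xi_2\eta_2), \qquad v_y = \frac{1}{\sqrt{2}\,i}(\xi_1\eta_1 + \xi_2\eta_2), \qquad v_z = \frac{1}{\sqrt{2}}(\xi_1\eta_2 + \xi_2\eta_1) \tag{A.26b} $$

It is easy to show that $s$ and $\mathbf v$ behave as a scalar and a vector, namely as spin 0
and 1 particles.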

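From the above arguments, we realize that the $SU(2)$ operation on $\xi$ and that of
$O(3)$ on the vector $(x, y, z)$ are closely related. Writing down explicitly the operations on
the spin part

$$ U = e^{-i\boldsymbol\sigma\cdot\mathbf n\,\theta/2} = \cos\frac{\theta}{2} - i\sin\frac{\theta}{2}\,\boldsymbol\sigma\cdot\mathbf n \quad\longleftrightarrow\quad R = e^{-i\mathbf S\cdot\mathbf n\,\theta} = 1 - i\sin\theta\,(\mathbf S\cdot\mathbf n) - (1 - \cos\theta)(\mathbf S\cdot\mathbf n)^2 \tag{A.27} $$

we see that while in $SU(2)$ it takes a $4\pi$ rotation to bring the spinor back to where it started,
it takes only a $2\pi$ rotation in $O(3)$. In the range $2\pi \sim 4\pi$, $U \to -U$ corresponds
to the same operation in $O(3)$. The correspondence is 1 : 2. This is the reason
that $O(3)$ realizes rotations only on integer spin particles, while $SU(2)$ is capable
of handling those of half integer spin.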
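
The two-to-one correspondence can be seen in a few lines of arithmetic. The sketch assumes a rotation about the $z$ axis, for which $U = \exp(-i\sigma_z\theta/2)$ is diagonal and the $O(3)$ rotation is the familiar $3 \times 3$ matrix; the function names are hypothetical.

```python
import cmath
import math


def spinor_rotation_z(theta):
    """Diagonal entries of U = exp(-i sigma_z theta / 2)."""
    return (cmath.exp(-0.5j * theta), cmath.exp(0.5j * theta))


def vector_rotation_z(theta):
    """3x3 matrix rotating a vector about the z axis by theta."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, -s, 0.0), (s, c, 0.0), (0.0, 0.0, 1.0))


for turns in (1, 2):
    theta = 2 * math.pi * turns
    u = spinor_rotation_z(theta)
    r = vector_rotation_z(theta)
    print(turns, "turn(s): U diagonal ~", [round(z.real) for z in u],
          " R diagonal ~", [round(r[i][i]) for i in range(3)])
```

After one full turn the vector rotation is already the identity while the spinor matrix is minus the identity; only the second turn restores the spinor.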

There are many reasons to believe that $SU(2)$ is active in our real world. For
instance, we know that quarks and leptons, the most fundamental particles of matter,
have spin 1/2. Direct evidence has been obtained by a neutron interference
experiment using a monolithic crystal, which has explicitly shown that it takes $4\pi$ to go
back to the starting point [86, 103]. To confirm it personally, the reader may try a
simple experiment shown in Figure A.1 in the boxed paragraph.



Confirm for yourself that 720° rotation brings you back to where you were,
but 360° rotation does not.

Figure A.1 Illustration of SU(2) topology. After a rotation of $4\pi$, the strings can be
disentangled without rotating the tube, but it is not possible to do so with $2\pi$ rotation.
(a) No rotation. (b) The tube is rotated through an angle $4\pi$. The configuration is
topologically equivalent to (a). (c)-(f) show how to undo the twists of the two strings.

A.3

Lorentz Operator for Spin 1/2 Particle 

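Referring to Eq. (3.77) and Eq. (3.74) in the text, combinations of the spin operator $\mathbf S$
and the boost operator $\mathbf K$

$$ \mathbf A = \frac{1}{2}(\mathbf S + i\mathbf K), \qquad \mathbf B = \frac{1}{2}(\mathbf S - i\mathbf K) \tag{A.28} $$

satisfy the following equalities

$$ [A_i, A_j] = i\varepsilon_{ijk}A_k \tag{A.29a} $$

$$ [B_i, B_j] = i\varepsilon_{ijk}B_k \tag{A.29b} $$

$$ [A_i, B_j] = 0 \tag{A.29c} $$

Both $\mathbf A$ and $\mathbf B$ satisfy the commutation relations of $SU(2)$, and they commute with each
other. In other words, the Lorentz group can be expressed as a direct product of two groups
$SU(2) \times SU(2)$, and hence its representations can be specified by a pair of spins
$(s_A, s_B)$. Let us consider the most interesting case where one has spin 1/2 and the
other 0.

$$ (0, 1/2): \quad \mathbf A = 0 \ \to\ \mathbf K = +i\mathbf S = i\,\frac{\boldsymbol\sigma}{2} \tag{A.30a} $$

$$ (1/2, 0): \quad \mathbf B = 0 \ \to\ \mathbf K = -i\mathbf S = -i\,\frac{\boldsymbol\sigma}{2} \tag{A.30b} $$

Correspondingly there are two kinds of spinors, which we denote $\phi_L$ and $\phi_R$
respectively. They transform under the Lorentz operation as

$$ \phi_L \to \phi_L' = \exp\left[-i\,\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\theta - \frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_L \equiv M\phi_L \tag{A.31a} $$

$$ \phi_R \to \phi_R' = \exp\left[-i\,\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\theta + \frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_R \equiv N\phi_R \tag{A.31b} $$

which means that the two spinors behave in the same way under rotation, but
transform differently under the boost operation.

Note there is no matrix $T$ that connects the two by a transformation
$N = TMT^{-1}$, i.e. in mathematical language they are inequivalent. Actually they are
connected by

$$ N = \zeta M^* \zeta^{-1}, \qquad \zeta = -i\sigma_2 \tag{A.32} $$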

A.3.1 

SL(2,C) Group 

The Lorentz transformation on spinors is represented by a 2 × 2 complex matrix S with unit 
determinant; such matrices form the SL(2, C) group. 
The matrix S comprises 6 parameters, which are related to the three angles and 
the three rapidities in the Lorentz transformation. To see it, we look again at the 
transformation properties of a product of spinor components $H_{ij} = \xi_i\xi_j^*$ (i, j = 
1 ~ 2). In a similar manner as Eq. (A.16), we decompose it into two parts, one with 
the unit matrix and the other with the Pauli matrices 

$$H = b^0\mathbf{1} + \mathbf{b}\cdot\boldsymbol\sigma = \begin{bmatrix} b^0 + b_z & b_x - ib_y \\ b_x + ib_y & b^0 - b_z \end{bmatrix}, \quad H' = SHS^\dagger \tag{A.34}$$

Here, as the matrix S is not unitary, $S^\dagger S \neq 1$, hence the first and the second term 
mix. However, the value of the determinant is invariant, which leads to 

$$\det H' = \det[SHS^\dagger] = |\det S|^2 \det H = \det H \tag{A.35}$$

$$\therefore\; b'^{0\,2} - b_x'^2 - b_y'^2 - b_z'^2 = b^{0\,2} - b_x^2 - b_y^2 - b_z^2 \tag{A.36}$$

Therefore the SL(2, C) operation has been proved to be equivalent to the Lorentz 
transformation. 
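
The invariance (A.36) is easy to confirm numerically. The sketch below assumes the decomposition H = b⁰1 + b·σ of (A.34) and recovers the components through b^μ = Tr[σ^μ H]/2, with σ^μ = (1, σ); the function names and the sample numbers are illustrative only.

```python
import cmath

# sigma^mu = (1, sigma_x, sigma_y, sigma_z)
SIGMA = [
    [[1, 0], [0, 1]],
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def dagger(a):
    return [[complex(a[j][i]).conjugate() for j in range(2)] for i in range(2)]

def exp_sigma(v):
    """exp(v . sigma) for a complex 3-vector v (traceless exponent, so det = 1)."""
    s = cmath.sqrt(sum(c * c for c in v))
    f = cmath.sinh(s) / s if abs(s) > 1e-12 else 1.0
    return [[cmath.cosh(s) * SIGMA[0][i][j]
             + f * sum(v[k] * SIGMA[k + 1][i][j] for k in range(3))
             for j in range(2)] for i in range(2)]

def to_matrix(b):
    """H = b^0 1 + b . sigma."""
    return [[sum(b[m] * SIGMA[m][i][j] for m in range(4)) for j in range(2)]
            for i in range(2)]

def to_vector(h):
    """Inverse map: b^mu = Tr[sigma^mu H] / 2 (real for hermitian H)."""
    out = []
    for m in range(4):
        prod = matmul(SIGMA[m], h)
        out.append(((prod[0][0] + prod[1][1]) / 2).real)
    return out

def interval(b):
    return b[0] ** 2 - b[1] ** 2 - b[2] ** 2 - b[3] ** 2

theta = (0.4, 0.1, -0.9)   # arbitrary rotation angles
eta = (0.8, -0.3, 0.6)     # arbitrary rapidities
S = exp_sigma([(-1j * t - e) / 2 for t, e in zip(theta, eta)])
det_S = S[0][0] * S[1][1] - S[0][1] * S[1][0]

b = [2.0, 0.3, -1.2, 0.5]
b_new = to_vector(matmul(matmul(S, to_matrix(b)), dagger(S)))
print(interval(b), interval(b_new))
```

The two printed values should agree to rounding error, even though the individual components of `b_new` differ from those of `b`.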

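To see the difference between $\phi_L$ and $\phi_R$, let us look at the transformation properties of 
two vector-like quantities made of constant matrices sandwiched between the two 
spinors. 

$$a^\mu = \phi_R^\dagger\sigma^\mu\phi_R, \quad a_\mu = \phi_R^\dagger\sigma_\mu\phi_R \tag{A.37a}$$

$$b^\mu = \phi_L^\dagger\bar\sigma^\mu\phi_L, \quad b_\mu = \phi_L^\dagger\bar\sigma_\mu\phi_L \tag{A.37b}$$

$$\sigma^\mu = (1, \boldsymbol\sigma) = \bar\sigma_\mu, \quad \bar\sigma^\mu = (1, -\boldsymbol\sigma) = \sigma_\mu \tag{A.37c}$$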



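Consider a boost in the x-direction with rapidity η, for which 

$$S_\pm = \exp(\pm\eta\sigma_x/2), \quad S_\pm^\dagger = S_\pm$$

and define dashed quantities by $a'^\mu = \phi_R^\dagger\sigma'^\mu\phi_R$, $b'^\mu = \phi_L^\dagger\bar\sigma'^\mu\phi_L$; 
then by the Lorentz transformation, they change as 

$$\sigma^\mu \to \sigma'^\mu = S_+\sigma^\mu S_+, \quad \bar\sigma^\mu \to \bar\sigma'^\mu = S_-\bar\sigma^\mu S_- \tag{A.40}$$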



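Calculating the "0, 1" components under a boost in the x-direction, with the upper (lower) 
sign referring to $\sigma^\mu$ ($\bar\sigma^\mu$), 

$$\sigma'^0\,(\bar\sigma'^0) = S_\pm \mathbf{1} S_\pm = e^{\pm\eta\sigma_x} = \cosh\eta \pm \sinh\eta\,\sigma_x \tag{A.41a}$$

$$\sigma'^1\,(\bar\sigma'^1) = S_\pm(\pm\sigma_x)S_\pm = \pm\left(\cosh\frac{\eta}{2} \pm \sinh\frac{\eta}{2}\sigma_x\right)\sigma_x\left(\cosh\frac{\eta}{2} \pm \sinh\frac{\eta}{2}\sigma_x\right) = \sinh\eta \pm \cosh\eta\,\sigma_x \tag{A.41b}$$

The transformation is the same as Eq. (3.66). Therefore, we have proved that $a^\mu$, 
$b^\mu$ are contravariant vectors and $a_\mu$, $b_\mu$ covariant vectors. We often refer to $\sigma^\mu$, $\bar\sigma^\mu$ as 
Lorentz vectors, but note that this is meaningful only if they are sandwiched between the two 
spinors. It is also clear that $\phi_R^\dagger\phi_L$, $\phi_L^\dagger\phi_R$ are Lorentz scalars. 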

A.3.2 

Dirac Equation: Another Derivation 

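Here, we try to make some correspondence between the spinors and particles, 
considering a special transformation with θ = 0. We can consider a particle with 
momentum p as boosted from its rest state. Then there are two distinct states of 
the particle expressed as 

$$\phi_L(p) = \exp\left[-\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_L(0) \tag{A.42a}$$

$$\phi_R(p) = \exp\left[+\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_R(0) \tag{A.42b}$$

Let $\varphi_L$, $\varphi_R$ be any two-component spinors behaving like $\phi_L$, $\phi_R$; then $\varphi_L^\dagger(1, -\boldsymbol\sigma)\phi_L$, 
$\varphi_R^\dagger(1, \boldsymbol\sigma)\phi_R$ are contravariant vectors and their dot products with the energy-momentum 
vector, $\varphi_L^\dagger(E + \boldsymbol\sigma\cdot\mathbf{p})\phi_L$ and $\varphi_R^\dagger(E - \boldsymbol\sigma\cdot\mathbf{p})\phi_R$, must be scalars. Considering that 
$\varphi_L^\dagger\phi_R$, $\varphi_R^\dagger\phi_L$ are also scalars, it follows that $(E + \boldsymbol\sigma\cdot\mathbf{p})\phi_L$, $(E - \boldsymbol\sigma\cdot\mathbf{p})\phi_R$ behave like 
$\phi_R$, $\phi_L$ and hence must be proportional to them. Using the relation 

$$E + \boldsymbol\sigma\cdot\mathbf{p} = m(\cosh\eta + \sinh\eta\,\boldsymbol\sigma\cdot\mathbf{n}) = m\exp(\boldsymbol\sigma\cdot\boldsymbol\eta) \tag{A.43}$$

and writing the proportionality constant as C, 

$$(E + \boldsymbol\sigma\cdot\mathbf{p})\phi_L(p) = m\exp(\boldsymbol\sigma\cdot\boldsymbol\eta)\exp\left[-\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_L(0) = m\exp\left[+\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_L(0) = C\phi_R(p) = C\exp\left[+\frac{\boldsymbol\sigma}{2}\cdot\boldsymbol\eta\right]\phi_R(0) \tag{A.44}$$

It follows that C = m and $\phi_L(0) = \phi_R(0)$. The latter equality is physically a simple 
statement that at rest there is no means to distinguish $\phi_L$ from $\phi_R$. Applying the 
same argument to $(E - \boldsymbol\sigma\cdot\mathbf{p})\phi_R$ and combining both results gives 

$$(E + \boldsymbol\sigma\cdot\mathbf{p})\phi_L(p) = m\phi_R(p) \tag{A.45a}$$

$$(E - \boldsymbol\sigma\cdot\mathbf{p})\phi_R(p) = m\phi_L(p) \tag{A.45b}$$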
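
As a sanity check on (A.45), one can boost an arbitrary rest-frame spinor according to (A.42) and verify both equations numerically. This is only a sketch under the conventions assumed above (E = m cosh η, p = m sinh η n, and a common rest spinor for both chiralities); all names and sample values are illustrative.

```python
import cmath
import math

SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

def sigma_dot(v):
    """v . sigma as a 2x2 matrix."""
    return [[v[0] * SX[i][j] + v[1] * SY[i][j] + v[2] * SZ[i][j]
             for j in range(2)] for i in range(2)]

def exp_sigma(v):
    """exp(v . sigma) = cosh(s) + sinh(s)/s (v . sigma), with s^2 = v . v."""
    s = cmath.sqrt(sum(c * c for c in v))
    f = cmath.sinh(s) / s if abs(s) > 1e-12 else 1.0
    vs = sigma_dot(v)
    return [[cmath.cosh(s) * (1 if i == j else 0) + f * vs[i][j]
             for j in range(2)] for i in range(2)]

def matvec(a, x):
    return [sum(a[i][k] * x[k] for k in range(2)) for i in range(2)]

m = 1.5                      # mass (arbitrary units)
eta_mag = 0.9                # rapidity
n = (0.6, 0.0, 0.8)          # unit vector along the boost
eta = [eta_mag * c for c in n]
E = m * math.cosh(eta_mag)
p = [m * math.sinh(eta_mag) * c for c in n]

rest = [0.6 + 0.2j, -0.3 + 0.7j]     # phi_L(0) = phi_R(0), arbitrary spinor
phi_L = matvec(exp_sigma([-c / 2 for c in eta]), rest)
phi_R = matvec(exp_sigma([+c / 2 for c in eta]), rest)

sp = sigma_dot(p)
lhs_a = [E * phi_L[i] + sum(sp[i][k] * phi_L[k] for k in range(2)) for i in range(2)]
lhs_b = [E * phi_R[i] - sum(sp[i][k] * phi_R[k] for k in range(2)) for i in range(2)]

res_a = max(abs(lhs_a[i] - m * phi_R[i]) for i in range(2))  # residual of (A.45a)
res_b = max(abs(lhs_b[i] - m * phi_L[i]) for i in range(2))  # residual of (A.45b)
print(res_a, res_b)
```

Both residuals vanish to rounding error for any choice of the rest spinor, which reflects the statement that the equation follows from the boost alone.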

This is the celebrated Dirac equation written in terms of two-component spinors 3). 
Historically, the equation was created first and its solution followed. Here, we 
followed the inverse logic, starting from the solution and finding the equation. We see 
that it is a relation that two-component spinors must satisfy and is the simplest one 
that a spin 1/2 particle respecting Lorentz invariance has to obey. 

3) ... text books, Dirac representation is ... $\{\alpha, \beta\}_D = T\{\alpha, \beta\}_W T^{-1}$ 

Multiplying $(E - \boldsymbol\sigma\cdot\mathbf{p})$ on both sides of (A.45a), and using the relation $(E - \boldsymbol\sigma\cdot\mathbf{p})(E + \boldsymbol\sigma\cdot\mathbf{p}) = E^2 - \mathbf{p}^2$ 
and (A.45b), we obtain 

$$(E^2 - \mathbf{p}^2 - m^2)\phi_L = 0, \quad \text{similarly} \quad (E^2 - \mathbf{p}^2 - m^2)\phi_R = 0 \tag{A.47}$$

They satisfy the Einstein relation. If $\phi_L(p)$ and $\phi_R(p)$ are expressed as spacetime 
functions (i.e. take their Fourier transform), they satisfy the Klein-Gordon equation. 

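Problem A.1 
Define 

$$S^{\mu\nu} = \frac{i}{4}[\gamma^\mu, \gamma^\nu] \tag{A.48a}$$

$$(S^{23}, S^{31}, S^{12}) = (S_1, S_2, S_3) = \frac{1}{2}\boldsymbol\Sigma = \frac{1}{2}\begin{bmatrix} \boldsymbol\sigma & 0 \\ 0 & \boldsymbol\sigma \end{bmatrix} \tag{A.48b}$$

$$(S^{01}, S^{02}, S^{03}) = (K_1, K_2, K_3) = \frac{i}{2}\boldsymbol\alpha = \frac{1}{2}\begin{bmatrix} -i\boldsymbol\sigma & 0 \\ 0 & i\boldsymbol\sigma \end{bmatrix} \tag{A.48c}$$

Show that S and K satisfy the same commutation relations as Eq. (3.74), and thus 
qualify as Lorentz operators. 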

Looking at Eq. (A.42), we see the plane wave solution of the Dirac equation is 
indeed nothing but a Lorentz boosted spin wave function at rest as it should be 
from our argument. 



Appendix B 
Coulomb Gauge 

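The Lagrangian for an interacting electromagnetic field is given by 

$$\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} - qj_\mu A^\mu = -\frac{1}{2}(\mathbf{B}^2 - \mathbf{E}^2) - qj_\mu A^\mu \tag{B.1a}$$

$$F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu, \quad A^\mu = (\phi, \mathbf{A}), \quad j^\mu = (\rho, \mathbf{j}) \tag{B.1b}$$

The Euler-Lagrange equation produces the Maxwell equation 

$$\partial_\mu F^{\mu\nu} = \partial_\mu\partial^\mu A^\nu - \partial^\nu(\partial_\mu A^\mu) = qj^\nu \tag{B.2}$$

In the Coulomb gauge, we impose the gauge condition 

$$\nabla\cdot\mathbf{A} = 0 \tag{B.3}$$

Using the condition, the 0th component of the Maxwell equation becomes 

$$\nabla^2\phi(x) = -q\rho \tag{B.4}$$

which has the solution, with the boundary condition $\phi \to 0$ as $|\mathbf{x}| \to \infty$, 

$$\phi(\mathbf{x}, t) = q\int d^3x'\,\frac{\rho(\mathbf{x}', t)}{4\pi|\mathbf{x} - \mathbf{x}'|} \tag{B.5}$$

Notice that the action of the charge at x' on a point x is not retarded, but instantaneous. 
Defining the conjugate momentum fields by 

$$\pi^0 = \frac{\delta\mathcal{L}}{\delta\dot{A}_0} = 0 \tag{B.6a}$$

$$\pi^k = \frac{\delta\mathcal{L}}{\delta\dot{A}_k} = -\dot{A}^k - \partial_k\phi = E^k \tag{B.6b}$$

the Hamiltonian density is given by 

$$\mathcal{H} = \frac{\delta\mathcal{L}}{\delta\dot{A}_\mu}\dot{A}_\mu - \mathcal{L} = \frac{1}{2}(\mathbf{E}^2 + \mathbf{B}^2) + \mathbf{E}\cdot\nabla\phi + qj_\mu A^\mu$$

$$= \frac{1}{2}(\mathbf{E}^2 + \mathbf{B}^2) - q\rho\phi + qj_\mu A^\mu = \frac{1}{2}(\mathbf{E}^2 + \mathbf{B}^2) - q\,\mathbf{j}\cdot\mathbf{A} \tag{B.7a}$$

where in going to the second line a partial integration was performed. This Hamiltonian 
seems to lack the Coulomb potential term, but actually it is hidden in $\mathbf{E}^2$. To 
see it, we set 

$$\mathbf{E}_\parallel = -\nabla\phi, \quad \mathbf{E}_\perp = -\dot{\mathbf{A}} \tag{B.8}$$

$$\mathbf{E} = \mathbf{E}_\perp + \mathbf{E}_\parallel, \quad \nabla\times\mathbf{E}_\parallel = 0, \quad \nabla\cdot\mathbf{E}_\perp = 0 \tag{B.9}$$

By partial integration, we can show that (under the space integral) 

$$|\mathbf{E}|^2 = |\mathbf{E}_\perp + \mathbf{E}_\parallel|^2 = |\mathbf{E}_\perp - \nabla\phi|^2 = |\mathbf{E}_\perp|^2 + |\mathbf{E}_\parallel|^2$$

$$|\mathbf{E}_\parallel|^2 = \nabla\phi\cdot\nabla\phi = -(\nabla^2\phi)\phi = q\rho\phi$$

Using the above relations, the Hamiltonian is now expressed as 

$$\mathcal{H} = \frac{1}{2}(\mathbf{B}^2 + \mathbf{E}_\perp^2) + \frac{1}{2}q\rho\phi - q\,\mathbf{j}\cdot\mathbf{A}_\perp \tag{B.11}$$

The Coulomb term has reappeared. Notice the factor 1/2. The Hamiltonian can 
now be written as 

$$H = \int d^3x\left[\frac{1}{2}\left\{(\nabla\times\mathbf{A}_\perp)^2 + (\dot{\mathbf{A}}_\perp)^2\right\} - q\,\mathbf{j}\cdot\mathbf{A}_\perp\right] + \frac{q^2}{2}\iint d^3x\,d^3x'\,\frac{\rho(\mathbf{x})\rho(\mathbf{x}')}{4\pi|\mathbf{x} - \mathbf{x}'|} \tag{B.12}$$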



Thus the degree of freedom corresponding to the scalar potential is replaced with 
the instantaneous Coulomb potential and the rest are written in terms of trans- 
versely polarized potential. They are not manifestly Lorentz invariant, but a detailed 
investigation can show that Lorentz invariance is respected. Therefore, when intu- 
itive and physical interpretation is desired, the Coulomb gauge is convenient. But 
for easier calculation, the covariant gauge is more convenient. 



Quantization of the Electromagnetic Field in the Coulomb Gauge 

Since $\pi^0 = 0$, the scalar component does not propagate, meaning it is not a 
dynamical object; hence we cannot quantize $A^0$. We leave it as a classical potential 
and quantize only the three fields A according to the standard procedure with the 
conditions 

$$[A^i(x), \pi^j(y)]_{t_x = t_y} = i\,\delta_\perp^{ij}\,\delta^3(\mathbf{x} - \mathbf{y}) \tag{B.13a}$$

$$[A^i(x), A^j(y)]_{t_x = t_y} = [\pi^i(x), \pi^j(y)]_{t_x = t_y} = 0 \tag{B.13b}$$

The reason for using $\delta_\perp^{ij}$ instead of $\delta_{ij}$ is to make the commutators consistent 
with the Coulomb gauge condition (B.3), so that both sides of the commutators 
are divergenceless. The field quantization in terms of the annihilation and creation 
operators is 



A <*>=^x: 






\a k , x e(k,X)e- 



[»M-»I'^] = 8w&u 



K*!/^ 1 )*^] ( B - 14a ) 
(B.14b) 
(B.14c) 
(B.14d) 



V -A = Q 



(B.15) 



Only the transverse polarization is included. The gauge condition requires 
$\mathbf{k}\cdot\boldsymbol\epsilon(\mathbf{k}, \lambda = \pm) = 0$. 

The quantization is carried out in the same manner as two independent Klein- 
Gordon fields. The only subtlety is in the definition of transverse delta function to 
make the commutator relations consistent with the gauge requirement. 

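The advantage of the Coulomb gauge is the clarity of its physical picture, but the 
disadvantage is that the form of the propagator, and hence the calculation of the 
transition amplitude, becomes rather complicated. To derive the Feynman propagator 
in the Coulomb gauge, we introduce a timelike vector $\eta^\mu = (1, 0, 0, 0)$, which 
is orthogonal to the polarization vectors, i.e. $\eta\cdot\epsilon(\lambda) = 0$. We form four orthonormal 
vectors $\eta^\mu$, $\epsilon^\mu(1)$, $\epsilon^\mu(2)$, $\hat{k}^\mu$, where $\hat{k}^\mu$ is a vector defined by 

$$\hat{k}^\mu = \frac{k^\mu - (k\cdot\eta)\eta^\mu}{[(k\cdot\eta)^2 - k^2]^{1/2}}$$

As $k\cdot\epsilon = \eta\cdot\epsilon = 0$, it is easy to prove that $\hat{k}$ is a spacelike unit vector, i.e. $\hat{k}^2 = -1$, 
and orthogonal to the polarization vectors ($\hat{k}\cdot\epsilon = 0$). Then we have 

$$g_{\mu\nu} = \eta_\mu\eta_\nu - \sum_{\lambda=1,2}\epsilon_\mu^*(k, \lambda)\epsilon_\nu(k, \lambda) - \hat{k}_\mu\hat{k}_\nu$$

In the covariant Feynman gauge, the propagator is expressed simply by 

$$\frac{\sum_{\lambda=0}^{3}\epsilon_\mu^*(k, \lambda)\epsilon_\nu(k, \lambda)}{k^2 + i\varepsilon} \;\to\; \frac{-g_{\mu\nu}}{k^2 + i\varepsilon} \tag{B.16}$$

whereas in the Coulomb gauge the numerator is replaced by 

$$\sum_{\lambda=1,2}\epsilon_\mu^*(k, \lambda)\epsilon_\nu(k, \lambda) = -g_{\mu\nu} + \eta_\mu\eta_\nu - \hat{k}_\mu\hat{k}_\nu$$

$$= -g_{\mu\nu} - \frac{k_\mu k_\nu}{(k\cdot\eta)^2 - k^2} + \frac{(k\cdot\eta)(k_\mu\eta_\nu + \eta_\mu k_\nu)}{(k\cdot\eta)^2 - k^2} - \frac{k^2\,\eta_\mu\eta_\nu}{(k\cdot\eta)^2 - k^2}$$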


Appendix C 

Dirac Matrix and Gamma Matrix Traces 



Dirac Plane Wave Solutions 

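$$\slashed{p} = p_\mu\gamma^\mu$$

$$(\slashed{p} - m)u(p) = 0, \quad (\slashed{p} + m)v(p) = 0$$

$$\bar{u}(p)(\slashed{p} - m) = 0, \quad \bar{v}(p)(\slashed{p} + m) = 0$$

$$u_s^\dagger(p)u_r(p) = v_s^\dagger(p)v_r(p) = 2E\delta_{rs}$$

$$\bar{u}_s(p)u_r(p) = -\bar{v}_s(p)v_r(p) = 2m\delta_{rs} \tag{C.1a}$$

$$\bar{u}_s(p)v_r(p) = \bar{v}_s(p)u_r(p) = 0$$

$$\sum_{r=\pm1/2}u_r(p)\bar{u}_r(p) = \slashed{p} + m, \quad \sum_{r=\pm1/2}v_r(p)\bar{v}_r(p) = \slashed{p} - m$$

Gordon Equality 

$$\bar{u}_s(p')\gamma^\mu u_r(p) = \frac{1}{2m}\bar{u}_s(p')\left[(p' + p)^\mu + i\sigma^{\mu\nu}(p' - p)_\nu\right]u_r(p) \tag{C.1b}$$

$$\bar{u}_s(p')\gamma^\mu\gamma^5u_r(p) = \frac{1}{2m}\bar{u}_s(p')\left[(p' - p)^\mu + i\sigma^{\mu\nu}(p' + p)_\nu\right]\gamma^5u_r(p)$$

C.2 

Dirac γ Matrices 

$$\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}$$

$$\gamma^5 = i\gamma^0\gamma^1\gamma^2\gamma^3 = -\frac{i}{4!}\epsilon_{\mu\nu\rho\sigma}\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma$$

$$(\gamma^5)^2 = 1, \quad \gamma^5\gamma^\mu + \gamma^\mu\gamma^5 = 0$$

$$\sigma^{\mu\nu} = \frac{i}{2}[\gamma^\mu, \gamma^\nu], \quad \sigma^{0i} = i\alpha^i, \quad \sigma^{ij} = \epsilon_{ijk}\Sigma^k$$

$$\gamma^0(\gamma^\mu)^\dagger\gamma^0 = \gamma^\mu, \quad \gamma^0(\gamma^5)^\dagger\gamma^0 = -\gamma^5$$

$$\gamma^0(\gamma^5\gamma^\mu)^\dagger\gamma^0 = \gamma^5\gamma^\mu, \quad \gamma^0(\sigma^{\mu\nu})^\dagger\gamma^0 = \sigma^{\mu\nu}$$

$$\gamma^5\gamma_\sigma = -\frac{i}{3!}\epsilon_{\mu\nu\rho\sigma}\gamma^\mu\gamma^\nu\gamma^\rho$$

$$\gamma^\mu\gamma^\nu\gamma^\rho = g^{\mu\nu}\gamma^\rho - g^{\mu\rho}\gamma^\nu + g^{\nu\rho}\gamma^\mu - i\gamma^5\epsilon^{\mu\nu\rho\sigma}\gamma_\sigma$$

Charge Conjugation Matrices 

$$C = i\gamma^2\gamma^0, \quad C^T = C^\dagger = -C, \quad CC^\dagger = 1, \quad C^2 = -1$$

$$C\gamma^\mu C^{-1} = -\gamma^{\mu T}, \quad C\gamma^5C^{-1} = \gamma^{5T}$$

$$C(\gamma^5\gamma^\mu)^TC^{-1} = \gamma^5\gamma^\mu, \quad C\sigma^{\mu\nu}C^{-1} = -\sigma^{\mu\nu T}$$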

Chiral (Weyl) Representation 







(C.4a) 
(C.4b) 



Pauli-Dirac Representation 



[: :]./---B-g.r-/..-[4 ;] (C , 



Writing wave functions in the Weyl and Pauli-Dirac representations as <Py/> ^d. 
they are mutually connected by 

<Pn=S& w , y D = Sy w S- 1 , S = -J= T_J Jl (C.6) 

C.2.1 

Traces of the y Matrices 

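$$\gamma^\mu\gamma_\mu = 4$$

$$\gamma^\mu\slashed{A}\gamma_\mu = -2\slashed{A}$$

$$\gamma^\mu\slashed{A}\slashed{B}\gamma_\mu = 4(A\cdot B) \tag{C.7a}$$

$$\gamma^\mu\slashed{A}\slashed{B}\slashed{C}\gamma_\mu = -2\slashed{C}\slashed{B}\slashed{A}$$

$$\mathrm{Tr}[1] = 4, \quad \mathrm{Tr}[\gamma^\mu] = \mathrm{Tr}[\gamma^5] = 0$$

$$\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] = 0$$

$$\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_n}] = (-1)^n\,\mathrm{Tr}[\gamma^{\mu_n}\cdots\gamma^{\mu_2}\gamma^{\mu_1}]$$

$$\mathrm{Tr}[\gamma^\mu\gamma^\nu] = 4g^{\mu\nu}$$

$$\mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = 4[g^{\mu\nu}g^{\rho\sigma} + g^{\mu\sigma}g^{\nu\rho} - g^{\mu\rho}g^{\nu\sigma}]$$

$$\mathrm{Tr}[\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = -4i\epsilon^{\mu\nu\rho\sigma} = 4i\epsilon_{\mu\nu\rho\sigma}$$

$$\mathrm{Tr}[\gamma^5] = \mathrm{Tr}[\gamma^5\gamma^\mu] = \mathrm{Tr}[\gamma^5\gamma^\mu\gamma^\nu] = \mathrm{Tr}[\gamma^5\gamma^\mu\gamma^\nu\gamma^\rho] = 0$$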
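
The trace formulas above can be verified by brute force with explicit matrices. The sketch below assumes the Pauli-Dirac representation (γ⁰ = diag(1, 1, −1, −1), γ^i with off-diagonal blocks ±σ_i) and the metric (+, −, −, −); since traces are representation independent, any other representation would serve equally well. Function names are illustrative.

```python
from itertools import product

I2 = [[1, 0], [0, 1]]
SX = [[0, 1], [1, 0]]
SY = [[0, -1j], [1j, 0]]
SZ = [[1, 0], [0, -1]]

def kron(a, b):
    """Kronecker product of two 2x2 matrices, giving a 4x4 matrix."""
    return [[a[i][j] * b[k][l] for j in range(2) for l in range(2)]
            for i in range(2) for k in range(2)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def trace(a):
    return sum(a[i][i] for i in range(len(a)))

# gamma^0 = sigma_z (x) 1, gamma^i = (i sigma_y) (x) sigma_i
G = [kron(SZ, I2)] + [kron([[0, 1], [-1, 0]], s) for s in (SX, SY, SZ)]
METRIC = [[1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, -1]]

def tr_gammas(*indices):
    """Trace of the product gamma^mu1 gamma^mu2 ... for the given indices."""
    out = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
    for mu in indices:
        out = matmul(out, G[mu])
    return trace(out)

ok2 = all(abs(tr_gammas(m, n) - 4 * METRIC[m][n]) < 1e-12
          for m, n in product(range(4), repeat=2))
ok3 = all(abs(tr_gammas(m, n, r)) < 1e-12
          for m, n, r in product(range(4), repeat=3))
ok4 = all(abs(tr_gammas(m, n, r, s)
              - 4 * (METRIC[m][n] * METRIC[r][s] + METRIC[m][s] * METRIC[n][r]
                     - METRIC[m][r] * METRIC[n][s])) < 1e-12
          for m, n, r, s in product(range(4), repeat=4))

# gamma^5 = i gamma^0 gamma^1 gamma^2 gamma^3
g0123 = matmul(matmul(G[0], G[1]), matmul(G[2], G[3]))
G5 = [[1j * x for x in row] for row in g0123]
tr5 = trace(matmul(G5, g0123))   # Tr[gamma^5 gamma^0 gamma^1 gamma^2 gamma^3]
print(ok2, ok3, ok4, tr5)
```

The last trace equals −4i, which fixes the sign convention ε^{0123} = +1 used in the ε-tensor formula.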



C.2.2 

Levi-Civita Antisymmetric Tensor 




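$$\epsilon_{ijk} = \begin{cases} +1 & ijk = \text{even permutation of } 123 \\ -1 & ijk = \text{odd permutation of } 123 \\ 0 & \text{if any two indices are the same} \end{cases}$$

$$\epsilon^{\mu\nu\rho\sigma} = -\epsilon_{\mu\nu\rho\sigma} = \begin{cases} +1 & \mu\nu\rho\sigma = \text{even permutation of } 0123 \\ -1 & \mu\nu\rho\sigma = \text{odd permutation of } 0123 \\ 0 & \text{if any two indices are the same} \end{cases}$$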

X — fivpo , A = a /3 yd 
I = vpa, I = Pyd 
-g pd g ay ) 



(C.7b) 



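Spin Sum of $|M_{fi}|^2$ 

$$K^{\mu\nu}(p', p) = \sum_{r,s}\left[\bar{u}_s(p')\gamma^\mu(a - b\gamma^5)u_r(p)\right]\left[\bar{u}_s(p')\gamma^\nu(a - b\gamma^5)u_r(p)\right]^*$$

$$= \mathrm{Tr}\left[\gamma^\mu(a - b\gamma^5)(\slashed{p} + m)(a^* + b^*\gamma^5)\gamma^\nu(\slashed{p}' + m')\right]$$

$$= 4\left[(|a|^2 + |b|^2)\{p'^\mu p^\nu + p'^\nu p^\mu - g^{\mu\nu}(p'\cdot p)\} + (|a|^2 - |b|^2)\,mm'g^{\mu\nu} - 2i\,\mathrm{Re}(ab^*)\,\epsilon^{\mu\nu\rho\sigma}p'_\rho p_\sigma\right] \tag{C.9}$$

For antiparticles, replace m or m' or both → −m, −m'. 
C.3.1 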

A Frequently Used Example 



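e(p_1) + μ(p_2) → e(p_3) + μ(p_4) 

(1) VV scattering; a = 1, b = 0, $m_1 = m_3 = m$, $m_2 = m_4 = M$ 

$$K^{\mu\nu}(p_3, p_1) = 4\left[\{p_3^\mu p_1^\nu + p_3^\nu p_1^\mu - g^{\mu\nu}(p_1\cdot p_3)\} + m^2g^{\mu\nu}\right] = 4\left[p_3^\mu p_1^\nu + p_3^\nu p_1^\mu + g^{\mu\nu}\frac{q^2}{2}\right], \quad q = p_1 - p_3$$

$$K^{\mu\nu}(p_3, p_1)K_{\mu\nu}(p_4, p_2) = 16\left[2(p_1\cdot p_2)(p_3\cdot p_4) + 2(p_1\cdot p_4)(p_2\cdot p_3) + q^2\{(p_1\cdot p_3) + (p_2\cdot p_4)\} + q^4\right]$$

$$= 8\left[(s - m^2 - M^2)^2 + (u - m^2 - M^2)^2 + 2t(m^2 + M^2)\right]$$

$$= 16\left[(s - m^2 - M^2)(m^2 + M^2 - u) + t(m^2 + M^2) + \frac{t^2}{2}\right]$$

$$\simeq 8(s^2 + u^2) \quad \text{for } s \gg m^2, M^2$$

The spin averaged matrix element becomes 

$$\overline{|M_{fi}|^2} = \frac{1}{4}\sum_{\text{spin}}|M_{fi}|^2 \simeq \frac{2e^4}{q^4}(s^2 + u^2)$$

The cross section in the CM frame is given by 

$$\frac{d\sigma}{d\Omega} \simeq \frac{\alpha^2}{q^4}\frac{s^2 + u^2}{2s} = \frac{\alpha^2 s}{2q^4}\left[1 + \left(\frac{1 + \cos\theta}{2}\right)^2\right] \tag{C.10c}$$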
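
The equivalence of the three forms of the contraction in case (1) can be checked with explicit CM-frame four-momenta. The following sketch assumes elastic kinematics e(p₁) + μ(p₂) → e(p₃) + μ(p₄) with t = q² = (p₁ − p₃)²; the masses, momentum and angle are arbitrary test values, not taken from the text.

```python
import math

def dot(a, b):
    """Minkowski product with metric (+, -, -, -)."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]

def add(a, b):
    return tuple(x + y for x, y in zip(a, b))

def sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

m, M = 0.3, 1.1          # electron-like and muon-like masses (arbitrary units)
p, theta = 2.0, 0.7      # CM momentum and scattering angle

E1 = math.sqrt(p * p + m * m)
E2 = math.sqrt(p * p + M * M)
p1 = (E1, 0.0, 0.0, p)
p2 = (E2, 0.0, 0.0, -p)
p3 = (E1, p * math.sin(theta), 0.0, p * math.cos(theta))
p4 = (E2, -p * math.sin(theta), 0.0, -p * math.cos(theta))

s = dot(add(p1, p2), add(p1, p2))
t = dot(sub(p1, p3), sub(p1, p3))
u = dot(sub(p1, p4), sub(p1, p4))

# Form written with scalar products
form_dots = 16 * (2 * dot(p1, p2) * dot(p3, p4) + 2 * dot(p1, p4) * dot(p2, p3)
                  + t * (dot(p1, p3) + dot(p2, p4)) + t * t)
# Forms written with Mandelstam variables
form_stu = 8 * ((s - m * m - M * M) ** 2 + (u - m * m - M * M) ** 2
                + 2 * t * (m * m + M * M))
form_alt = 16 * ((s - m * m - M * M) * (m * m + M * M - u)
                 + t * (m * m + M * M) + t * t / 2)
print(form_dots, form_stu, form_alt)
```

The agreement relies on s + t + u = 2m² + 2M², which the explicit momenta satisfy automatically.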

(2) LL, LR, RL, RR scattering; (L: a = b = 1/2, R: a = −b = 1/2) 

$$K_L^{\mu\nu}(p_3, p_1) = 2\left[\{p_3^\mu p_1^\nu + p_3^\nu p_1^\mu - g^{\mu\nu}(p_1\cdot p_3)\} - i\epsilon^{\mu\nu\rho\sigma}p_{3\rho}p_{1\sigma}\right]$$

$$K_R^{\mu\nu}(p_3, p_1) = 2\left[\{p_3^\mu p_1^\nu + p_3^\nu p_1^\mu - g^{\mu\nu}(p_1\cdot p_3)\} + i\epsilon^{\mu\nu\rho\sigma}p_{3\rho}p_{1\sigma}\right]$$

$$K_L^{\mu\nu}(p_3, p_1)K_{L\mu\nu}(p_4, p_2) = K_R^{\mu\nu}(p_3, p_1)K_{R\mu\nu}(p_4, p_2) = 16(p_1\cdot p_2)(p_3\cdot p_4) = 4(s - m^2 - M^2)^2 \simeq 4s^2 \quad (s \gg m^2, M^2)$$

$$K_L^{\mu\nu}(p_3, p_1)K_{R\mu\nu}(p_4, p_2) = K_R^{\mu\nu}(p_3, p_1)K_{L\mu\nu}(p_4, p_2) = 16(p_1\cdot p_4)(p_2\cdot p_3) = 4(u - m^2 - M^2)^2 \simeq 4u^2 \quad (|u| \gg m^2, M^2) \tag{C.10d}$$

$e(p_1) + \bar{e}(p_2) \to \mu(p_3) + \bar{\mu}(p_4)$ 

Here, we consider only VV scattering. Then, $K^{\mu\nu}(p', p)$ in Eq. (C.9) becomes 

$$K^{\mu\nu}(p', p) = 4\left[\{p'^\mu p^\nu + p'^\nu p^\mu - g^{\mu\nu}(p'\cdot p)\} - m^2g^{\mu\nu}\right] = 4\left[p'^\mu p^\nu + p'^\nu p^\mu - g^{\mu\nu}\frac{s}{2}\right], \quad s = (p + p')^2 \tag{C.11}$$

$$K^{\mu\nu}(p_2, p_1)K_{\mu\nu}(p_4, p_3) = 16\left[2(p_1\cdot p_3)(p_2\cdot p_4) + 2(p_1\cdot p_4)(p_2\cdot p_3) - s\{(p_1\cdot p_2) + (p_3\cdot p_4)\} + s^2\right]$$

$$= 8\left[(t - m^2 - M^2)^2 + (u - m^2 - M^2)^2 + 2s(m^2 + M^2)\right] \tag{C.12}$$



£ 3 = £ 4 = J p j + M 2 , 

s = 4£ 2 = 4£ 2 = 4£ 1 £ 3 , v\ = '"^ , ?! = " 
(Pi-P3)(P2-P4) = (£ii?3-piP3COS0) 2 , 

(P1-P4)(P2-P3) = (£1£3 + P1P3COS0) 2 

Inserting Eq. (C.13) in Eq. (C.12), we have 

4"(p 2 , Pl )K V/ , F (p4,P3) 

= 16[4(£ 2 £ 2 + p\p\ cos 2 0) + s(m 2 + M 2 )] 

,(m 2 + M 2 ) 



= 4s 2 (1 + c 



; 2 0) + 4^ 



da 



, P3 1 1 (47ra) 2 
jjj 64jt 2 s 4 s 2 



l 2 <9 + 16 — 



(C.14) 
(C.15) 



The total cross section is 




4jt« 2 p 3 T (m 2 +M 2 ) 4m 2 M 2 l 

<7tot = "^— — 1 + 2- ' + j— 

3s Pi L s s 2 J 




4na 2 P f { l-ft-Pj (1-/5?)(1 


-^/)" 


3s /3, L 1 ' 2 ' 4 





(C.17a) 
(C.17b) 



where $\beta_{i,f} = p_{i,f}/E_{i,f}$ is the particle's initial and final velocity in the CM frame. 
C.3.2 

Polarization Sum of the Vector Particle 

Polarization vectors $\epsilon^\mu(\lambda)$ are orthogonal to the momentum $q^\mu$ of the particle, for the 
photon by the Lorentz condition and for the massive boson by Proca's equation: 

$$q_\mu\epsilon^\mu(\lambda) = 0 \tag{C.18}$$

Only the transverse polarizations are allowed for the real photon, and they satisfy 

$$\sum_{\lambda=\pm}\epsilon^i(\lambda)\epsilon^j(\lambda)^* = \delta_{ij} - \frac{q^iq^j}{|\mathbf{q}|^2} \tag{C.19}$$

For the case of the massive vector boson, 

$$\sum_\lambda\epsilon^\mu(\lambda)\epsilon^\nu(\lambda)^* = -\left(g^{\mu\nu} - \frac{q^\mu q^\nu}{m^2}\right) \tag{C.20}$$



This can be understood by first considering the polarization vectors in the particle 
rest frame, then boosting in the direction of the momentum. Define 
four polarization vectors $\epsilon^\mu(\lambda)$ (λ = 0 ~ 3) in the rest frame of the particle by 

$$\epsilon^\mu(0) = (1, 0, 0, 0), \quad \epsilon^\mu(1) = (0, 1, 0, 0), \quad \epsilon^\mu(2) = (0, 0, 1, 0), \quad \epsilon^\mu(3) = (0, 0, 0, 1) \tag{C.21a}$$

Alternatively we can adopt circular polarizations for the transverse components. In 
the coordinate system where the 4-momentum of the particle is given by 

$$q^\mu = (\omega, 0, 0, |\mathbf{q}|) \tag{C.21b}$$

The four polarization vectors are boosted to 



£"( + ) = 


-^(0,l, +l ,0) 


£"(-) = 


-L(0,l,-i,0) 


e"(J) = 


0Ao,o.m 



Here, we used the names ± for circular, scalar (s) for the 0th and longitudinal (l) for the third 
component of the polarization. The polarization of the longitudinal photon is parallel 
to the momentum. They satisfy the orthogonality condition 

$$\epsilon^\mu(\lambda)\epsilon_\mu(\lambda')^* = g^{\lambda\lambda'} \tag{C.21d}$$

and the completeness condition 

$$\sum_{\lambda,\lambda'}g^{\lambda\lambda'}\epsilon^\mu(\lambda)\epsilon^\nu(\lambda')^* = g^{\mu\nu}$$

The requirement of the constraint Eq. (C.18) excludes the scalar polarization, and the 
completeness condition becomes Eq. (C.20). 

For virtual photons with time-like momentum (i.e. $q^2 > 0$), the above discussion 
can be extended by simply replacing m by $\sqrt{q^2}$. 

For space-like momentum ($q^2 < 0$), at least one of the three polarization vectors 
has to be time-like. We cannot use $\epsilon^\mu(s)$ of Eq. (C.21c), because it does not satisfy 
the constraint Eq. (C.18). The condition Eq. (C.18) is satisfied by using $\epsilon^\mu(l)$ but 
replacing the mass with $\sqrt{-q^2}$, because 



e(l),e"(l) = 



> 



(C.22a) 



What about the completeness condition? For space-like $q^\mu$, there is a coordinate 
system that makes $|\mathbf{q}| \neq 0$, $q^0 = 0$, because the energy-momentum of the space-like 
virtual photon is created by a charged particle, namely $q = p_i - p_f$, where $p_i$, $p_f$ 
are the time-like momenta of the particle before and after emission of the photon. 
In that coordinate system $\epsilon^\mu(l) = (1, 0, 0, 0)$, which agrees with $\epsilon^\mu(0)$ defined in 
Eq. (C.21). Accordingly the metric for the summation has to be changed and the 
completeness condition is kept unchanged. 

£ (-1)'+V<(2K(2)* = - (> - ££) (C.23) 

$\epsilon^\mu(l)$ is time-like and is called the scalar polarization. 



Other Useful Formulae 



A + B+ C = (2ji) 4 6 4 (P -p 3 - p 4 ) 



11 (2jt) 6 2E 3 2E 4 
x [a + b u p>i + c^p^pZ] 
_ A(l,*,y)r 




(4P"P"-PV) 



2(1, x, y) = (I + x 2 + y 2 -2x -2y - 2xyf 
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Especially, when m 3 = m 4 = 

a b„ PP 

A= — , B = -£ — , 
Sji 16tt 

C=^(2P"P"+PV) (C25) 

To prove Eq. (C.24), use Lorentz invariance to show that 

A=X{P 2 ), B = b u P f, Y(P 2 ) , C = c uv [P f 'P r Z(P 2 ) + g f "'W(P 2 )] 

(C.26) 

and perform the calculation in the P rest frame. 



Appendix D 

Dimensional Regularization 



Photon Self-Energy 



We demonstrate the method of dimensional regularization by calculating the pho- 
ton self-energy loop diagram. 



J (2*)+ [ y V >k 



(2nf 

x W-4+**) 1 (D . 1} 

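and prove that 

$$\Pi^{\mu\nu}(q) = (g^{\mu\nu}q^2 - q^\mu q^\nu)\Pi_\gamma(q^2) \tag{D.2a}$$

$$\Pi_\gamma(q^2) = \Pi_\gamma(0) + \hat\Pi_\gamma(q^2) \tag{D.2b}$$

in which $\hat\Pi_\gamma(q^2)$ is a finite analytical function of $q^2$. 
The trace calculation yields 

$$\mathrm{Tr}[\cdots] = \mathrm{Tr}[(\slashed{k} + m)\gamma^\nu(\slashed{k} - \slashed{q} + m)\gamma^\mu] = 4\left[2k^\mu k^\nu - (k^\mu q^\nu + k^\nu q^\mu) - g^{\mu\nu}(k^2 - k\cdot q - m^2)\right]$$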



, r d 4 k 



^ [2k"k v - (k"q v + k v q") - g" v (k 2 -k-q- m 2 )] 

[k 2- m 2 + i£][{k _ q) 2_ m 2 + iE] 

which is a diverging integral for k → ∞. The above integral Eq. (D.1) contains 
an $O(k^2)$ term in the numerator, which is quadratically divergent. In dimensional 
regularization, it appears as a D = 2 pole, as the quadratic divergence in D = 4 
becomes logarithmic in D = 2 space. The gauge invariance, however, guarantees 
that the divergence is at most of logarithmic type, as is shown below. Since the 
integral is well defined for D < 2 and is an analytic function of D, it is allowed to 
assume that D is an integer larger than 4, perform the integration formally, then 
take the limit D → 4. 
To perform the integration in D-dimensional space, we introduce 

$$k^\mu = (k^0, k^1, k^2, k^3) \to (k^0, k^1, \cdots, k^{D-1}) = (k^0, \mathbf{k}) \tag{D.4a}$$

$$\frac{d^4k}{(2\pi)^4} \to \mu^{4-D}\frac{d^Dk}{(2\pi)^D} \tag{D.4b}$$

$$g^{\mu\nu}g_{\mu\nu} = 4 \to g^{\mu\nu}g_{\mu\nu} = D \tag{D.4c}$$

$$k^2 = (k^0)^2 - \mathbf{k}^2 = (k^0)^2 - \sum_{i=1}^{D-1}(k^i)^2 \tag{D.4d}$$

μ is an energy scale parameter introduced to keep the physical dimension. Although 
the introduction of μ is redundant in the following argument, we keep it because 
it is indispensable when we extend our discussion to QCD. We introduce the 
n for later convenience: 

(D.5a) 



= h (i)(2; 

Jk i J (2^) D 



k (1)(2) 
where we have defined 

1 d D k 

(l) = k 2 -m 2 + ie, { 2) = (k-q) 2 -m 2 + ie (D.5c) 



(D.5b) 



Applying the Feynman parameter formula 

$$\frac{1}{ab} = \int_0^1\frac{dx}{[(1 - x)a + xb]^2}$$
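
The Feynman parameter formula can be confirmed by direct numerical integration. This is a minimal sketch using a composite Simpson rule; the function names and the sample values of a and b are illustrative, and positive real a, b are assumed so that the integrand has no singularity on [0, 1].

```python
def simpson(f, lo, hi, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3

def feynman_integral(a, b):
    """Integral over x in [0, 1] of 1 / [(1 - x) a + x b]^2."""
    return simpson(lambda x: 1.0 / ((1 - x) * a + x * b) ** 2, 0.0, 1.0)

a, b = 2.0, 5.0
lhs = 1.0 / (a * b)
rhs = feynman_integral(a, b)
print(lhs, rhs)
```

Analytically the integral is [1/a − 1/b]/(b − a), which reduces to 1/(ab); the numerical value reproduces this to high accuracy.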
The integrand can be transformed to a more convenient form for integratio 



-«"•—£*/& 



(2jr)« 

x [2fc"F - (fc"q" + FQ - g""(k 2 -k-q- m 2 )} 
[{k _ xq) 2_ (m 2_ x{1 _ x)q i )] 2 

Let us calculate B first. It is expressed as 

f 1 16jt 2 a n f 1 , d D k 1 

B ° - /, m - — " /. '"i2^ [ (t -, < |.- M .p <■»■> 

$$M^2 = -q^2x(1 - x) + m^2 - i\varepsilon \tag{D.8b}$$

Changing the integration variable from k to k' = k − xq, and introducing $\omega^2 = \sum_{i=1}^{D-1}(k'^i)^2 + M^2$, 
we find that the denominator becomes 

$$(k^0)^2 - \omega^2 + i\varepsilon = [k^0 - (\omega - i\varepsilon)][k^0 - (-\omega + i\varepsilon)] \tag{D.9}$$

It has poles at $\omega - i\varepsilon$ and $-\omega + i\varepsilon$. Therefore, we can rotate the integration path from 
$k^0 = -\infty \sim \infty$ to $-i\infty \sim i\infty$, which is called the Wick rotation. Then writing 

$$k^2 = (k^0)^2 - \sum_{i=1}^{D-1}(k^i)^2 = -r^2, \quad r^2 = \sum_{i=1}^{D}(k^i)^2 \quad (k^0 = ik^D) \tag{D.10}$$

As r is the length in the Euclidean space, we can adopt the polar coordinate system 
and integrate over all the angular variables, giving the D-dimensional solid angle $\Omega_D$. It 
can be calculated as follows. Using the equality 



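$$\int_{-\infty}^{\infty}dx\,e^{-x^2} = \sqrt{\pi} \;\Rightarrow\; \int dx_1\cdots\int dx_D\,\exp\left[-\sum_{i=1}^{D}x_i^2\right] = \pi^{D/2}$$

and recalculating it using the polar coordinates 

$$\Omega_D\int_0^\infty e^{-r^2}r^{D-1}dr = \pi^{D/2}$$

Using the definition of the Γ function 

$$\Gamma(x) = \int_0^\infty e^{-t}t^{x-1}dt$$

we obtain 

$$\Omega_D = \frac{2\pi^{D/2}}{\Gamma(D/2)}$$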
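
The result for the D-dimensional solid angle, Ω_D = 2π^{D/2}/Γ(D/2), can be cross-checked against the familiar values 2π, 4π and 2π² for D = 2, 3, 4, and against the radial Gaussian integral from which it was derived. The sketch below uses only the standard `math` module; the cutoff `r_max` and the grid size are assumptions chosen so that the truncated tail is negligible.

```python
import math

def solid_angle(D):
    """Omega_D = 2 pi^(D/2) / Gamma(D/2); D may be non-integer."""
    return 2 * math.pi ** (D / 2) / math.gamma(D / 2)

def radial_integral(D, r_max=12.0, n=4000):
    """Simpson estimate of the integral of exp(-r^2) r^(D-1) over [0, r_max]."""
    f = lambda r: math.exp(-r * r) * r ** (D - 1)
    h = r_max / n
    total = f(0.0) + f(r_max)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(i * h)
    return total * h / 3

for D in (2, 3, 4, 3.5):
    print(D, solid_angle(D), solid_angle(D) * radial_integral(D), math.pi ** (D / 2))
```

Because the formula is analytic in D, the same function can be evaluated at non-integer D, which is what dimensional regularization exploits.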





Now we can perform the integration in Eq. (D.8). At this point, we extend the power 
of the denominator from two to more general n for later convenience. Then in the 
Wick rotated coordinate, we obtain 



f d D k „ f d D k „ r°° r D_1 d 

_ ( w^.(£.-f) 



(D.15a) 



B l p, q) =rdt tq ~\ = r{P)F{q) (D.lSb) 
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Then 



_ 167ty- D (-l)"t /- 1 2tt d / 2 M p - 2 "r(D/2)r(n-D/2) 
° » (2*)0 Jo r(D/2) 2 /» 

(D.16a) 

1 ' Jo L (M 2 )"" D / 2 J /» l ' 

As one can see, n = 2, D = 4 is a pole of the Γ function. The residue at the pole 
can be calculated by using the infinite product representation for Γ 

$$\frac{1}{\Gamma(z)} = z\,e^{\gamma_Ez}\prod_{n=1}^{\infty}\left(1 + \frac{z}{n}\right)e^{-z/n} \tag{D.17}$$

Setting n = 2 and defining an infinitesimal number $\epsilon = 4 - D$, and using 

$$\Gamma\left(\frac{\epsilon}{2}\right) = \frac{2}{\epsilon} - \gamma_E + O(\epsilon) \tag{D.18a}$$

$$x^{\epsilon/2} = e^{(\epsilon/2)\ln x} = 1 + \frac{\epsilon}{2}\ln x + O(\epsilon^2) \tag{D.18b}$$

we obtain 

$$B_0(q^2, m^2) = \Delta - \int_0^1dx\,\ln M^2 = \Delta - \int_0^1dx\,\ln[-q^2x(1 - x) + m^2 - i\varepsilon] \tag{D.19a}$$

$$\Delta = \frac{2}{\epsilon} - \gamma_E + \ln(4\pi\mu^2) \tag{D.19b}$$

where $\gamma_E \simeq 0.5772$ is the Euler constant. The second term in $B_0$ is a finite analytical 
function. The divergent term is included in the first term as a pole in the complex 
D-plane. $\gamma_E$ and $\ln(4\pi\mu^2)$ in Δ always accompany 2/ε as a unit and cancel in the 
observable quantities. Therefore, they are to be subtracted together with 2/ε in the 
renormalization prescription. In QCD, where there is no special point at which to normalize 
the coupling constant with the experimental value, $\ln\mu^2$ is kept as an adjustable 
parameter. 
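
The two expansions (D.18a) and (D.18b) are easy to check numerically for a small value of ε = 4 − D. The sketch below is only illustrative; the value of ε, the test point x and the variable names are arbitrary choices, and the quoted error estimates come from the next terms of the respective series.

```python
import math

EULER_GAMMA = 0.5772156649015329   # Euler constant gamma_E

def gamma_pole_approx(eps):
    """Leading terms of Gamma(eps/2): 2/eps - gamma_E."""
    return 2.0 / eps - EULER_GAMMA

eps = 1e-3
gamma_exact = math.gamma(eps / 2)
gamma_approx = gamma_pole_approx(eps)

x = 3.7
pow_exact = x ** (eps / 2)
pow_approx = 1 + 0.5 * eps * math.log(x)

# The differences are O(eps) and O(eps^2) respectively
print(gamma_exact - gamma_approx, pow_exact - pow_approx)
```

Multiplying the two expansions is what produces the combination 2/ε − γ_E + ln(...) that appears in Δ.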

To calculate other terms in Eq. (D.7), the following formulas are useful and easy 
to verify: 



J [fc 2 -M 2 + ie]« 

[ l2 k " d 2 Dk = 

J [k 2 -M 2 + i£]« 

r k 2 d D k 

J [k 2 -M 2 + i 





-°/ 2 /» 




(D.20a) 
(D.20b) 


i(jrfl 2 (-ir 


-x d/2 r( 


»-l- 


f) 


( M 2)„-l-D/2 


/» 
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( ]2 k " kV f k = S -[ 12 k2d : k (D.20d) 

J [k 2 -M 2 + i£]« D J [fc 2 -M 2 + ie]" l ' 

To derive the third equation, $\Gamma(z + 1) = z\Gamma(z)$ is used. The last equation can be 
proved by noting that the integral must be proportional to $g^{\mu\nu}$ by Lorentz invariance 
and that $g^{\mu\nu}g_{\mu\nu} = D$. 

To calculate $\Pi^{\mu\nu}$, we rewrite k in the numerator of Eq. (D.7) with $k'^\mu = k^\mu - xq^\mu$ 
and drop terms linear in k' as they vanish: 

Jo J (2?r) 4 
2fe'' fe" - (k''q v + k r q' 1 ) - g>"'{k 2 - k ■ q - m 2 ) 
X [(k - xq) 2_ {m 2_ x(1 _ x)q 2 )] 2 



2 f l J f d D k' 
Jo J (2jc) d 



(2;r 
2fc'"fc' v - 2x(l - x)q"q" - g uv [k' 2 - q 2 x(l - x) - 



Inserting Eq. (D.20) , we obtain 

+ {-2%(1 - x) q "q» + g""[q 2 x(l - x) + m 2 })f (2 - |)] 

(D.22a) 

As $(D/2 - 1)\Gamma(1 - D/2) = -\Gamma(2 - D/2)$, we see that the pole at D = 2, which 
represents the quadratic divergence, vanishes. The reduction of the power of the divergent 
term by the gauge invariance has worked! 

= ^VV - <A V ) / d * *(1 - x ) \ A ~ MM 2 - is)] 
n Jo 

= (g"V -<?'V) 
x [^ " T I d * * (1 " x) M-? 2 *^ - *) + ™ 2 - i£ ]] 
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Using 77'"' = (gf'q 2 — q f 'q y )n y ,we finally obtain 

77 y = 77,, (0) + fl y (q 2 ) (D.24a) 

77 y (0) = ^- (A - In m 2 ) (D.24b) 

x(1 _ x)l J 1 _ -^ x(l-x)-ie] (D.24c) 



it h 



Electron Self-Energy 

The electron self energy is given by 

eU ^ 7 (27r) 4 (p-fc) 2 -/* 2 + ie l Y >k 2 -m 2 + i£ y Y ' K ' 

where μ is a small mass introduced to avoid the infrared divergence. Using the 
Feynman parametrization, it becomes 

J J (2xf[(k-xp) 2 -M 2 + i £ } 2 

Jo J (2jt)* [fc 2 - M 2 + is] 2 K ' 

M 2 = -p 2 x(l - x) + (1 - x)m 2 + xfi 2 - is (D.26b) 

where we have used the relations $\gamma_\mu\slashed{p}\gamma^\mu = -2\slashed{p}$, $\gamma_\mu\gamma^\mu = 4$. This is the same 
integral as Eq. (D.8) except for the numerator, and its result is already given by 
Eq. (D.19). Thus 

Ie{P) = h\o dx ( 2m ~ x ^ H + ln m2 ] P.27) 

We can easily calculate 6 m and <5 Z 2 defined in Eq. (8.68) 

Z e (p) = dm + [dZ 2 + fl e (p')](P f - m R ) (D.28a) 

dm = 2,(j>)|,- Bji = ^m* f-^ + jf dx (2- x) In Ml] + 0(a 2 ) 





In deriving the last equation we used $m = m_R + \delta m = m_R + O(\alpha)$. 




° Z 2 = a , 




- a \l A Cdx^xlnM 2 1 2~i*(l-*)(2-*)n , o^ 




~ 2^ L 2 A d *j* lnM * + M 2 jJ + 0(«) 




(D.28c) 




M^ = (l - x) 2 m\ + xfi 2 - i e (D.28d) 




In calculating Mj, and its derivative, we have used p 2 =// and 




M^ = M 2 |, = mK = -m\x(l -x) + (l- x)(m R + dm) 2 + xfi 2 - ie 




= (l-x) 2 m 2 R + xfi 2 -i£+ 0(a) (D.29) 





We see, the first derivative of S e (p) still includes the divergent term. But fl c (^) is 
finite and satisfies the relation 

iW)l#-»i = o (D.30; 
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Rotation Matrix 



Angular Momentum Operators 

An operator to rotate a physical object by an angle α around the axis having a unit 
vector n(θ, φ) is given by 

$$R = \exp[-i\alpha(\mathbf{n}\cdot\mathbf{J})] \tag{E.1}$$

The angular momentum operator $J_i = L_i + S_i$ satisfies the commutation relation 

$$[J_i, J_j] = i\epsilon_{ijk}J_k \tag{E.2}$$

$\mathbf{J}^2 = J_1^2 + J_2^2 + J_3^2$ commutes with all $J_i$, and has an eigenvalue j(j + 1), j = 
0, 1/2, 1, ···. One of the $J_i$ (conventionally $J_3$) and $\mathbf{J}^2$ can be diagonalized 
simultaneously, so the eigenvector can be specified by two values, j and $j_3 = m$. Writing 
$J_\pm = J_1 \pm iJ_2$, 

$$\mathbf{J}^2|jm\rangle = j(j + 1)|jm\rangle$$

$$J_3|jm\rangle = m|jm\rangle \tag{E.3}$$

$$J_\pm|jm\rangle = \sqrt{j(j + 1) - m(m \pm 1)}\,|j\,m \pm 1\rangle$$
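
The matrix elements in (E.3) are sufficient to build explicit matrices for any j and to confirm the commutation relation (E.2). The sketch below assumes the basis ordering m = j, j − 1, ..., −j and takes 2j as an integer argument; the function names are illustrative.

```python
import math

def ladder_matrices(two_j):
    """Return (J+, J-, J3) for spin j = two_j / 2 in the basis m = j, ..., -j."""
    j = two_j / 2
    dim = two_j + 1
    ms = [j - i for i in range(dim)]
    jp = [[0.0] * dim for _ in range(dim)]
    for col in range(1, dim):
        m = ms[col]
        # <j, m+1| J+ |j, m> = sqrt(j(j+1) - m(m+1))
        jp[col - 1][col] = math.sqrt(j * (j + 1) - m * (m + 1))
    jm = [[jp[c][r] for c in range(dim)] for r in range(dim)]   # transpose of J+
    j3 = [[ms[r] if r == c else 0.0 for c in range(dim)] for r in range(dim)]
    return jp, jm, j3

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def residuals(two_j):
    """Max deviations of [J1, J2] - i J3 and of J^2 - j(j+1) from zero."""
    jp, jm, j3 = ladder_matrices(two_j)
    n = two_j + 1
    j = two_j / 2
    j1 = [[(jp[r][c] + jm[r][c]) / 2 for c in range(n)] for r in range(n)]
    j2 = [[-0.5j * (jp[r][c] - jm[r][c]) for c in range(n)] for r in range(n)]
    ab, ba = matmul(j1, j2), matmul(j2, j1)
    comm = max(abs(ab[r][c] - ba[r][c] - 1j * j3[r][c])
               for r in range(n) for c in range(n))
    sq = [matmul(x, x) for x in (j1, j2, j3)]
    cas = max(abs(sq[0][r][c] + sq[1][r][c] + sq[2][r][c]
                  - (j * (j + 1) if r == c else 0.0))
              for r in range(n) for c in range(n))
    return comm, cas

for two_j in (1, 2, 3, 4):
    print(two_j / 2, residuals(two_j))
```

For j = 1/2 the construction reproduces σ/2, and for j = 1 it gives the 3 × 3 spin matrices in the spherical basis.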

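For a spinless particle, J = L and 

$$L_i = x_jp_k - x_kp_j = -i(x_j\partial_k - x_k\partial_j) \tag{E.4}$$

Eigenfunctions of the orbital angular momentum are spherical harmonic functions. 
They are, for $\mathbf{L}^2 = L(L + 1)$, $L_3 = M$, $-L \le M \le L$, given as 

$$Y_{LM}(\theta, \phi) = (-1)^{(M + |M|)/2}\left[\frac{2L + 1}{4\pi}\frac{(L - |M|)!}{(L + |M|)!}\right]^{1/2}P_L^{|M|}(\cos\theta)\,e^{iM\phi} \tag{E.5a}$$

$$P_L^M(x) = \frac{1}{2^LL!}(1 - x^2)^{M/2}\frac{d^{L+M}}{dx^{L+M}}(x^2 - 1)^L \tag{E.5b}$$

$$Y_{L,-M}(\theta, \phi) = (-1)^M[Y_{LM}(\theta, \phi)]^* \tag{E.5c}$$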
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V 32jt 
rT 5 cos 3 < 



n 0(5 cos 2 6 -l)e ±ut > 



For I = 0, J = S, and S, are (2S + 1) x (2S + 1) 
where <7 is the Pauli spin matrix. 



The eigenfunctions are 

For S = 1, we can adopt eigenfunctio 



For S = 1/2, S = <7/2, 

(E.7) 



in o" 
= 2 [o -1 



For which we can easily construct S± using Eq. (E.3) 



TO V2 

S+ = V2 

I 



r o o oi 

= U/2 , 

L V2 oj 



10 

0-] 
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E.2 

Addition of the Angular Momentum 

Adding $\mathbf{J}_1$ and $\mathbf{J}_2$ vectorially, 

$$\mathbf{J} = \mathbf{J}_1 + \mathbf{J}_2 \tag{E.12}$$

the ranges of the eigenvalues J, M are 

$$|j_1 - j_2| \le J \le j_1 + j_2, \quad M = m_1 + m_2 \tag{E.13}$$

The eigenvectors of J are given by 

$$|JM\rangle = \sum_{m_1, m_2}C^{JM}_{j_1m_1j_2m_2}|j_1m_1\rangle|j_2m_2\rangle \tag{E.14}$$

The $C^{JM}_{j_1m_1j_2m_2}$ are called the Clebsch-Gordan coefficients and their general forms are 
given in [334]. For the cases $j_2$ = 1/2, 1 they are given by 



Table E.2 C J J " M _ m . y2 m . 



J\m 



l "J'+ M + 1 ) 1 ' riiirMU 

L (2ji+l)(ji + l) J L (2jl+l)(2jl+2) . 

M [ (jl-M)(ji + M+l) ' 

[jKji + D] 1 ' 2 



[ Ul+ a^ +M) ] 



Rotational Matrix 

The effect of a rotation about the z-axis simply gives a phase factor to the eigenfunction 
|jm⟩, but that about the y-axis generates states of different m. As the |jm⟩ make a complete 
system if the value of j does not change, the new state can be expanded as their 
linear combination: 

$$e^{-i\theta J_y}|jm\rangle = \sum_nd^j_{nm}(\theta)|jn\rangle \tag{E.15}$$

$d^j_{nm}(\theta)$ is called the rotation matrix and is defined by 

$$d^j_{nm}(\theta) = \langle jn|e^{-i\theta J_y}|jm\rangle \tag{E.16}$$

The properties of the $d^j_{nm}$ function are 

$$d^j_{nm}(\theta) = d^j_{-m,-n}(\theta) = (-1)^{n-m}d^j_{mn}(\theta)$$

$$d^j_{nm}(\theta) = (-1)^{j+n}d^j_{n,-m}(\pi - \theta)$$

$$d^j_{nm}(\pi) = (-1)^{j+n}\delta_{n,-m}$$

$$d^j_{nm}(-\pi) = (-1)^{j-n}\delta_{n,-m}$$

The normalization is given by 

$$\int_{-1}^{1}d\cos\theta\;d^j_{nm}(\theta)\,d^{j'}_{nm}(\theta) = \frac{2}{2j + 1}\delta_{jj'}$$

A general formula for the rotation matrices is given by Wigner [334], but for j = 
1/2, 1, it is easily obtained by inserting Eqs. (E.7) and (E.11) into Eq. (E.16): 
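
For j = 1 the rotation matrix can be generated directly from its definition as the matrix exponential of −iθJ_y. The sketch below assumes the basis ordering m = +1, 0, −1 and the standard spin-1 matrix elements, for which −iJ_y is a real antisymmetric matrix; the exponential is summed as a plain Taylor series, and all names are illustrative.

```python
import math

R2 = math.sqrt(2.0)

# A = -i J_y for j = 1 in the basis m = +1, 0, -1
A = [[0.0, -1 / R2, 0.0],
     [1 / R2, 0.0, -1 / R2],
     [0.0, 1 / R2, 0.0]]

def expm(a, t, terms=40):
    """exp(t a) for a small real matrix, summed as a Taylor series."""
    n = len(a)
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[sum(term[i][l] * a[l][j] for l in range(n)) * t / k
                 for j in range(n)] for i in range(n)]
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result

def d1(theta):
    """d^1_{nm}(theta); row index n and column index m run over +1, 0, -1."""
    return expm(A, theta)

theta = 0.8
d = d1(theta)
d_pi = d1(math.pi)
print(d[0][0], (1 + math.cos(theta)) / 2)   # d^1_{1,1}
print(d[0][1], -math.sin(theta) / R2)       # d^1_{1,0}
print(d[1][1], math.cos(theta))             # d^1_{0,0}
```

The same routine can be used to confirm the symmetry and θ = π properties listed above, for example that d¹(π) has entries (+1, −1, +1) on the anti-diagonal.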



r d 1/2 d 1/2 1 

"1/2,1/2 "1/2,-1/2 

d V2 d 1 ' 2 

L"— 1/2,1/2 "l/2,-l/2j 



e Tcos f 

COSy-sin-(KT 2 ) = 



Slightly more general formulae are given in [217]. 
j = L = Integer 

i[ o (0) = -[L(L + I)]' 1 ' 2 sin 6 P' L (d) 

4>(«) = [(I " 1)1(1 + 1)(I + 2)r l,2 [2P' L - 1 (0) - 1(1 - l)Pi(0)] < E ' 21 ) 
P((z)=^-P L (z) 
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j = L + l/2 

4pmi(0) = (j + 1/2)" 

^ 1/ 2,l/2( ) = (j+ 1 /2)- 
<2,3/2(^) = (J + I/ 2 )" 
^X/2,3/2^) = 0' + V2)~ 



S 2 ( P j + l/2 P i-l/2> 
4(^+1/2 + ^-1/2) 

a e ( I j-v 2 P > 



J+3/2 p 
/ J " 1/2 



j-1/2 j+3/2 

j+3/2 P M + ij-l/2 P 



Appendix F 

C, P, T Transformation 



We summarize here the transformation properties of fields. Only the Dirac fields 
and their bilinears are given. Corresponding scalars, vectors etc. that obey the 
Klein-Gordon equations are similar to the corresponding Dirac bilinears. | q, p, s z ) 
means the state of a particle (q) or antiparticle (q) having momentum p, and spin 
component s = s z . (t, x) stands for (t, x). Combined operations usually give differ- 
ent phases depending on the order of the operation. But the Dirac bilinears do not 
because t]rj* = 1. 



(F.l) 



(t,x) 


-► 


(t,-x) 


P\q,p,s) 


= 


t] P \q,-p,i 


V(t,x) 


-> 


y°ip(t,-x 


V(t,x) 


->■ 


M>(t,-x)y { 


Vl1>2 


-> 


^1^2 


iiPiY 5 ip2 


->• 


-iVi/ 5 V: 


H>iY"H>2 


->• 


ViY, t i>2 


hv f 'r s n>2 


-► 


-y>iYuY S * 


Ip^O^lpj 


->■ 


Vl<V^2 



is attached to make ixpy^xp hermitian. 



ip(t,x) 


->■ 


Cy> (t,x) = iy 2 y> (t,x) 


ip(t.x) 


->■ 


-^(t.x)C- 1 


C- l yf'C 


= 


-yf' T 


C = -C 1 


= 


-C~ l =-C T = c* 


V^lVz 


-> 


?2^1 


ixp x y s ^2 


-► 


iViY*V>i 


n>xY ,l n>2 


->• 


-ip 2 yi*ipi 


ipiY"Y S *p2 


-► 


ip 2 y>'y 5 ipi 



(t, x) → (−t, x) 

(c-number) -+ (c-number)* 

(/:out|H|i:in) = (/ : out\T- 1 THT~ 1 T\i : in) 

= <T/:in|H|Ti:out>* 

T\q,p,s) = rj T \q,-p,-s) 

y(t,x) -> By>(-t,x) 

= iy 1 y i tl>(-t,x) = -iy 5 Cip(-t, 

ip(t,x) -> Tp(-t,x)B~ 1 = 7p(-t,x)y 1 y 3 



iipiY 5 Tp2 -> 


-i 


^l7 5 ^2 


Vi7"V2 -> 


^iJV^ 


iptfity 5 ^ ->• 


Vl 


.YnY V2 


V*i(^"V2 -> 


-Vi^-V'a 


(t,x) -* (t 


.-*) 




CP|q,p,s) = ?? 


c/.|q. 


-P,s) 


V(*.*) -* ± 


i'v'V 


'V-) 


V(t,x) -> T^ T (t 


,-x)iy 2 


^1^2 ->■ V s 


2V1 




iy^ipj -*■ - 


j^ 2 y 5 ^! 


*PiY f '*Pi ->■ - 


V 2 7/^i 


Vi/V^ -► - 


VzwVi 


jp 1 a>' v tp 2 -> - 


'/'2 fJ / 


,rV'l 


(*.*) 


-+ 


(-*,-*) 


(c-number) 


->■ 


(c-number)* 


(/:out|H|i:in) 


= 


(/: out|6> _1 6>H6> _1 ( 




= 


<0/:in|H|0i:out)* 


©|*,j>,s) 


= 


rj \q,p,-s) 


V(t.*) 


->• 


±iy y 5 ^ T (t,-%) 


V(*.*) 


"+ 


±ip T (t,-x)iy y 5 


^iVz 


-► 


^2^1 


iVi/ 5 ^2 


->• 


^ 2 y 5 Vi 


^/"Vz 


->• 


-V 2 y''Vi 


Vi/^y 5 ^ 


->■ 


—ipiY^y 5 ^! 



Appendix G 

SU(3), SU(n) and the Quark Model 1) 



Generators of the Group 



The set of all the linear transformations that preserve the length $L = \left(\sum_{i=1}^{N}|u_i|^2\right)^{1/2}$ 
in the N-dimensional coordinate space of N complex variables $u_i$ (i = 1 ~ N) is 
called U(N). With the further restriction of the determinant being 1, it is called 
SU(N), a special unitary group. The number of independent variables is $N^2$ for 
U(N) and $N^2 - 1$ for SU(N). Any representation matrix of the SU(N) group operating 
on n-dimensional vectors can be expressed as an n × n unitary matrix, which is 
a function of $N^2 - 1$ independent continuous variables $\theta_i$ and traceless hermitian 
matrices $F_i$: 

$$U = \exp\left(-i\sum_{i=1}^{N^2-1}\theta_iF_i\right), \quad F_i^\dagger = F_i, \quad \mathrm{Tr}[F_i] = 0 \tag{G.1}$$

The traceless condition makes the determinant of the matrix equal to one. It can 
be proved as follows. If we write $U = e^{iQ}$, Q is a hermitian matrix. Therefore, it 
can be diagonalized by using some unitary matrix S: $S^{-1}QS = \mathrm{diag}\{\eta_i\}$. Then 

$$\det U = \det[S^{-1}\exp(iQ)S] = \det\left[S^{-1}\sum_n\frac{(iQ)^n}{n!}S\right] = \det\left[\sum_n\frac{(iS^{-1}QS)^n}{n!}\right] = \det[\mathrm{diag}\{e^{i\eta_i}\}]$$

$$= \prod_ie^{i\eta_i} = \exp\left(i\sum_i\eta_i\right) = e^{i\mathrm{Tr}[Q]} \quad (= 1 \text{ if } \mathrm{Tr}[Q] = 0) \tag{G.2}$$

where rji's are eigenvalues of the diagonalized matrix Q. N 2 — 1 F;'s satisfy com- 
mutation relations called Lie algebra: 

1) References are [89, 101, 156, 165] 
The $F_i$'s are called the generators of the group and the $f_{ijk}$ are structure constants. Since all the representation matrices are analytic functions smoothly connected to the unit matrix in the limit $\theta_i \to 0$, the Lie algebra Eq. (G.3) completely determines the local structure of the group. 2) The $f_{ijk}$ can be made totally antisymmetric in the indices ijk. Determination of the generators is not unique, because different $F'_i$ can be made from any independent linear combination of the $F_i$. 3)

Let us write $\psi$ as representing an n-dimensional ($n \ge N$) contravariant vector (with upper index) on which the $n \times n$ SU(N) matrices act. Then $\psi$ is transformed to $\psi'$ by U.

$$\psi = (\xi^1, \xi^2, \ldots, \xi^n)^T \to \psi' = U\psi$$
$$\xi^a \to \xi'^a = U^a{}_b\,\xi^b, \qquad U = e^{-i\theta_i F_i} \tag{G.5c}$$

where simultaneous appearance of indices means a sum over the indices up to n. The collection of matrices with n = N is called a fundamental representation, and an adjoint (or regular) representation if $n = N^2 - 1$.

G.1.1

Adjoint Representation

The representation matrices with $n = N^2 - 1$ can be expressed in terms of the structure constants.

$$[F_a]_{bc} = -i f_{abc}, \qquad (a, b, c = 1 \sim N^2 - 1) \tag{G.6}$$

Proof: Insert $A = F_a$, $B = F_b$, $C = F_c$ in the Jacobi identity

$$[[A, B], C] + [[B, C], A] + [[C, A], B] = 0 \tag{G.7}$$

and use Eq. (G.3) and $f_{abc} = -f_{bac}$; then we obtain

$$f_{abd}f_{cde} + f_{bcd}f_{ade} + f_{cad}f_{bde} = 0 \tag{G.8}$$

Then insertion of Eq. (G.6) into the above equality leads to

$$[F_a, F_c] = i f_{acd} F_d \tag{G.9}$$

Therefore, $F_a$ defined in Eq. (G.6) is an adjoint representation of SU(N). □
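The statement of Eq. (G.6) can be checked numerically in the simplest case. The following is a minimal sketch, assuming SU(2), where the structure constants are the Levi-Civita symbol; the helper names (`levi_civita`, `adjoint_generators`, `max_algebra_violation`) are illustrative and not from the text.

```python
# Sketch: check that (F_a)_{bc} = -i f_{abc} obeys [F_a, F_b] = i f_{abc} F_c.
# Assumption: SU(2), whose structure constants are the Levi-Civita symbol.

def levi_civita(a, b, c):
    """Totally antisymmetric symbol on indices 0, 1, 2 with eps(0, 1, 2) = +1."""
    return (a - b) * (b - c) * (c - a) // 2

def matmul(x, y):
    """Product of two square matrices stored as nested lists."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def adjoint_generators(f, dim):
    """Build the matrices F_a with elements (F_a)_{bc} = -i f(a, b, c)."""
    return [[[-1j * f(a, b, c) for c in range(dim)] for b in range(dim)]
            for a in range(dim)]

def max_algebra_violation(gens, f):
    """Largest |([F_a, F_b] - i f_{abc} F_c)_{rs}| over all a, b, r, s."""
    dim = len(gens)
    worst = 0.0
    for a in range(dim):
        for b in range(dim):
            ab = matmul(gens[a], gens[b])
            ba = matmul(gens[b], gens[a])
            for r in range(dim):
                for s in range(dim):
                    rhs = sum(1j * f(a, b, c) * gens[c][r][s] for c in range(dim))
                    worst = max(worst, abs(ab[r][s] - ba[r][s] - rhs))
    return worst

F = adjoint_generators(levi_civita, 3)
print(max_algebra_violation(F, levi_civita))
```

A vanishing result means the three 3 × 3 matrices built from the structure constants alone reproduce the Lie algebra, which is the content of the proof.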



2) […] "connectedness". For example, SU(2) and O(3) have the same structure constants, but SU(2) is simply connected and O(3) is doubly connected, and there is a one-to-two correspondence between the t[…]

3) For i[…] $J_\pm = J_x \pm iJ_y$, which satisfies

$$[J_+, J_-] = 2J_3, \qquad [J_3, J_\pm] = \pm J_\pm \tag{G.4}$$
If the Lagrangian or the equation of motion is invariant under SU(N) transformation, $F_k$ is a conserved quantity. The number of matrices that can be diagonalized simultaneously is called the rank of the group. The rank of SU(N) is N − 1. Matrices that are made of the $F_k$ and commute with all the $F_k$'s are called Casimir operators. Particles which can be identified as the representation vector of a group can be specified and classified by the eigenvalues of the Casimir operators and of the $F_i$'s that can be diagonalized simultaneously.

The representation matrix of the complex conjugate vector $\psi^*$ is given by $U^*$. Since its generator $-F_i^*$ also satisfies the same commutation relation Eq. (G.3), $\psi^*$ is also a representation vector and is called a conjugate representation. If $\psi$ represents a particle, then $\psi^*$ represents its antiparticle. Therefore, the quantum numbers of antiparticles are minus those of particles. If we define a covariant vector (with lower indices) by

$$\bar\psi \equiv (\psi^*)^T = (\xi^{1*}, \xi^{2*}, \cdots, \xi^{N*}) = (\xi_1, \xi_2, \cdots, \xi_N) \tag{G.10}$$

it transforms as

$$\xi_a \to \xi'_a = \xi_b\,(U^\dagger)^b{}_a$$

or

$$\bar\psi \to \bar\psi' = \bar\psi\,U^\dagger \tag{G.11b}$$

If a representation matrix U(A) of any element A can be expressed in the block-diagonal form

$$U(A) = \begin{pmatrix} U_1(A) & 0 \\ 0 & U_2(A) \end{pmatrix}$$

it is said to be reducible and written as

$$U = U_1 \oplus U_2$$



G.1.2

Direct Product

Suppose there are two representations U(A), U(B) of dimensionality $n_A$, $n_B$ which have elements $A = \{A_j\}$, $B = \{B_k\}$. Products $\xi_j\eta_k$ of the basis elements $\xi_j$, $\eta_k$ of each representation span an $(n_A \times n_B)$-dimensional space. The representation in this space, U(AB), is expressed as

$$U(AB)(\xi_j\eta_k) = \xi'_j\eta'_k = (U(A)\xi_j)(U(B)\eta_k)$$

and is called a direct product representation. It is written as

$$U(AB) = U(A) \otimes U(B) \tag{G.14}$$
In general the direct product is reducible.

Example G.1

The representations of the angular momentum $J = J_a$ and $J = J_b$ have $2J_a + 1$ and $2J_b + 1$ elements, respectively. The representation of the direct product $J_a \otimes J_b$ has $(2J_a + 1)(2J_b + 1)$ elements. It is reducible and decomposed to $J = J_1 \oplus J_2 \oplus \cdots \oplus J_n$, where $J_1 = J_a + J_b$, $J_2 = J_a + J_b - 1$, $\cdots$, $J_n = |J_a - J_b|$.
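The element count in this example can be verified directly: the dimensions of the irreducible pieces must add up to $(2J_a + 1)(2J_b + 1)$. A minimal sketch follows; the function names `decompose` and `dims_match` are illustrative.

```python
from fractions import Fraction

def decompose(ja, jb):
    """Return the list J = ja+jb, ja+jb-1, ..., |ja-jb| as Fractions."""
    ja, jb = Fraction(ja), Fraction(jb)
    j = ja + jb
    out = []
    while j >= abs(ja - jb):
        out.append(j)
        j -= 1
    return out

def dims_match(ja, jb):
    """Check (2ja+1)(2jb+1) == sum over J of (2J+1)."""
    lhs = (2 * Fraction(ja) + 1) * (2 * Fraction(jb) + 1)
    rhs = sum(2 * j + 1 for j in decompose(ja, jb))
    return lhs == rhs

# Two spin-1/2 systems: 2 x 2 = 3 + 1 states.
print(decompose(Fraction(1, 2), Fraction(1, 2)), dims_match(Fraction(1, 2), Fraction(1, 2)))
```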



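G.2.1

Structure Constants

It is customary to adopt Pauli matrices for SU(2) ($F_i = \sigma_i/2$) and Gell-Mann's $\lambda$ matrices for SU(3) ($F_i = \lambda_i/2$). Although SU(3) itself is a general mathematical framework, it will be convenient for us to identify the three light quarks $q^i = (u, d, s)$ as the SU(3) fundamental representation state vector and express them as

$$u = \xi^1 = \begin{pmatrix}1\\0\\0\end{pmatrix}, \qquad d = \xi^2 = \begin{pmatrix}0\\1\\0\end{pmatrix}, \qquad s = \xi^3 = \begin{pmatrix}0\\0\\1\end{pmatrix}$$

Since $u \leftrightarrow d$ [= SU(2) transformation] is a part of SU(3) transformation, we define $\lambda_i$ ($i = 1 \sim 3$), which operate only on u and d.

$$\lambda_1 = \begin{pmatrix}0&1&0\\1&0&0\\0&0&0\end{pmatrix}, \quad \lambda_2 = \begin{pmatrix}0&-i&0\\i&0&0\\0&0&0\end{pmatrix}, \quad \lambda_3 = \begin{pmatrix}1&0&0\\0&-1&0\\0&0&0\end{pmatrix}, \quad I_3 = \frac{\lambda_3}{2} = \begin{pmatrix}1/2&0&0\\0&-1/2&0\\0&0&0\end{pmatrix}$$

Then the $\lambda_i/2$ are isospin operators. In a similar manner, we define SU(2) operators "V spin" that act only on $u \leftrightarrow s$, and "U spin" that act only on $d \leftrightarrow s$.

$$\lambda_4 = \begin{pmatrix}0&0&1\\0&0&0\\1&0&0\end{pmatrix}, \quad \lambda_5 = \begin{pmatrix}0&0&-i\\0&0&0\\i&0&0\end{pmatrix}, \quad V_3 = \begin{pmatrix}1/2&0&0\\0&0&0\\0&0&-1/2\end{pmatrix}$$

$$\lambda_6 = \begin{pmatrix}0&0&0\\0&0&1\\0&1&0\end{pmatrix}, \quad \lambda_7 = \begin{pmatrix}0&0&0\\0&0&-i\\0&i&0\end{pmatrix}, \quad U_3 = \begin{pmatrix}0&0&0\\0&1/2&0\\0&0&-1/2\end{pmatrix}$$

We have defined nine matrices, but

$$I_3 - V_3 + U_3 = 0 \tag{G.17c}$$

so they are not independent. We define

$$\lambda_8 = \frac{2}{\sqrt3}(V_3 + U_3) = \frac{1}{\sqrt3}\begin{pmatrix}1&0&0\\0&1&0\\0&0&-2\end{pmatrix}$$

The eigenvalue of $\lambda_8/\sqrt3$ is called hypercharge and is denoted as "Y". The number of diagonal matrices is two, which is the rank of SU(3). The eight $\lambda_i$ satisfy the following commutation and anticommutation relations

$$\Big[\frac{\lambda_i}{2}, \frac{\lambda_j}{2}\Big] = i f_{ijk}\frac{\lambda_k}{2}$$
$$\Big\{\frac{\lambda_i}{2}, \frac{\lambda_j}{2}\Big\} = \frac13\delta_{ij} + d_{ijk}\frac{\lambda_k}{2} \tag{G.18b}$$

$f_{ijk}$ and $d_{ijk}$ are totally antisymmetric and totally symmetric, respectively, with respect to permutations of i, j, k, and have the values listed in Table G.1.

Table G.1 Structure constants of SU(3).

$f_{123} = 1$
$f_{147} = f_{246} = f_{257} = f_{345} = f_{516} = f_{637} = \frac12$
$f_{458} = f_{678} = \frac{\sqrt3}{2}$
$d_{146} = d_{157} = d_{256} = d_{344} = d_{355} = \frac12$
$d_{448} = d_{558} = d_{668} = d_{778} = -\frac{1}{2\sqrt3}$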
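The entries of Table G.1 can be recomputed from the Gell-Mann matrices using the trace identities $f_{ijk} = \mathrm{Tr}([\lambda_i, \lambda_j]\lambda_k)/4i$ and $d_{ijk} = \mathrm{Tr}(\{\lambda_i, \lambda_j\}\lambda_k)/4$, which follow from the (anti)commutation relations together with $\mathrm{Tr}[\lambda_k\lambda_l] = 2\delta_{kl}$. The sketch below uses plain nested lists; the helper names are illustrative.

```python
from math import sqrt

def gell_mann():
    """Return the eight Gell-Mann matrices as nested lists (index 0 is lambda_1)."""
    def m(entries):
        a = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
        for (r, c), v in entries.items():
            a[r][c] = v
        return a
    s = 1 / sqrt(3)
    return [
        m({(0, 1): 1, (1, 0): 1}),
        m({(0, 1): -1j, (1, 0): 1j}),
        m({(0, 0): 1, (1, 1): -1}),
        m({(0, 2): 1, (2, 0): 1}),
        m({(0, 2): -1j, (2, 0): 1j}),
        m({(1, 2): 1, (2, 1): 1}),
        m({(1, 2): -1j, (2, 1): 1j}),
        m({(0, 0): s, (1, 1): s, (2, 2): -2 * s}),
    ]

def mul(x, y):
    """3 x 3 matrix product."""
    return [[sum(x[i][k] * y[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def trace(x):
    return sum(x[i][i] for i in range(3))

LAM = gell_mann()

def f(i, j, k):
    """Antisymmetric structure constant f_ijk, indices running 1..8."""
    a, b, c = LAM[i - 1], LAM[j - 1], LAM[k - 1]
    comm = trace(mul(mul(a, b), c)) - trace(mul(mul(b, a), c))
    return (comm / 4j).real

def d(i, j, k):
    """Symmetric constant d_ijk, indices running 1..8."""
    a, b, c = LAM[i - 1], LAM[j - 1], LAM[k - 1]
    anti = trace(mul(mul(a, b), c)) + trace(mul(mul(b, a), c))
    return (anti / 4).real

print(f(1, 2, 3), f(1, 4, 7), f(4, 5, 8), d(1, 4, 6), d(4, 4, 8))
```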



Some Definitions of Terminology  Let $F_a(n)$ be an n-dimensional representation matrix,

$$\mathrm{Tr}[F_a(n)F_b(n)] = T_R(n)\,\delta_{ab} \tag{G.19a}$$
$$\sum_a F_a(n)^2 = C_2(n)\,\mathbf{1} \tag{G.19b}$$

By convention, the normalization of the SU(N) matrices is chosen to be $T_F = T_R(n = N) = 1/2$. With this choice the SU(N) matrices satisfy the relations

$$T_F = \frac12, \qquad C_F = \frac{N^2 - 1}{2N}, \qquad T_A = C_A = N \tag{G.20}$$

where the suffix F denotes fundamental ($n = N_F = N$) and A denotes adjoint ($n = N_A = N^2 - 1$). A general method to obtain $C_2(n)$ is given in Sect. G.2.3.

I, V, U Spin  Defining raising or lowering operators of I, V, U spins by

$$I_\pm = \frac{\lambda_1 \pm i\lambda_2}{2}, \qquad V_\pm = \frac{\lambda_4 \pm i\lambda_5}{2}, \qquad U_\pm = \frac{\lambda_6 \pm i\lambda_7}{2} \tag{G.21}$$

they satisfy

$$2I_3 = [I_+, I_-], \qquad 2V_3 = [V_+, V_-], \qquad 2U_3 = [U_+, U_-]$$

In the quark model, q = (u, d, s) has quantum numbers

$I_3(q) = (1/2, -1/2, 0)$
$V_3(q) = (1/2, 0, -1/2)$
$U_3(q) = (0, 1/2, -1/2)$
$Y(q) = (1/3, 1/3, -2/3)$

If we place the three quarks in an $I_3$-Y plane and represent each q as a vector from the origin (referred to as a weight diagram), then, when we make representations of direct products of the state vectors, multiplication by u means an increase of $I_3$, Y by 1/2, 1/3, respectively, which translates to an addition of the u-vector in the $I_3$-Y plane. Similarly, multiplication by d, s means the addition of the d, s vectors. Actions of $I_\pm$, $V_\pm$, $U_\pm$ are depicted in Figure G.2. For instance, the action of $I_+$ changes d to u, which means moving a point in the $I_3$-Y plane by 1 unit to the right. Conversely, $I_-$ changes u to d and moves a point by 1 unit to the left. Similarly, the action of $V_\pm$ is to change s (u) to u (s), or to move a point diagonally, as is depicted in Fig. G.2a.

G.2.2

Irreducible Representation of a Direct Product

a. Meson Octet  Let us consider a composite system, spin 0 mesons, made of quarks and antiquarks. In the terminology of SU(3) group representation, we are making direct products of $q^i$ and $\bar q_j$, or $\Phi^i{}_j = q^i\bar q_j$. By a unitary transformation

$$q \to q' = Uq, \qquad \bar q \to \bar q' = \bar q\,U^\dagger$$
$$\text{or}\quad [\Phi] = [\Phi^i{}_j] \to \Phi' = U\Phi U^\dagger$$

which means that the product $\mathrm{Tr}(\Phi) = \bar q q = \bar u u + \bar d d + \bar s s$ is invariant under SU(3) transformation.

Figure G.1 (a) (u, d, s) make a triplet 3 and have $(I_3, Y) = (1/2, 1/3), (-1/2, 1/3), (0, -2/3)$. They are expressed as vectors from the origin. (b) $(\bar u, \bar d, \bar s)$ make 3*. Vectors $(\bar u, \bar d, \bar s)$ have opposite quantum numbers to those of (u, d, s).

Figure G.2 (a), (b) $I_\pm$, $V_\pm$, $U_\pm$ move the state in the di[…]

In other words, $\eta' = \mathrm{Tr}[\Phi]/\sqrt3$ is a vector that transforms to itself and hence makes a singlet. The remaining eight vectors mix together by the transformation and cannot be separated further, and hence make a multiplet of 8 members called an octet. They are expressed as

$$P^i{}_j = \Phi^i{}_j - \frac13\delta^i_j\,\Phi^k{}_k = \Phi^i{}_j - \frac13\delta^i_j\,\mathrm{Tr}[\Phi]$$

The conclusion of the above discussion is that the direct product representation made of three q's and three $\bar q$'s can be decomposed into groups of one and eight members. We express it as

$$3 \otimes 3^* = 1 \oplus 8 \tag{G.26}$$

Let us derive the same equation from a slightly different point of view. Base vectors of the direct product $3 \otimes 3^*$ are nine vectors, which are obtained by multiplying u, d, s in 3 and $\bar u, \bar d, \bar s$ in 3*. Take the example of multiplying u by $\bar d$. As u has $(I_3, Y) = (1/2, 1/3)$ and $\bar d$ has $(1/2, -1/3)$, the product $u\bar d$ has $(I_3, Y) = (1, 0)$. In the $I_3$-Y plane, it is obtained by summing the two vectors u and $\bar d$, as shown in Fig. G.3a. Other operations are similar. To repeat, first start from the origin. We place the three vectors of (u, d, s) as in Fig. G.3b. Then add another three vectors of $\bar u, \bar d, \bar s$ from each of the three apexes, and we obtain a total of nine states. Three of them, $u\bar u$, $d\bar d$, $s\bar s$, come back to where they started, i.e. there is a triple point at $I_3 = Y = 0$. One combination of them, $\eta' = \mathrm{Tr}[\Phi]/\sqrt3$, forms a singlet, as we already explained (or according to a general rule, Sect. G.2.2.c) and should be separated. The result is $3 \otimes 3^* = 1 \oplus 8$, as depicted in Fig. G.3c.
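The vector-addition picture of $3 \otimes 3^*$ can be reproduced in a few lines. This is a minimal sketch that adds the $(I_3, Y)$ weights of each quark to those of each antiquark and counts multiplicities; the dictionary and function names are illustrative.

```python
from collections import Counter
from fractions import Fraction as Fr

# (I3, Y) of the quark triplet; antiquarks carry the opposite quantum numbers.
QUARKS = {"u": (Fr(1, 2), Fr(1, 3)),
          "d": (Fr(-1, 2), Fr(1, 3)),
          "s": (Fr(0), Fr(-2, 3))}
ANTIQUARKS = {name + "bar": (-i3, -y) for name, (i3, y) in QUARKS.items()}

def product_weights(first, second):
    """Add (I3, Y) vectors pairwise and count how often each weight occurs."""
    return Counter((a[0] + b[0], a[1] + b[1])
                   for a in first.values() for b in second.values())

weights = product_weights(QUARKS, ANTIQUARKS)
for (i3, y), mult in sorted(weights.items()):
    print(f"I3 = {i3}, Y = {y}: multiplicity {mult}")
```

Nine states land on seven distinct points, with the origin occupied three times: one of those three is the singlet and the other two belong to the octet.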
b. Baryon Octet and Decuplet  Baryons are made of three quarks. We first make two-quark composites made of nine product vectors $T^{ij} = \xi^i\xi^j$, and see their transformation properties. They are, in general, reducible, but we can separate them into antisymmetric and symmetric parts:

$$T^{ij} = \frac12(\xi^i\xi^j - \xi^j\xi^i) + \frac12(\xi^i\xi^j + \xi^j\xi^i) \equiv A^{ij} + S^{ij} \tag{G.27}$$

As the permutation symmetry does not change by the transformation, $A^{ij}$ and $S^{ij}$ transform to themselves. The nine product vectors have been decomposed into a three-dimensional antisymmetric group and a six-dimensional symmetric group. Namely

$$3 \otimes 3 = 3^* \oplus 6 \tag{G.28}$$

In the vectorial representation, they are expressed as in Figure G.4a. That the three $A^{ij}$ belong to 3* and not to 3 can be easily seen by looking at their quantum numbers, which are the same as those of $(\bar u, \bar d, \bar s)$. Mathematically, it can be shown as follows. Define

$$\eta_i = \epsilon_{ijk}A^{jk}, \qquad B = \xi^i\eta_i = \epsilon_{ijk}\,\xi^i\xi^j\xi^k$$

Then B is invariant under SU(3) transformation, because

$$B \to B' = \epsilon_{ijk}\,\xi'^i\xi'^j\xi'^k = \epsilon_{ijk}\,U^i{}_l U^j{}_m U^k{}_n\,\xi^l\xi^m\xi^n = \epsilon_{lmn}(\det U)\,\xi^l\xi^m\xi^n = \epsilon_{lmn}\,\xi^l\xi^m\xi^n = B \tag{G.31}$$

Figure G.3 (a) As u, $\bar d$ have $(I_3, Y) = (1/2, 1/3), (1/2, -1/3)$, $u\bar d$ can be obtained as the vector sum of the two states and has (1, 0) in the $I_3$-Y plane. (b) Applying the same operation to all combinations of $(u, d, s) \times (\bar u, \bar d, \bar s)$, a total of nine states are obtained. (c) $3 \otimes 3^* = 8 \oplus 1$.

Figure G.4 Multiplets. (a) $3 \otimes 3 = 6 \oplus 3^*$, (b) $6 \otimes 3 = 10 \oplus 8$.

From Eq. (G.31), the transformation matrix of the vector $\eta_i$ is shown to be $(U^{-1})^T = U^*$, which means $\eta_i$ belongs to 3*. Multiplying $T^{ij} = 3^* \oplus 6$ further by $\xi^k$ in 3, we get from the first term of Eq. (G.28) ($A^{ij}\xi^k$)

$$3^* \otimes 3 = 1 \oplus 8 \tag{G.32}$$

just like we obtained the meson singlet and octet. But unlike the meson case, the singlet term in Eq. (G.32) is a totally antisymmetric combination of $\xi$'s made from $A^{ij}\xi^k$, and the remaining 8 terms constitute an octet. Product vectors of the second term, $S^{ij}$ and $\xi^k$, can be separated into a totally symmetric part, of which there are 10 members, and the remaining 8 members, which form another octet. Namely

$$6 \otimes 3 = 10 \oplus 8 \tag{G.33}$$

To summarize, we have derived

$$3 \otimes 3 \otimes 3 = 1 \oplus 8 \oplus 8 \oplus 10 \tag{G.34}$$

symmetry property: A, MA, MS, S
Here, the singlet 1 consists of a totally antisymmetric combination and the decuplet 10 of totally symmetric combinations of $\xi$'s. The two octets 8 occupy the same positions in the $I_3$-Y plane, but they have different permutation symmetry. As we have seen, one is symmetric (MS = mixed symmetric) and the other is antisymmetric (MA = mixed antisymmetric) with respect to the first two indices. Weight diagrams of these multiplets are depicted in Figure G.4.

c. General Rules of the Irreducible Weight Diagrams  A way to decompose a product representation into a sum of irreducible representations is given by using the Young diagram (Sect. G.2.4). There is a theorem concerning the structure of irreducible weight diagrams.

Any weight diagram of irreducible representations must obey the following rules (proof omitted).

1. A weight diagram makes layers of convex polygons, generally irregular hexagons, with triangles in a special case (Figure G.4 and Figure G.5).

2. The multiplicity of points on the sides and apexes of the outermost layer is one.

3. The multiplicity of the points on the second layer is two, and it increases by one as we go into inner layers.

4. When the polygon of the inner layers becomes a triangle, the multiplicity stops increasing and stays at the same value no matter how deep we go into inner layers. As a special case, the multiplicity of all the points on a triangular weight diagram is one.

d. Baryon Wave Functions  To construct wave functions of 1, $8_{MA}$, $8_{MS}$, $10_S$, let us first consider those without the s-quark. Since u, d have isospin 1/2, the wave functions can be constructed using the Clebsch-Gordan coefficients of SU(2). As $1/2 \otimes 1/2 = 0 \oplus 1$ with wave functions

$$|0, 0\rangle = \frac{1}{\sqrt2}(ud - du)$$
$$|1, +1\rangle = uu, \qquad |1, 0\rangle = \frac{1}{\sqrt2}(ud + du), \qquad |1, -1\rangle = dd \tag{G.35}$$

they have the desired property of being antisymmetric and symmetric, respectively, under exchange of the first pair of quarks. Adding the third quark to the wave functions, we can make I = 1/2 from the first (I = 0), and I = 1/2 and 3/2 from the second (I = 1). Using Clebsch-Gordan coefficients we obtain

I = 1/2 MA:
$$|1/2, +1/2\rangle = \frac{1}{\sqrt2}(ud - du)u, \qquad |1/2, -1/2\rangle = \frac{1}{\sqrt2}(ud - du)d \tag{G.36a}$$

I = 1/2 MS:
$$|1/2, +1/2\rangle = \frac{1}{\sqrt6}\{2uud - (ud + du)u\}, \qquad |1/2, -1/2\rangle = -\frac{1}{\sqrt6}\{2ddu - (ud + du)d\}$$

I = 3/2:
$$|3/2, +3/2\rangle = uuu, \qquad |3/2, +1/2\rangle = \frac{1}{\sqrt3}(uud + udu + duu),$$
$$|3/2, -1/2\rangle = \frac{1}{\sqrt3}(udd + dud + ddu), \qquad |3/2, -3/2\rangle = ddd \tag{G.36c}$$

There is no totally antisymmetric combination. It can only be made using all three of u, d, s. Obviously, the I = 1/2 MA, I = 1/2 MS and I = 3/2 wave functions are members of $8_{MA}$, $8_{MS}$, $10_S$. Since totally symmetric wave functions of 10 can be made easily, we list other wave functions in Table G.2.
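The orthonormality and exchange symmetry of the $I_3 = +1/2$ wave functions in Eqs. (G.36) can be checked mechanically. In this sketch a state is a dictionary mapping an ordered quark string such as "uud" to its coefficient; the representation and the names `state`, `inner`, `swap12` are illustrative choices.

```python
from math import isclose, sqrt

def state(terms):
    """Build a state from (coefficient, quark string) pairs, combining like terms."""
    out = {}
    for c, q in terms:
        out[q] = out.get(q, 0.0) + c
    return out

def inner(a, b):
    """Inner product of two real-coefficient states."""
    return sum(c * b.get(q, 0.0) for q, c in a.items())

def swap12(psi):
    """Exchange the first two quarks in every term."""
    return {q[1] + q[0] + q[2]: c for q, c in psi.items()}

# (ud - du)u / sqrt(2)
p_MA = state([(1 / sqrt(2), "udu"), (-1 / sqrt(2), "duu")])
# {2uud - (ud + du)u} / sqrt(6)
p_MS = state([(2 / sqrt(6), "uud"), (-1 / sqrt(6), "udu"), (-1 / sqrt(6), "duu")])
# (uud + udu + duu) / sqrt(3)
delta_plus = state([(1 / sqrt(3), q) for q in ("uud", "udu", "duu")])

for name, psi in (("MA", p_MA), ("MS", p_MS), ("S", delta_plus)):
    print(name, "norm =", round(inner(psi, psi), 12))
print("MA.MS =", round(inner(p_MA, p_MS), 12),
      " MS.S =", round(inner(p_MS, delta_plus), 12))
```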

There is some ambiguity in constructing the neutral ($I_3 = Y = 0$) wave functions. As the isospin is a good quantum number, we make $\Sigma^0$ from $\Sigma^+$ by applying $I_-$, and $\Lambda^0$ by making it orthogonal to $\Sigma^0$.

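G.2.3

Tensor Analysis

a. Dimension of an Irreducible Representation  A general product vector or a tensor which is a product of p contravariant vectors $\xi^i$ and q covariant vectors $\xi_j$ is written as $T^{(p)}_{(q)} = T^{ij\cdots}_{kl\cdots}$ and transforms like

$$(T^{ij\cdots}_{kl\cdots})' = U^i{}_a U^j{}_b \cdots (U^\dagger)^c{}_k (U^\dagger)^d{}_l \cdots T^{ab\cdots}_{cd\cdots} \tag{G.37}$$

It is called a contravariant tensor of rank p if q = 0, a covariant tensor of rank q if p = 0, and a mixed tensor if $p, q \ne 0$. We write the whole collection of the tensors as (p, q). The weight diagram of (p, q) is an irregular hexagon which has upper and lower sides of length p and q (see Figure G.5). If p or q is 0, then the hexagon becomes a triangle.

Table G.2 Baryon wave functions with mixed symmetry $\phi_{MA}$, $\phi_{MS}$ and total antisymmetry $\phi_A$.

p:  $\phi_{MA} = \frac{1}{\sqrt2}(ud - du)u$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2uud - (ud + du)u\}$
n:  $\phi_{MA} = \frac{1}{\sqrt2}(ud - du)d$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2ddu - (ud + du)d\}$
$\Sigma^+$:  $\phi_{MA} = \frac{1}{\sqrt2}(us - su)u$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2uus - (us + su)u\}$
$\Sigma^0$:  $\phi_{MA} = \frac12\{(usd + dsu) - s(ud + du)\}$,  $\phi_{MS} = \frac{1}{\sqrt{12}}\{2(ud + du)s - (usd + dsu) - s(ud + du)\}$
$\Sigma^-$:  $\phi_{MA} = \frac{1}{\sqrt2}(ds - sd)d$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2dds - (ds + sd)d\}$
$\Lambda$:  $\phi_{MA} = \frac{1}{\sqrt{12}}\{2(ud - du)s + (usd - dsu) - s(ud - du)\}$,  $\phi_{MS} = \frac12\{(usd - dsu) + s(ud - du)\}$
$\Xi^0$:  $\phi_{MA} = \frac{1}{\sqrt2}(su - us)s$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2ssu - (us + su)s\}$
$\Xi^-$:  $\phi_{MA} = \frac{1}{\sqrt2}(sd - ds)s$,  $\phi_{MS} = \frac{1}{\sqrt6}\{2ssd - (ds + sd)s\}$
$\Lambda_1$:  $\phi_A = \frac{1}{\sqrt6}\{s(ud - du) + (dsu - usd) + (ud - du)s\}$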

We can prove that $\delta^i_j = \delta_{ij}$ is a mixed tensor of p = q = 1, and that $\epsilon_{ijk}$, $\epsilon^{ijk}$ are covariant and contravariant tensors of rank 3. Therefore, the product of a tensor by them is also a tensor. In general, the tensor (p, q) in Eq. (G.37) is reducible. By making contractions with $\delta^i_j$, $\epsilon_{ijk}$, $\epsilon^{ijk}$ we can make tensors of lower rank. For instance,

$$R^{j\cdots}_{b\cdots} = \delta^a_i\,T^{ij\cdots}_{ab\cdots} = T^{ij\cdots}_{ib\cdots} \tag{G.38a}$$
$$S^{k\cdots}_{ab\cdots} = \epsilon_{aij}\,T^{ijk\cdots}_{b\cdots} \tag{G.38b}$$

are tensors of (p − 1, q − 1) and (p − 2, q + 1). If the tensor before contraction is already traceless or symmetrized, the above operation gives zero, which means it is no longer reducible. Therefore, a tensor belonging to an irreducible representation is symmetrized with respect to upper or lower indices and traceless. Therefore, it satisfies the condition

$$\delta^a_i\,T^{ij\cdots}_{ab\cdots} = T^{ij\cdots}_{ib\cdots} = 0 \tag{G.39}$$

As the upper and lower indices are already symmetrized, all we need is to take a trace of the first index. We express the representation of this tensor space as $D^N(p, q)$, where N denotes the number of independent elements and is given by

$$N = \frac12(p + 1)(q + 1)(p + q + 2) \tag{G.40}$$

Figure G.5 The weight diagram of (p, q) is an irregular hexagon which has upper and lower sides of length p and q. The multiplicity of points at the periphery is one, and increases by one as one goes one step inside until one reaches a triangle. From there it stays constant. For multiple points, the configuration of the wave function is not unique and depends on the path one chooses. For a point of multiplicity one, the configuration is unique. For instance, the same […] can be obtained from […] through path A, B or C.

Proof: As the upper indices are symmetrized, we may arrange them in the order $(1 \cdots 1\, 2 \cdots 2\, 3 \cdots 3)$, with a ones, b twos and p − a − b threes. For a given a ($0 \le a \le p$), b can take the values $0 \sim p - a$, which means (p − a + 1) ways to choose, giving $N_u$ ways to arrange the upper indices.

$$N_u = \sum_{a=0}^{p}(p - a + 1) = \frac12(p + 1)(p + 2) \tag{G.42}$$

Likewise there are $N_d = (q + 1)(q + 2)/2$ ways to arrange the lower indices, giving a total of

$$N_u N_d = \frac14(p + 1)(p + 2)(q + 1)(q + 2) \tag{G.43}$$

There are as many traceless conditions Eq. (G.39) as $N_u N_d$ with p, q replaced with p − 1, q − 1, so the number of independent combinations is

$$N = \frac14(p + 1)(p + 2)(q + 1)(q + 2) - \frac14 p(p + 1)q(q + 1) = \frac12(p + 1)(q + 1)(p + q + 2) \tag{G.44}$$

□

A few examples:

$$D^1(0, 0) = 1, \quad D^3(1, 0) = 3, \quad D^3(0, 1) = 3^*,$$
$$D^8(1, 1) = 8, \quad D^6(2, 0) = 6, \quad D^6(0, 2) = 6^*, \tag{G.45}$$
$$D^{10}(3, 0) = 10, \quad D^{10}(0, 3) = 10^*, \cdots$$
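The counting argument in the proof can be compared with the closed formula Eq. (G.40). The sketch below implements both; the function names are illustrative.

```python
def dim_su3(p, q):
    """Closed formula N = (p+1)(q+1)(p+q+2)/2 of Eq. (G.40)."""
    return (p + 1) * (q + 1) * (p + q + 2) // 2

def symmetric_count(k):
    """Number of symmetric arrangements of k indices taking 3 values."""
    return (k + 1) * (k + 2) // 2

def dim_by_counting(p, q):
    """N_u N_d minus the number of trace conditions, as in Eqs. (G.42)-(G.44)."""
    total = symmetric_count(p) * symmetric_count(q)
    traces = symmetric_count(p - 1) * symmetric_count(q - 1) if p >= 1 and q >= 1 else 0
    return total - traces

for p, q in [(0, 0), (1, 0), (0, 1), (1, 1), (2, 0), (3, 0), (2, 2)]:
    print((p, q), dim_su3(p, q), dim_by_counting(p, q))
```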
Next, we will show that the multiplicity of the outermost layer is one. As

$$I_3 = \frac12[n(u) - n(d) - n(\bar u) + n(\bar d)]$$
$$Y = \frac13[n(u) + n(d) - 2n(s) - n(\bar u) - n(\bar d) + 2n(\bar s)] \tag{G.46}$$
$$p = n(u) + n(d) + n(s)$$
$$q = n(\bar u) + n(\bar d) + n(\bar s)$$

the value of n(q) is uniquely defined when one specifies p (or q), $I_3$ and Y, if q (or p) = 0.

b. Value of the Casimir Operators  How to derive $C_2 = C_2(n) = F^2 = \sum_k (F_k)^2$: First we rewrite the Casimir operator in terms of the I, V, U spin operators.

$$F^2 = \frac12\{I_+, I_-\} + I_3^2 + \frac12\{V_+, V_-\} + \frac12\{U_+, U_-\} + \frac34 Y^2$$
$$= (I_-I_+ + V_-V_+ + U_+U_-) + I_3^2 + 2I_3 + \frac34 Y^2 \tag{G.47}$$

As the polygon is always convex in the $I_3$-Y plane, for the element $\phi_{max}$ with the highest value of $I_3$

$$I_+\phi_{max} = V_+\phi_{max} = U_-\phi_{max} = 0 \tag{G.48}$$

Therefore, if we know the values of $I_3$ and Y of $\phi_{max}$, we obtain

$$\langle F^2\rangle = \langle I_3\rangle^2 + 2\langle I_3\rangle + \frac34\langle Y\rangle^2 \tag{G.49}$$

We can obtain the maximum value of $I_3$ by assuming all the p quarks are u and all the q antiquarks are $\bar d$, which gives $I_3 = (p + q)/2$. Then Y = (p − q)/3, giving

$$\langle F^2\rangle = \frac13(p^2 + pq + q^2) + (p + q) \tag{G.50}$$

Some examples are:

$C_2(1) = \langle F^2\rangle_1 = (p = q = 0) = 0$
$C_2(3) = \langle F^2\rangle_3 = (p = 1, q = 0) = \langle F^2\rangle_{3^*} = \frac43$
$C_2(6) = \langle F^2\rangle_6 = (p = 2, q = 0) = \frac{10}{3}$  (G.51)
$C_2(8) = \langle F^2\rangle_8 = (p = q = 1) = 3$
$C_2(10) = \langle F^2\rangle_{10} = (p = 3, q = 0) = 6$
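The step from Eq. (G.49) to Eq. (G.50) and the listed values can be reproduced with exact rational arithmetic. A minimal sketch, with illustrative function names:

```python
from fractions import Fraction

def casimir_su3(p, q):
    """Closed form (p^2 + pq + q^2)/3 + (p + q) of Eq. (G.50)."""
    return Fraction(p * p + p * q + q * q, 3) + (p + q)

def casimir_from_weight(p, q):
    """Eq. (G.49) evaluated at the highest-I3 state: I3 = (p+q)/2, Y = (p-q)/3."""
    i3 = Fraction(p + q, 2)
    y = Fraction(p - q, 3)
    return i3 ** 2 + 2 * i3 + Fraction(3, 4) * y ** 2

for label, (p, q) in {"1": (0, 0), "3": (1, 0), "6": (2, 0), "8": (1, 1), "10": (3, 0)}.items():
    print(f"C2({label}) = {casimir_su3(p, q)}")
```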

G.2.4

Young Diagram

We describe here, without proof, a convenient method to decompose an SU(N) product representation into a sum of irreducible representations.

G.2.4.1 Rules for Making Young Diagrams

1. The Young diagram makes a correspondence between the N-dimensional fundamental representation and a box, and between its complex conjugate representation N* and a column of N − 1 boxes.

[Diagram: a single box □ for N; a column of (N − 1) boxes for N*.]

For example

SU(2): □ = 2 or 2*
SU(3): □ = 3, a column of two boxes = 3*  (G.53)

2. Any irreducible representation is expressed by columns and rows of n boxes. The arrangement of the n boxes has to be convex toward the upper left. It means that if the number of boxes in the first row is $\lambda_1$, that of the second row $\lambda_2$, $\cdots$ and that of the last row $\lambda_m$, then

$$\lambda_1 \ge \lambda_2 \ge \cdots \ge \lambda_m, \qquad \lambda_1 + \lambda_2 + \cdots + \lambda_m = n \tag{G.54}$$

[Diagrams: examples of allowed arrangements and forbidden arrangements.]

3. Any permutation of boxes in a row is meant to be symmetric and that in a column to be antisymmetric. Because of this property, the maximum size of the column is limited to N − 1, because in SU(N) the maximum number of elements that are antisymmetric is N and there is only one combination to make N elements antisymmetric. A column of m (> N) boxes does not exist, and a column of N boxes is a one-dimensional representation that leaves the dimension the same, so we may omit it. Pictorial examples are, for N = 4,

[Diagrams for N = 4: a column of four boxes is a one-dimensional representation and may be omitted.]  (G.55)

4. For any representation, there is a conjugate representation. Combined together they make an N × N representation. For instance, for SU(3)

[Diagrams (a), (b), (c), (d), with (c) = (d) = (a).]  (G.56)

As we can see from (b), (c) is conjugate to (a), but applying the condition Eq. (G.55), (c) = (d), which coincides with (a); in other words, 8 = 8* in SU(3).

G.2.4.2 Decomposition of Product Representations of A and B

1. Insert 1 in all the boxes of the first row of the representation B, 2 in all the boxes of the second row, and continue to m in the m-th (last) row.

2. Starting from the first row of B, pick any number of boxes and attach them to A in all possible ways subject to the conditions

(i) Keep the condition "convex toward the upper left".
(ii) In any row, the number in any box must not decrease from left to right.
(iii) In any column, all the boxes must have a different number and it must increase downwards.
(iv) Draw a line starting from the box at the extreme right to the first box on the first row. Move to the second row and enter the box again at the extreme right and continue, until the line passes all the boxes. At any box on the line, count the number ($n_i$) of boxes on the path which contain the number i. The rule to obey is $n_i \ge n_{i+1}$.

Example G.2

[Diagrams: a correct prescription and incorrect prescriptions (a), (b), (c), (d).]

Figure G.6 Diagrams (a) and (b) do not satisfy condition (i). Diagram (c) does not satisfy condition (ii) and (d) does not satisfy (iv), because on a line entering the first row from the right and stopping at the first box, $n_1 (= 0) < n_2 (= 1)$.

[Young-diagram decomposition; diagrams not recoverable.]

$$8 \otimes 8 = 27 \oplus 10 \oplus 10^* \oplus 8 \oplus 8 \oplus 1$$

There are two Young diagrams corresponding to the same 8, but their permutation properties are different, hence they correspond to different representations.



G.2.4.3 Dimension of the Irreducible Representation

The dimension of a representation is given by

$$D = \frac{F}{H} \tag{G.58}$$

Calculation of F: Insert N in the box at the upper left corner. Increase the number by one as you move to the right (N + 1, N + 2, ...). Decrease it by one as you go downward; for instance, insert N − 1 in the first box in the second row. See Fig. G.7a. F is given as the product of all the numbers in the boxes.

Calculation of H: Draw a line in any row starting from the extreme right, bend it at any box with a right angle downward and exit from below. This is called a hook. Let h be the number of boxes in the path of the hook; H is given as the product of all possible h's (Fig. G.7b).

Figure G.7 The dimension of an irreducible representation is given by F/H. (a) Boxes filled row by row with N, N+1, N+2, N+3 / N−1, N, N+1 / N−2, N−1, N / N−3, so that F = N(N+1)(N+2)(N+3) × (N−1)N(N+1) × (N−2)(N−1)N × (N−3). (b) Hooks, with $H = h_1 \times h_2 \times h_3 \times h_4 \times \cdots$.

Example G.4

The dimension of the Young diagram with two boxes in the first row and one box in the second row in SU(3) is given in Figure G.8.

Figure G.8 Example of an SU(3) representation: the boxes are filled with 3, 4 / 2, so F = 3 × 4 × 2; the hooks give H = 3 × 1 × 1; D = F/H = 8.
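The F/H prescription translates directly into a short routine. In this sketch a Young diagram is given by its list of row lengths (an illustrative encoding, with the hypothetical function name `young_dimension`); the hook of a box counts the box itself, the boxes to its right and the boxes below it.

```python
def young_dimension(rows, n):
    """Dimension D = F/H of the SU(n) representation with the given row lengths."""
    F = 1
    H = 1
    for r, length in enumerate(rows):
        for c in range(length):
            # Box content: n at the upper left, +1 per step right, -1 per step down.
            F *= n + c - r
            arm = length - c - 1                            # boxes to the right
            leg = sum(1 for below in rows[r + 1:] if below > c)  # boxes below
            H *= arm + leg + 1
    return F // H

print(young_dimension([2, 1], 3))   # the octet of Example G.4
print(young_dimension([3], 3))      # totally symmetric three-box row
print(young_dimension([4, 2], 3))   # (p, q) = (2, 2)
```

For SU(3) the row lengths $(\lambda_1, \lambda_2)$ correspond to $(p, q) = (\lambda_1 - \lambda_2, \lambda_2)$, so the results can be compared with Eq. (G.40).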



Example G.5

SU(3):

$$3^* \otimes 3 = 1 \oplus 8$$
$$6 \otimes 3 = 10 \oplus 8$$

SU(4): Hadron multiplets that are composed of four quarks are easily constructed using the Young diagram.

Mesons: $4 \otimes 4^* = 1 \oplus 15$
Baryons: $4 \otimes 4 \otimes 4 = 4^* \oplus 20 \oplus 20 \oplus 20$  (G.59)
Symmetry: A, MS, MA, S

The symmetry property is the same as that of SU(3). The reason why what was a singlet in SU(3) is now a quartet is that there are four ways to make a totally antisymmetric three-quark wave function out of four quarks. The octet and decuplet in SU(3) have each become a 20 by adding baryons containing charm quarks.

SU(5):

$$5 \otimes 5^* = 1 \oplus 24$$
$$5 \otimes 10 = 40 \oplus 10^*$$
$$10 \otimes 10 = 5^* \oplus 45^* \oplus 50$$
$$10^* \otimes 10 = 1 \oplus 24 \oplus 75$$

SU(6):

Mesons: $6 \otimes 6^* = 1 \oplus 35$
Baryons: $6 \otimes 6 \otimes 6 = 20 \oplus 70 \oplus 70 \oplus 56$
Symmetry: A, MS, MA, S

Appendix H

Mass Matrix and Decaying States

The Decay Formalism

We describe here the time evolution of unstable particles in the Weisskopf-Wigner approximation [385, 386]. The arguments here closely follow those of [220, 283]. We consider a system that is described by a Hamiltonian

$$H = H_0 + H_W \tag{H.1}$$

where $H_0$ is the Hamiltonian of the strong interaction and free part of the weak interaction, while $H_W$ is that of the weak interaction that induces transitions and is assumed to be a small perturbation to $H_0$. The eigenstates of $H_0$ consist of i degenerate discrete states $|A_j\rangle$ and a series of continuum states $|B_n\rangle$ to which A can decay weakly. Since we want to apply the formalism to the neutral K meson system, we set i = 2, $|A_{1(2)}\rangle = |K^0(\bar K^0)\rangle$ and $|B_n\rangle = 2\pi, 3\pi, \pi^\pm l^\mp\nu, \cdots$.

$$H_0|A_j\rangle = E_0|A_j\rangle$$
$$H_0|B_n\rangle = E_n|B_n\rangle$$

As an initial state at t = 0 we consider a superposition of the discrete states $|A\rangle$.

$$|t = 0\rangle = a_1(0)|A_1\rangle + a_2(0)|A_2\rangle \tag{H.3}$$

The time evolution of the state vector $|t\rangle$ obeys the Schrödinger equation

$$i\frac{\partial}{\partial t}|t\rangle_S = (H_0 + H_W)|t\rangle_S \tag{H.4}$$

where the subscript S denotes that the states are those in the Schrödinger picture. They have time dependence $|t\rangle_S = e^{-iEt}|t = 0\rangle_S$. In the following discussions, it is more convenient to work in the interaction picture:

$$|t\rangle \equiv |t\rangle_I = e^{iH_0t}|t\rangle_S \tag{H.5}$$

For simplicity, we omit the subscript I for the interaction picture. The state $|t\rangle$ obeys the equation

$$i\frac{\partial}{\partial t}|t\rangle = H_W(t)|t\rangle, \qquad H_W(t) = e^{iH_0t}H_W e^{-iH_0t} \tag{H.6}$$
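We write

$$|t = 0\rangle = |t = 0\rangle_S$$
$$|t\rangle = a_1(t)|A_1\rangle + a_2(t)|A_2\rangle + \sum_n b_n(t)|B_n\rangle \tag{H.7}$$

Inserting Eq. (H.7) in Eq. (H.6) we obtain

$$i\frac{da_j}{dt} = \sum_{k=1,2}\langle A_j|H_W|A_k\rangle a_k(t) + \sum_n e^{i(E_0 - E_n)t}\langle A_j|H_W|B_n\rangle b_n(t) \tag{H.8a}$$
$$i\frac{db_m}{dt} = \sum_{k=1,2} e^{i(E_m - E_0)t}\langle B_m|H_W|A_k\rangle a_k(t) + \sum_n e^{i(E_m - E_n)t}\langle B_m|H_W|B_n\rangle b_n(t) \tag{H.8b}$$

Equations (H.8) are exact. We impose initial conditions such that

$$a_1(0) = a_1, \quad a_2(0) = a_2, \quad b_m(0) = 0 \tag{H.9}$$

Weisskopf-Wigner's first approximation (1) is that, with the initial conditions we imposed, the weak interactions in the final states are negligible, which is equivalent to discarding the second term in Eq. (H.8b). This means we regard pions and muons, which are the decay products of the K mesons, as stable. With this approximation, Eqs. (H.8) can be solved. We obtain

$$b_m(t) = -i\sum_k\int_0^t dt'\,e^{i(E_m - E_0)t'}\langle m|H_W|k\rangle a_k(t') \tag{H.10a}$$
$$a_j(t) = a_j - i\sum_k\int_0^t dt'\,\langle j|H_W|k\rangle a_k(t') - \sum_{k,n}\int_0^t dt'\int_0^{t'}dt''\,e^{i(E_0 - E_n)(t' - t'')}\langle j|H_W|n\rangle\langle n|H_W|k\rangle a_k(t'') \tag{H.10b}$$

where we abbreviated $|A_k\rangle, |B_n\rangle$ simply as $|k\rangle, |n\rangle$. Unlike Eqs. (H.8), Eqs. (H.10) are no longer mutually coupled. They can be solved by means of a Laplace transform. Define a new variable

$$\tilde a_j(s) = \int_0^\infty dt\,e^{-st}a_j(t) \tag{H.11}$$

insert it in Eq. (H.10b), and we obtain

$$\tilde a_j(s) = \frac{a_j}{s} - \frac{i}{s}\sum_k W_{jk}(s)\,\tilde a_k(s) \tag{H.12a}$$
$$W_{jk}(s) = \langle j|H_W|k\rangle + \sum_n\frac{\langle j|H_W|n\rangle\langle n|H_W|k\rangle}{E_0 - E_n + is} \tag{H.12b}$$

In matrix notation, with

$$a(0) = \begin{pmatrix}a_1\\a_2\end{pmatrix}, \quad a(t) = \begin{pmatrix}a_1(t)\\a_2(t)\end{pmatrix}, \quad \tilde a(s) = \begin{pmatrix}\tilde a_1(s)\\ \tilde a_2(s)\end{pmatrix}, \quad W(s) = [W_{jk}(s)] \tag{H.13}$$

Equations (H.12) can be cast in the form

$$[s + iW(s)]\,\tilde a(s) = a(0) \tag{H.14a}$$
$$\therefore\ \tilde a(s) = [s + iW(s)]^{-1}a(0) \tag{H.14b}$$

Then the inverse Laplace transform may be applied to yield

$$a(t) = \frac{1}{2\pi i}\int_{s_0 - i\infty}^{s_0 + i\infty}ds\,e^{st}\,\tilde a(s) = \frac{1}{2\pi i}\int_{s_0 - i\infty}^{s_0 + i\infty}ds\,\frac{e^{st}}{s + iW(s)}\,a(0)$$
$$= \frac{1}{2\pi i}\int_{-\infty}^{+\infty}dy\,\frac{e^{(iy + s_0)t}}{y - is_0 + W(iy + s_0)}\,a(0) \tag{H.15}$$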

The value of $s_0$ has to be chosen in such a way that the integration path in the complex s-plane lies to the right of all singularities of $[s + iW(s)]^{-1}$. From Eq. (H.12b) we see that W is analytic except at those points where

$$E_0 - E_n + is = 0 \tag{H.16}$$

which are on the imaginary axis. Therefore we can set $s_0 > 0$ and infinitesimally small, and rewrite it as $\epsilon$. Then the question is where the main contribution to the integral comes from.

For $H_W = 0$ we have W(s) = 0 and the singularity of the integrand is a pole at s = 0. In other words, the main contribution comes from the vicinity of s = 0. In this case we obtain

$$a(t) = a(0) \quad \text{for } t > 0 \tag{H.17}$$

as is necessary, since the states $|A_k\rangle$ are stable when $H_W = 0$. When $H_W$ is switched on, this pole moves, but for a small perturbation it will stay close to the imaginary axis. Moreover, W(s) is expected to vary appreciably only when s changes by an amount of $\sim E_0 - E_n$, i.e. over a range of energy determined by the strong interaction $H_0$. The second approximation (2) of the Weisskopf-Wigner approach is to replace W(s) in the vicinity of s = 0, Re s > 0, by a constant

$$W(s) \to W \equiv \lim_{\epsilon \to 0^+} W(\epsilon)$$

This is an approximation valid for time t very long compared with the strong interaction scale. Then, using the residue theorem to evaluate Eq. (H.15), we obtain

$$a(t) = e^{-iWt}\,a(0)$$

Returning to the Schrödinger picture and denoting $\Phi = e^{-iH_0t}a$, it satisfies the Schrödinger equation with the Hamiltonian replaced with the mass matrix

$$i\frac{\partial}{\partial t}\Phi(t) = \Lambda\Phi(t) \tag{H.20}$$

where the mass matrix $\Lambda$ is defined by

$$\Lambda = H_0 + W = M - \frac{i}{2}\Gamma \tag{H.21a}$$
$$M = \frac12(\Lambda + \Lambda^\dagger), \qquad \Gamma = i(\Lambda - \Lambda^\dagger) \tag{H.21b}$$

Both M and $\Gamma$ are hermitian but $\Lambda$ is not. The matrix elements are expressed as

$$M_{jk} = E_0\delta_{jk} + \langle j|H_W|k\rangle + P\sum_n\frac{\langle j|H_W|n\rangle\langle n|H_W|k\rangle}{E_0 - E_n} \tag{H.22a}$$
$$\Gamma_{jk} = 2\pi\sum_n\delta(E_0 - E_n)\langle j|H_W|n\rangle\langle n|H_W|k\rangle \tag{H.22b}$$

where P means principal value. The $\Gamma_{jk}$ are determined solely by matrix elements of $H_W$ for energy conserving processes, i.e. they are on-shell amplitudes.

Comparing Eqs. (H.22) with Eq. (6.38), we see that the mass matrix is essentially an unperturbed strong Hamiltonian plus the weak transition operator, i.e. $W = T_W$. To the first order in the weak interaction, it reduces to $H_0 + H_W$. If $H_W$ is a $\Delta S = 1$ operator, it does not induce mixing between $K^0$ and $\bar K^0$. The mixing is a second order effect, as expressed in Eqs. (H.22). The difference between $T_W$ here and the general transition operator T in Eq. (6.38) is that the former is valid only for times long compared to the strong interaction scale, and that only continuum states to which the discrete states can decay appear in the intermediate states.

Equation (H.20) looks like a Schrödinger equation but $\Lambda$ is not hermitian. Within a sub-space consisting of $K^0$ and $\bar K^0$, which are degenerate eigenstates of $H_0$, the transition operator $T_W = W$ gives modifications due to the weak interaction process. The antihermitian part of $T_W$ describes the exponential decay of the states. If all the $\Gamma_{jk}$'s were zero, the system would evolve without decay. But the off-diagonal elements of $T_W$ still cause mixing of $K^0$ and $\bar K^0$, and the eigenstates of the mass matrix are mixtures of both. Denoting the eigenstates and eigenvalues as $|\sigma\rangle$ and $\lambda_\sigma$, the time evolution of the states is given by

$$|\sigma(t)\rangle = |\sigma(0)\rangle e^{-i\lambda_\sigma t}, \qquad \lambda_\sigma = m_\sigma - \frac{i}{2}\Gamma_\sigma \tag{H.23}$$

and represents exponentially decaying states with mean life $\tau = 1/\Gamma_\sigma$.

Note that the exponential decay law is only an approximation. If W in Eq. (H.15) is expanded in a Taylor series, inclusion of higher powers of y results in corrections to the exponential decay law. It can be shown that the approximation fails both for very short and for very long times [146, 228, 229].



Appendix I 

Answers to the Problems 

Chapter 2: Particles and Fields 

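Problem 2.2: For a static solution $\partial\phi/\partial t = 0$. Applying $-\nabla^2 + \mu^2$ to $\phi(\mathbf r) = \int \frac{d^3p}{(2\pi)^3}\, e^{i\mathbf p\cdot\mathbf r}\tilde\phi(\mathbf p)$
and using $\delta^3(\mathbf r) = \int d^3p\, e^{i\mathbf p\cdot\mathbf r}/(2\pi)^3$, we have $\tilde\phi(\mathbf p) = g/(p^2 + \mu^2)$. Then

\[ \phi(r) = g\int\frac{d^3p}{(2\pi)^3}\frac{e^{i\mathbf p\cdot\mathbf r}}{p^2+\mu^2} = \frac{g}{(2\pi)^2}\int_0^\infty p^2\,dp\int_{-1}^{1}dz\,\frac{e^{iprz}}{p^2+\mu^2} \]

\[ = \frac{g}{4\pi r}\frac{1}{2\pi i}\int_{-\infty}^{\infty}dp\, e^{ipr}\left[\frac{1}{p-i\mu}+\frac{1}{p+i\mu}\right] = \frac{g}{4\pi r}e^{-\mu r} \]

In deriving the last equation, we have closed the contour through the upper half-circle
of the complex $p$ plane.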

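Problem 2.3: $r_W = \hbar c/m_W \simeq 200\ \mathrm{MeV}\cdot 10^{-13}\ \mathrm{cm}/81\ \mathrm{GeV} \simeq 2.5\times 10^{-16}$ cm.
Problem 2.4: Assuming the size of the Crab Nebula is $l = 400$ ly $= 400 \times 0.946 \times 10^{16}$ m,
$m_\gamma < \hbar c/l \simeq 2\times 10^{-7}\ \mathrm{eV\,m}/4\times 10^{18}\ \mathrm{m} \simeq 0.5\times 10^{-25}$ eV. Assuming
an intergalactic distance of 1 Mpc $\simeq 3.26\times 10^{6}$ ly, $m_\gamma < 0.6\times 10^{-29}$ eV.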
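The two estimates above are simple ratios of $\hbar c$ to a mass or a length, so they can be cross-checked numerically. The following is a minimal sketch; the function names and the value of $\hbar c$ in eV m are assumptions introduced for illustration, while the light-year conversion is the one quoted in the text.

```python
# Order-of-magnitude estimates: force range from a mediator mass, and a
# photon-mass bound from the size of a region over which static fields persist.
HBAR_C_EV_M = 1.973269804e-7   # hbar*c in eV*m (about 197.3 MeV fm), assumed value
LIGHT_YEAR_M = 0.946e16        # metres per light year, as used in the text


def range_cm(mass_ev):
    """Range r = hbar*c / m of a force mediated by a particle of mass m (eV), in cm."""
    return HBAR_C_EV_M / mass_ev * 100.0


def mass_bound_ev(length_m):
    """Upper bound m < hbar*c / l (in eV) for a field that extends over length l (m)."""
    return HBAR_C_EV_M / length_m


r_w = range_cm(81e9)                           # W boson with m_W ~ 81 GeV
m_crab = mass_bound_ev(400 * LIGHT_YEAR_M)     # l = 400 ly
m_mpc = mass_bound_ev(3.26e6 * LIGHT_YEAR_M)   # l = 1 Mpc
print(f"r_W ~ {r_w:.2e} cm, m_gamma < {m_crab:.2e} eV, m_gamma < {m_mpc:.2e} eV")
```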
Problem 2.8: I = $£ = 4jt£ (j^f = 2.45^^ (m) 

Chapter 3: Lorentz Invariance 

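Problem 3.1: Applying the Lorentz boost to the longitudinal component, with $\hat{\boldsymbol\beta} = \boldsymbol\beta/\beta$,

\[ x^{0\prime} = \gamma x^0 + \beta\gamma(\hat{\boldsymbol\beta}\cdot\mathbf x), \qquad x_\parallel' = \beta\gamma x^0 + \gamma(\hat{\boldsymbol\beta}\cdot\mathbf x) \]

\[ \therefore\ \mathbf x' = x_\parallel'\hat{\boldsymbol\beta} + \mathbf x_\perp = [\beta\gamma x^0 + \gamma(\hat{\boldsymbol\beta}\cdot\mathbf x)]\hat{\boldsymbol\beta} + [\mathbf x - \hat{\boldsymbol\beta}(\hat{\boldsymbol\beta}\cdot\mathbf x)] = \mathbf x + \boldsymbol\beta\gamma x^0 + (\gamma-1)(\hat{\boldsymbol\beta}\cdot\mathbf x)\hat{\boldsymbol\beta} \]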

Problem 3.2: (3.28e): A relativistic wave equation for a scalar field is given by the
Klein-Gordon equation. It has a plane wave solution $\sim e^{ik_\mu x^\mu}$, which should be
a Lorentz scalar. Then $k_\mu x^\mu$ is also a scalar, which in turn means $k^\mu$ is a Lorentz
vector. Another way of saying that $k_\mu x^\mu$ is a Lorentz scalar is that it is a phase, i.e. a
pure number which does not depend on the coordinate frame.
(3.28g): Since $(\rho c; \mathbf j)$ is a vector and $\partial_\mu\partial^\mu$ is a Lorentz scalar, the Maxwell equation
$\partial_\mu F^{\mu\nu} = j^\nu$ tells us that $(A^\mu = \phi/c; \mathbf A)$ must also be a vector.

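(3.28h): $\partial'_\mu = \dfrac{\partial}{\partial x'^\mu} = \dfrac{\partial x^\nu}{\partial x'^\mu}\dfrac{\partial}{\partial x^\nu}$,

which shows that $\partial_\mu$ is a covariant vector.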
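Problem 3.6:

\[ \eta'' = \frac12\ln\frac{1+\beta''}{1-\beta''} = \frac12\ln\frac{1+(\beta+\beta')/(1+\beta\beta')}{1-(\beta+\beta')/(1+\beta\beta')} = \frac12\ln\frac{(1+\beta)(1+\beta')}{(1-\beta)(1-\beta')} = \eta + \eta' \]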

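The additivity of rapidity under collinear boosts, which is the content of Problem 3.6, can be checked numerically. This is a small sketch; the function names and sample velocities are chosen here for illustration only.

```python
import math


def rapidity(beta):
    """Rapidity eta = (1/2) ln[(1 + beta) / (1 - beta)] for |beta| < 1."""
    return 0.5 * math.log((1.0 + beta) / (1.0 - beta))


def add_velocities(beta1, beta2):
    """Relativistic addition of two collinear velocities (units of c)."""
    return (beta1 + beta2) / (1.0 + beta1 * beta2)


beta, beta_prime = 0.6, 0.8
combined = add_velocities(beta, beta_prime)
# The two printed numbers should agree up to rounding.
print(rapidity(combined), rapidity(beta) + rapidity(beta_prime))
```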


- Spo^K + gpaKe" + gpa^d'l 



Chapter 4: Dirac Equation 

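Problem 4.5: $\bar\psi\gamma^\mu\psi = \psi^\dagger(1, \alpha_x, \alpha_y, \alpha_z)\psi$ and $e^{\alpha_x\eta_x/2} = \cosh(\eta_x/2) + \alpha_x\sinh(\eta_x/2)$.
The transformation property of the 0-th component is

\[ \bar\psi'\gamma^0\psi' = \psi^\dagger[\cosh(\eta_x/2) + \alpha_x\sinh(\eta_x/2)][\cosh(\eta_x/2) + \alpha_x\sinh(\eta_x/2)]\psi \]
\[ = \psi^\dagger[\{\cosh^2(\eta_x/2) + \sinh^2(\eta_x/2)\} + 2\sinh(\eta_x/2)\cosh(\eta_x/2)\,\alpha_x]\psi \]
\[ = \psi^\dagger[\cosh\eta_x + \sinh\eta_x\,\alpha_x]\psi = \gamma\,\bar\psi\gamma^0\psi + \beta\gamma\,\bar\psi\gamma^1\psi \]

\[ \bar\psi'\gamma^1\psi' = \psi^\dagger[\cosh(\eta_x/2) + \alpha_x\sinh(\eta_x/2)]\,\alpha_x\,[\cosh(\eta_x/2) + \alpha_x\sinh(\eta_x/2)]\psi \]
\[ = \psi^\dagger[\{\cosh^2(\eta_x/2) + \sinh^2(\eta_x/2)\}\alpha_x + 2\sinh(\eta_x/2)\cosh(\eta_x/2)]\psi \]
\[ = \psi^\dagger[\cosh\eta_x\,\alpha_x + \sinh\eta_x]\psi = \beta\gamma\,\bar\psi\gamma^0\psi + \gamma\,\bar\psi\gamma^1\psi \]

\[ \bar\psi'\gamma^2\psi' = \bar\psi\gamma^2\psi, \qquad \bar\psi'\gamma^3\psi' = \bar\psi\gamma^3\psi \]

which reproduces Eq. (3.62).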
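Problem 4.6:

\[ \bar u_r(p)u_s(p) = \xi_r^\dagger\left(\sqrt{p\cdot\sigma}\sqrt{p\cdot\bar\sigma} + \sqrt{p\cdot\bar\sigma}\sqrt{p\cdot\sigma}\right)\xi_s = 2m\,\xi_r^\dagger\xi_s = 2m\delta_{rs} \]

\[ u_r^\dagger(p)u_s(p) = \xi_r^\dagger(p\cdot\sigma + p\cdot\bar\sigma)\xi_s = 2E\delta_{rs} \]

Others can be proved similarly.

\[ \sum_r u_r(p)\bar u_r(p) = \sum_r \begin{bmatrix}\sqrt{p\cdot\sigma}\,\xi_r\\ \sqrt{p\cdot\bar\sigma}\,\xi_r\end{bmatrix}\left(\xi_r^\dagger\sqrt{p\cdot\bar\sigma},\ \xi_r^\dagger\sqrt{p\cdot\sigma}\right) = \begin{bmatrix} m & p\cdot\sigma\\ p\cdot\bar\sigma & m\end{bmatrix} = \not{p} + m \]

Alternative proof: Using $\Lambda_+ = \sum_r u_r(p)\bar u_r(p)$ and Eqs. (4.64), one can directly
prove Eqs. (4.67).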
Problem 4.10: 

(a) $\bar u(p')\boldsymbol\gamma u(p)$: We show for the case of $\gamma^1$. In the nonrelativistic limit $E = E'$,
$|\mathbf p| = |\mathbf p'| = p$, $\mathbf p' = p\mathbf n'$, $\mathbf p = p\mathbf n$.

=,</>/».<,, . Nft-v^.tfv^) [_"„ ;] [p|] 

= N 2 !,- 1 [p(ff-»'<7i + tr 1 er-n)]| s 
= §;'[(/ + P) + ^x(1' , -P)] 1 & 

where we have used Jp ■ a = *Jmexp[jt]a- n] = ,/ I±2* _ l^=jn a . n 

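(b) $\bar u(p')u(p)$: We keep terms only to lowest order in the 3-momenta. Thus up
to $O(p^2, p'^2)$

\[ p = (m, \mathbf p), \quad p' = (m, \mathbf p'), \quad p\cdot\sigma = m - \boldsymbol\sigma\cdot\mathbf p, \quad p\cdot\bar\sigma = m + \boldsymbol\sigma\cdot\mathbf p \qquad \text{(I.1)} \]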



u(p')»(i>) = ^(y/p'-a, yjp' ■ a) 



«1 

= r f [ v / (]''-%-ff) + j(p'-o)(p-o)]t 

= ^[^(yn + a-p')(m-a-p)+ ^r^V- 



■{p-v') , 



1 + 



■ (g ~ P') 



+ 0( P 2 ) |~2m^| r 
(c) $\bar u(p')\gamma^5 u(p)$: First use the fact that

u(p)= £ u(ph u(p')=u(p') 1 L 

■•■ W)y 5 *(p) = ^ Np') /V»(p) + «(i> V /»(*)] 

Rp-a-p'-a) 

[ -(p-a-p'.CT)J 

u(P , )y 5 (p'-/)u(p) = u(p')VyV-jO»(J>) 

= ^[y/p'-o{p-d-p'-d)Jp -a- y/p'-a{p-o-p' -a)y/p-d]^ 
Using (p ■ a)(p ■ a) = (p ■ a)(p ■ a) — m 2 , we can organize the above equation to 



= 2m£" V(P' • °){P ■ o) - J(p' ■ o)(p ■ o)]£ 
Up to here the equations are exact. Using the approximations of Eq. (I.1),
u(p')y 5 u(p) = l-u { p')yStf- f)u(p) 



LV -"■" / v im / j 

Problem 4.12: Using Eq. (4.120) 

ip L = (Ni + »N 2 ) L = f - c^*, Vr = (Ni + iN 2 ) R = i(j; - ff 2 £*) 
Then applying Eq. (4.122) and Eq. (4.123), we obtain 

■V)(a 2V *) 



i(9o-ff-V)VL = i(9o-ff-V)£-i(3 ( 
= mi(t]-o 2 £*) 






Similarly 

This is the Dirac equation Eq. (4.51) expressed in two-component spinors. 
Chapter 5: Field Quantization
Problem 5.1: 

1. Complex field: 



2. Electromagnetic field: 
d(d ll A v ) = 



= -mxjJ -> ^(iy^ + m) = 



Problem 5.3: 

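\[ \frac{d^4x'}{d^4x} = (\text{Jacobian of } x^\mu \to x'^\mu) = \det\left[\frac{\partial x'^\mu}{\partial x^\nu}\right] = \det(\delta^\mu_\nu + \partial_\nu\delta x^\mu) = 1 + \partial_\mu(\delta x^\mu) \]

\[ \therefore\ \int (d^4x' - d^4x)\,\mathcal L = \int d^4x\,\mathcal L\,\partial_\mu(\delta x^\mu) = \int d^4x\,[\partial_\mu(\mathcal L\,\delta x^\mu) - \partial_\mu\mathcal L\,\delta x^\mu] \]

The first term vanishes because it is a surface integral.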
Problem 5.5: The second term of Eq. (5.47) can be written as 

d fl TI!dx" = d fl Tl'e v pX P = d^T>' v e vp x p = [3,, {T> lv x p ) - T pr ]e„ p 

= - [d u (T" v x") - T pv ] t pv ^ - [3, (T»»x v ) - T v "] t vp 

= ]- [9,, (T' lv x p - T^x") - T pv + T vp ] e vp = 
Problem 5.6: Differentiating the Hamiltonian density

9t 3t 2 9t 3t 3t 

Changing the first term by use of the Euler equation 



It is a surface integral and vanishes at infinity. It also proves problem (2). Next 
differentiating IP 

t aV 30t a^ v 3^30 

3t 2 at at at at 

= (V 2 - m 2 )0'V0 + V0'(V 2 - m 2 )0 + V f -g-^-J 
Partially integrating the V 2 part 

= [a i t V0 + {0 1 -h- 0}]^=~oo - V(V0' • V0 - m 2 0'» 



They are all surface integrals and vanish. 
Problem 5.9: If the operator $e^{iap}$ satisfies

\[ O(q+a) = e^{iap}O(q)e^{-iap} \qquad \text{(I.2)} \]

then, for an infinitesimal displacement $\epsilon$,

\[ O(q) + \epsilon\frac{dO(q)}{dq} + \cdots = O(q) + i\epsilon[p, O(q)] + \cdots \]

By putting $O = q$, we obtain $[q, p] = i$. Namely, the space translation operator is the
momentum operator. The above equation means $\frac{d}{dq}O(q) = i[p, O]$. In the Lorentz
invariant formalism, the translation operator is defined by



Namely, the space part is defined with opposite sign compared to Eq. (I.2).
Therefore to move $x^i \to x^i + a^i$ we have to use $-P^i = P_i$. Hence

\[ \nabla_i O_H = -i[P^i, O_H] = i[P_i, O_H] \]
Problem 5.12: Inserting

J (2jr) 3 V2^ L k J 

»(y) = w t w = /i|vf[ c ''""" + '""-' , " + """""] 

[<P(x,t), *(*,*)] = jfy (2 ^)6 2i [ct,c ^ 

- [dj, ^] c -«*-»M+»(»*-« f )« _ [c ,, dj|]e i*^l,^<(«*-«. f ). 

+ [<,<it] 6 -<»«-<*rH<«»-«. f )«) 

The third and fourth term vanish and the first and second term give fe = p by 
integration 

= /" _if±_ w Hx-Y) + e -*(«-r) } = j 5 3 (x _ . 
7 2(2jt)^ ; * r ' 

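Problem 5.14: Prove by induction. We ignore the normalization factor since it does
not change before and after the calculations. Using the commutation relation
$[a_k, a_p^\dagger] = \delta_{pk}$, we can prove for the one-particle state

\[ H|p\rangle = N\sum_k\omega_k a_k^\dagger a_k a_p^\dagger|0\rangle = N\sum_k\omega_k a_k^\dagger(\delta_{pk} + a_p^\dagger a_k)|0\rangle = \omega_p|p\rangle \]

Next, operate with the Hamiltonian on $|n\rangle = |p_1, p_2, \cdots\rangle = a^\dagger_{p_1}|p_2, \cdots\rangle = a^\dagger_{p_1}|n-1\rangle$
and assume $H|n-1\rangle = (\omega_2 + \cdots + \omega_n)|n-1\rangle$. Then

\[ H|n\rangle = \sum_k\omega_k a_k^\dagger a_k a^\dagger_{p_1}|n-1\rangle = \sum_k\omega_k a_k^\dagger(\delta_{kp_1} + a^\dagger_{p_1}a_k)|n-1\rangle \]
\[ = \omega_1 a^\dagger_{p_1}|n-1\rangle + a^\dagger_{p_1}H|n-1\rangle \]
\[ = \omega_1|n\rangle + (\omega_2 + \cdots)a^\dagger_{p_1}|n-1\rangle \]
\[ = (\omega_1 + \omega_2 + \cdots)|n\rangle \]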

Chapter 6: Scattering Matrix 
Problem 6.1: Using 

(/|H/( tl )H[(t 2 )|i> = J2(f\ H 's\n) e - i{E "- E ^- eM 
The expectation value of the second order S-matrix becomes

(J|S|i> = (-i) 2 /_ oo dt 1 ^dt 2 (/|H[ s (t 1 )H[ s (t 2 )|i> 
= -^(f\H IS \n)(n\H IS \i) 

x r dh r dt 2 e-'^- E f^-^-'^- E ^-^ 

= -J^(f\H IS \n)(n\H IS \i) 

l di > e " JrEi),, " £|,ll i iT^*-^« 

W ,Z ^ Ei-E n + ie 

where we changed the integration variable $t_2 \to \tau = t_1 - t_2$ in going from the
second to the third line.

Problem 6.2: From Eq. (6.51), dE^ = fiydcosO*, dQ* = d cos 6*d<p*. There- 
fore ^r OC J^JW OC j|^- 

Problem 6.3: 

\[ s = (p_1 + p_2)^2 = p_1^2 + p_2^2 + 2(p_1\cdot p_2) \]
\[ t = (p_1 - p_3)^2 = p_1^2 + p_3^2 - 2(p_1\cdot p_3) \]
\[ u = (p_1 - p_4)^2 = p_1^2 + p_4^2 - 2(p_1\cdot p_4) \]

Using $p_i^2 = m_i^2$ and $p_2 - p_3 - p_4 = -p_1$, we obtain $s + t + u = \sum_i m_i^2$.
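The sum rule for the Mandelstam variables can be verified on an explicit two-body event. The sketch below builds a centre-of-mass configuration that conserves four-momentum by construction; all function names, masses and the scattering angle are illustrative assumptions, not values from the text.

```python
import math


def kallen(a, b, c):
    """Kallen triangle function lambda(a, b, c)."""
    return a * a + b * b + c * c - 2.0 * (a * b + b * c + c * a)


def minkowski_sq(p):
    """Invariant square of a four-vector (E, px, py, pz), metric (+,-,-,-)."""
    return p[0] ** 2 - p[1] ** 2 - p[2] ** 2 - p[3] ** 2


def combine(p, q, sign):
    """Return p + sign * q component by component."""
    return tuple(a + sign * b for a, b in zip(p, q))


def mandelstam(m1, m2, m3, m4, p_in, theta):
    """s, t, u for 1 + 2 -> 3 + 4 in the CM frame with scattering angle theta."""
    e1, e2 = math.hypot(m1, p_in), math.hypot(m2, p_in)
    s_cm = (e1 + e2) ** 2
    # Final-state CM momentum fixed by energy conservation
    p_out = math.sqrt(kallen(s_cm, m3 ** 2, m4 ** 2)) / (2.0 * math.sqrt(s_cm))
    p1 = (e1, 0.0, 0.0, p_in)
    p2 = (e2, 0.0, 0.0, -p_in)
    p3 = (math.hypot(m3, p_out), p_out * math.sin(theta), 0.0, p_out * math.cos(theta))
    p4 = (math.hypot(m4, p_out), -p3[1], 0.0, -p3[3])
    s = minkowski_sq(combine(p1, p2, +1))
    t = minkowski_sq(combine(p1, p3, -1))
    u = minkowski_sq(combine(p1, p4, -1))
    return s, t, u


masses = (0.14, 0.94, 0.5, 1.2)
s, t, u = mandelstam(*masses, p_in=2.0, theta=0.7)
print(s + t + u, sum(m * m for m in masses))
```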

Problem 6.4: Using the flux factor F = 4£ 1 £ 2 Vi 2 = 4EiM(pi/ Ei) = Ap\M and 

the phase space volume dLIPS, the cross section is given by 

da = \M fi \ 2 dLIPS LAB /F 

1 ' V a V Wl Fl F4) p (2 jr) 6 4E 3 E 4 

/ d 3 p 4 gives 1, at the same time changing the 
/ d 3 p ir we put the content of the d function as 



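\[ G = E_3 + E_4 = p_3 + \sqrt{(\mathbf p_1 - \mathbf p_3)^2 + M^2} = p_3 + \sqrt{p_1^2 + p_3^2 - 2p_1p_3\cos\theta + M^2} \]

\[ \frac{\partial G}{\partial p_3} = 1 + \frac{p_3 - p_1\cos\theta}{E_4} = \frac{E_1 + M - p_1\cos\theta}{E_4} = \frac{M + p_1(1-\cos\theta)}{E_4} \]

Now we introduce $q = p_1 - p_3 = p_4 - p_2$ and use $p_2^2 = p_4^2 = M^2$, $p_1^2 = p_3^2 = 0$.

\[ p_4^2 = (p_4 - p_2 + p_2)^2 = (q + p_2)^2 = q^2 + 2(p_2\cdot q) + p_2^2 \]
\[ \therefore\ -q^2 = 2(p_2\cdot q) = 2M(E_1 - E_3) = 2M(p_1 - p_3) \]
\[ -q^2 = -(p_1 - p_3)^2 = 2p_1p_3(1-\cos\theta) \]
\[ \therefore\ p_1(1-\cos\theta) = M(p_1 - p_3)/p_3 \]

Inserting the above equalities, we obtain $\partial G/\partial p_3 = p_1M/(p_3E_4)$ and

\[ \frac{dLIPS_{LAB}}{F} = \frac{1}{4p_1M}\left(\frac{\partial G}{\partial p_3}\right)^{-1}\frac{p_3^2\,d\Omega_{LAB}}{16\pi^2E_3E_4} = \frac{p_3^2\,d\Omega_{LAB}}{64\pi^2(Mp_1)^2} \]

\[ \therefore\ \left.\frac{d\sigma}{d\Omega}\right|_{LAB} = \left(\frac{p_3}{p_1}\right)^2\frac{|M_{fi}|^2}{64\pi^2M^2} \]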
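The closed form $\partial G/\partial p_3 = p_1M/(p_3E_4)$ used in Problem 6.4 can be compared with a numerical derivative of $G$. The sketch assumes a massless projectile scattering off a target of mass $M$ at rest, as in the answer; the helper names and sample numbers are hypothetical.

```python
import math


def total_final_energy(p3, p1, cos_theta, mass):
    """G(p3) = E3 + E4 for a massless particle 3 and a recoiling target of given mass."""
    return p3 + math.sqrt(p1 ** 2 + p3 ** 2 - 2.0 * p1 * p3 * cos_theta + mass ** 2)


def p3_on_shell(p1, cos_theta, mass):
    """Outgoing momentum fixed by energy conservation, G(p3) = p1 + mass."""
    return p1 * mass / (mass + p1 * (1.0 - cos_theta))


def dG_dp3_closed(p1, cos_theta, mass):
    """Closed form p1*M / (p3*E4) evaluated at the on-shell point."""
    p3 = p3_on_shell(p1, cos_theta, mass)
    e4 = p1 + mass - p3
    return p1 * mass / (p3 * e4)


def dG_dp3_numeric(p1, cos_theta, mass, h=1e-6):
    """Central finite difference of G with respect to p3 at the on-shell point."""
    p3 = p3_on_shell(p1, cos_theta, mass)
    up = total_final_energy(p3 + h, p1, cos_theta, mass)
    down = total_final_energy(p3 - h, p1, cos_theta, mass)
    return (up - down) / (2.0 * h)


print(dG_dp3_closed(3.0, 0.3, 0.938), dG_dp3_numeric(3.0, 0.3, 0.938))
```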

Problem 6.6: 

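1. Proof of Eq. (6.142a). Using $\gamma^5\gamma^5 = 1$, $\gamma^5\gamma^\mu = -\gamma^\mu\gamma^5$,

\[ \mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] = \mathrm{Tr}[\gamma^5\gamma^5\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] = \mathrm{Tr}[\gamma^5\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}\gamma^5] \]
\[ = -\mathrm{Tr}[\gamma^5\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^5\gamma^{\mu_{2n+1}}] = (-1)^{2n+1}\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] \]
\[ = -\mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}], \qquad \therefore\ \mathrm{Tr}[\gamma^{\mu_1}\gamma^{\mu_2}\cdots\gamma^{\mu_{2n+1}}] = 0 \]

2. Proof of Eq. (6.142b). Since $\mathrm{Tr}[A] = \mathrm{Tr}[A^T]$ and $C\gamma^{\mu T}C^{-1} = -\gamma^\mu$, we insert $C^{-1}C$
between all $\gamma$'s.

\[ \mathrm{Tr}[\gamma^1\gamma^2\cdots\gamma^n] = \mathrm{Tr}[\gamma^{nT}\cdots\gamma^{2T}\gamma^{1T}] = \mathrm{Tr}[C\gamma^{nT}C^{-1}\cdots C\gamma^{1T}C^{-1}] = (-1)^n\,\mathrm{Tr}[\gamma^n\cdots\gamma^2\gamma^1] \]

3. Proof of Eq. (6.142c). Take the trace of both sides of $\gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu}$
and use $\mathrm{Tr}[\gamma^\mu\gamma^\nu] = \mathrm{Tr}[\gamma^\nu\gamma^\mu]$, which gives $\mathrm{Tr}[\gamma^\mu\gamma^\nu] = 4g^{\mu\nu}$.

4. Proof of Eq. (6.142d). The first step is

\[ \mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = \mathrm{Tr}[\{2g^{\mu\nu} - \gamma^\nu\gamma^\mu\}\gamma^\rho\gamma^\sigma] \]

Using Eq. (6.142c)

\[ = 8g^{\mu\nu}g^{\rho\sigma} - \mathrm{Tr}[\gamma^\nu\gamma^\mu\gamma^\rho\gamma^\sigma] = 8g^{\mu\nu}g^{\rho\sigma} - \mathrm{Tr}[\gamma^\nu\{2g^{\mu\rho} - \gamma^\rho\gamma^\mu\}\gamma^\sigma] \]
\[ = 8g^{\mu\nu}g^{\rho\sigma} - 8g^{\mu\rho}g^{\nu\sigma} + \mathrm{Tr}[\gamma^\nu\gamma^\rho\gamma^\mu\gamma^\sigma] \]
\[ = 8g^{\mu\nu}g^{\rho\sigma} - 8g^{\mu\rho}g^{\nu\sigma} + \mathrm{Tr}[\gamma^\nu\gamma^\rho\{2g^{\mu\sigma} - \gamma^\sigma\gamma^\mu\}] \]
\[ = 8g^{\mu\nu}g^{\rho\sigma} - 8g^{\mu\rho}g^{\nu\sigma} + 8g^{\mu\sigma}g^{\nu\rho} - \mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] \]

In deriving the last equation, we used invariance of the trace under cyclic permutation.

\[ \therefore\ \mathrm{Tr}[\gamma^\mu\gamma^\nu\gamma^\rho\gamma^\sigma] = 4g^{\mu\nu}g^{\rho\sigma} - 4g^{\mu\rho}g^{\nu\sigma} + 4g^{\mu\sigma}g^{\nu\rho} \]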
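The trace theorems of Problem 6.6 are representation independent, so they can be checked by brute force in any explicit representation. The sketch below uses the Dirac representation of the $\gamma$ matrices, which is an assumption made here for convenience; all helper names are illustrative.

```python
# Pauli matrices and 2x2 building blocks
S1 = [[0, 1], [1, 0]]
S2 = [[0, -1j], [1j, 0]]
S3 = [[1, 0], [0, -1]]
I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]


def neg(m):
    return [[-x for x in row] for row in m]


def block(a, b, c, d):
    """Assemble a 4x4 matrix from 2x2 blocks [[a, b], [c, d]]."""
    return [a[i] + b[i] for i in range(2)] + [c[i] + d[i] for i in range(2)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def trace_of_product(matrices):
    prod = [[1 if i == j else 0 for j in range(4)] for i in range(4)]
    for m in matrices:
        prod = matmul(prod, m)
    return sum(prod[i][i] for i in range(4))


# Dirac representation: gamma^0 = diag(1, 1, -1, -1), gamma^i = [[0, s_i], [-s_i, 0]]
GAMMA = [block(I2, Z2, Z2, neg(I2))] + [block(Z2, s, neg(s), Z2) for s in (S1, S2, S3)]
G_DIAG = [1, -1, -1, -1]


def metric(mu, nu):
    return G_DIAG[mu] if mu == nu else 0


def max_trace_deviation():
    """Largest violation of the 3-gamma and 4-gamma trace formulas over all indices."""
    worst = 0.0
    for m in range(4):
        for n in range(4):
            for r in range(4):
                odd = trace_of_product([GAMMA[m], GAMMA[n], GAMMA[r]])
                worst = max(worst, abs(odd))
                for s in range(4):
                    lhs = trace_of_product([GAMMA[m], GAMMA[n], GAMMA[r], GAMMA[s]])
                    rhs = 4 * (metric(m, n) * metric(r, s)
                               - metric(m, r) * metric(n, s)
                               + metric(m, s) * metric(n, r))
                    worst = max(worst, abs(lhs - rhs))
    return worst


print(max_trace_deviation())
```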
Problem 6.7: Using Eq. (6.148)

u s {pf)y°Ur(Pi) = u s (p f yu r { P i) 

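where we have used $\mathbf p = p\hat{\mathbf p}$, $p_f = p_i = p$, $E_f = E_i = E$. The first term is the
spin non-flip term and the second term is the spin flip term. Since $(\hat{\mathbf p}_f\cdot\hat{\mathbf p}_i) = \cos\theta$ and
$|\hat{\mathbf p}_f\times\hat{\mathbf p}_i| = \sin\theta$,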



J2 (A+ B) = 2x(E+ m) 2 1 + 



(E + m) 2 



(E + m)2 



Factor 2 arises because each term is counted twice: for the non-flip term $r = s$,
and for the flip term $r = 1, s = 2$ and $r = 2, s = 1$. Using $\cos\theta = 1 - 2\sin^2(\theta/2)$



"[('♦<»£ 



Dividing by 2 to take the average over the initial states, and also by $4E^2$ for the particle
normalization, recovers Eq. (6.147).

Problem 6.8: Considering only the vacuum in the intermediate stage 

(f\ie[^(x)d"<p(x)-d"^(x)<p(x)]\i) 

= ie{(f\(r(x)\0)(0\d u cf>(x)\i)- (f\d"^(x)\0)(0\<p(x))\i)} 
= e [pf e^Pi-Pf)— W-W + p * fe -Hn-Pf)-*+H*-*f)t} 

The first and the second term give the same contribution after integration over $x$.
Comparing with Eq. (6.135), we see that only $\bar u_s\gamma^\mu u_r$ is replaced with $2p^\mu$. Therefore,
the only modification in the final cross section is to replace $(1 - v^2\sin^2\frac{\theta}{2})$ with 1.
Chapter 7: QED

Problem 7.1: Noting q = p 3 — p 1; q 2 = pj + pj — 2(p 1 • p 3 ) = 2{m 2 — (pi • p 3 )} 
-^I''" = (p 3M - p Ul ) Vp^pl + p"pl + yg'"" I 

= \{pl ~ (Pi ■ Pi)}pl + {(Pi ■ Pi) ~ PiiPl + \<A 

= yK-yK + y<7 l ' = 

Problem 7.2: The invariant matrix squared is given by 

\M\ 2 = -4-[(t - M 2 ) 2 + (u - M 2 ) 2 + 2sM 2 ] 

This can be derived from Eq. (7.17) and Eq. (7.16b) by crossing $s \to u$, $t \to s$,
$u \to t$ and neglecting the electron mass $m^2$. In CM we have

\[ s = 4E^2, \quad u = M^2 - 2E^2(1 + \beta\cos\theta), \quad t = M^2 - 2E^2(1 - \beta\cos\theta) \]



—^\M\ 2 = [-2E 2 (1 - /3 cos 6)} 2 + {-2E 2 (1 + ft cos d)] 2 + 8M 2 E 2 
= 8E 4 (1 + I3 2 cos 2 6) + 8M 2 £ 2 
= 8E 2 (E 2 + p 2 cos 2 6 + E 2 - P 2 ) 
= s 2 [l-( y S 2 /2)sin 2 0] 
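The algebra of Problem 7.2 rests on the identity $(t-M^2)^2 + (u-M^2)^2 + 2sM^2 = s^2[1 - (\beta^2/2)\sin^2\theta]$ in the CM frame with the electron mass neglected. A minimal numerical check follows; the overall coupling factor is deliberately left out, and the function names and sample values are assumptions for illustration.

```python
import math


def bracket_from_invariants(energy, mass, cos_theta):
    """(t - M^2)^2 + (u - M^2)^2 + 2 s M^2 from the CM expressions of s, t, u."""
    beta = math.sqrt(energy ** 2 - mass ** 2) / energy
    s = 4.0 * energy ** 2
    t = mass ** 2 - 2.0 * energy ** 2 * (1.0 - beta * cos_theta)
    u = mass ** 2 - 2.0 * energy ** 2 * (1.0 + beta * cos_theta)
    return (t - mass ** 2) ** 2 + (u - mass ** 2) ** 2 + 2.0 * s * mass ** 2


def bracket_closed_form(energy, mass, cos_theta):
    """s^2 [1 - (beta^2 / 2) sin^2(theta)]."""
    beta_sq = (energy ** 2 - mass ** 2) / energy ** 2
    s = 4.0 * energy ** 2
    return s ** 2 * (1.0 - 0.5 * beta_sq * (1.0 - cos_theta ** 2))


print(bracket_from_invariants(1.5, 0.1057, 0.4), bracket_closed_form(1.5, 0.1057, 0.4))
```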



Pf 

3-^ 



dQ ~ Pi Mjri~s lMl 



*■ cttot = 



^-M''<^' + '£&^*] 






Using p 2 = m 2 , ]t Jt = k 2 = 0, /u(j») = ««({>), u(p') f = mu(p'), p - V ■■ 
p'-k. 
Each term in [• ■ ■ ] gives 

„ t/+k+m „ 2(p-k)- jitZ+mk 



-2(p'-k) 
The first and the second terms compensate each other. 
Problem 7.5: We calculate interference terms only. Putting $M = M_1 + M_2$,

= ^[//(/+ Wy" /y*(/- /Oy„] 

Using y„ ^ ? £y" = -2 (Z £ y„ A" Jf y " = 4(A • B) 

l«Wi«W*| = -^-(p + k) ■ (p - k')Tr[/ /] 
noting (p • p') = -t/2, (fe • V) = -(t + Q 2 )/2 

\MiM*\ = — t{s +t + u + 2Q 2 } = — tQ 2 
Next we prove \M 2 M*\ = |^M^*|. Since 

\m 2 m*\ = ^-Tr[ y/i (j/- /Or* &"&+ #)y" /] 

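We only need to prove

\[ \mathrm{Tr}[\gamma^1\gamma^2\cdots\gamma^n] = \mathrm{Tr}[\gamma^n\cdots\gamma^2\gamma^1] \]

for even $n$. Since

\[ \mathrm{Tr}[A] = \mathrm{Tr}[A^T], \qquad C\gamma^{\mu T}C^{-1} = -\gamma^\mu, \]

we insert $C^{-1}C$ between all $\gamma$'s.

\[ \mathrm{Tr}[\gamma^1\gamma^2\cdots\gamma^n] = \mathrm{Tr}[\gamma^{nT}\cdots\gamma^{2T}\gamma^{1T}] = \mathrm{Tr}[C\gamma^{nT}C^{-1}\cdots C\gamma^{1T}C^{-1}] = (-1)^n\,\mathrm{Tr}[\gamma^n\cdots\gamma^2\gamma^1] \]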

Problem 7.6: Derivation of the Klein-Nishina formula. 
From the assumptions 

#/=-/#, r/ = -/r. it=-ti, it' = -t'i 

(/ + m) /u(p) = /(-/ + mjw^) = , (/+ m) /u(p) = 
As we do not take photon sum 



" 2 L(2k-P) 2 (2k-p)(2k'-p) (2k'-p)(2k-p) (2V-p)*\ 
IV can be obtained from I by replacing $\epsilon \leftrightarrow \epsilon'$, $k \leftrightarrow -k'$. II = III for the
same reason as in Problem 7.4. We only need to calculate I and II. As the rest is a
straightforward but lengthy calculation, we only show the result for each term.

\[ \mathrm{I} = 8(k\cdot p)[(k'\cdot p) + 2(k\cdot\epsilon')^2] \]
\[ \mathrm{II} + \mathrm{III} = 16[(k\cdot p)(k'\cdot p)\{2(\epsilon\cdot\epsilon')^2 - 1\} - (k\cdot\epsilon')^2(k'\cdot p) + (k'\cdot\epsilon)^2(k\cdot p)] \]
\[ \mathrm{IV} = \mathrm{I}(\epsilon \leftrightarrow \epsilon',\ k \leftrightarrow -k') = 8(k'\cdot p)[(k\cdot p) - 2(k'\cdot\epsilon)^2] \]

\M\ 2 = Tj^— 2 (k ■ P)[(k' ■ p) + 2(k ■ £ ') 2 ] 

+ -f— 2 (k'-p)[(k-p)-2(k'- E ) 2 } 

[(k-p)(k'-p){2(e- £ ') 2 -l}-(k- £ ') 2 (k'-p) 



(k-p)(k'-p) 
h(fc'-e) 2 (fc-p)] 



• i^l -( k '\ l^l 2 - " 2 KV 
■■ dQlua {kj 64^ m 2 Am 2 \w) 

Problem 7.7 (2): 

S/l - <5 /; = -ij d 4 x(y(fe)|A„|0>j"(x) = -i€ / ,(A)j(k)" 
dP,, = |S/i- 



^[f 



(e{X)-pj) (6(A)- Pi ) T 



Chapter 8: Renormalization 
Chapter 9: Symmetries 

Problem 9.2: To prove Theorem 9.1, we put

\[ f(t) = e^{tA}Be^{-tA}, \qquad f(0) = B \]

Differentiating with respect to the parameter $t$,

\[ \frac{df}{dt} = e^{tA}[A, B]e^{-tA} = [A, f(t)] \]

This is a first-order differential equation and uniquely defines a function if
the initial condition at $t = 0$ is given. By changing $A$ to $tA$, the right hand side of
Eq. (9.15) is shown to satisfy the same differential equation and the same initial
value. So both sides of Eq. (9.15) are the same function.

To prove Theorem 9.2, change $A \to tA$, $B \to tB$ and differentiate by $t$. The
left hand side is

\[ g(t) = e^{tA+tB}, \qquad g'(t) = \frac{dg}{dt} = (A+B)g(t), \qquad g(0) = 1 \]

Differentiating the right hand side $[= h(t)]$ after the above change,

\[ h'(t) = e^{tA}Ae^{tB}e^{-\frac{t^2}{2}[A,B]} + e^{tA}e^{tB}(B - t[A,B])e^{-\frac{t^2}{2}[A,B]}, \qquad h(0) = 1 \]

Using Eq. (9.15) and the condition $[A,[A,B]] = [B,[B,A]] = 0$,

\[ e^{tB}A = \left(A + [tB, A] + \frac{1}{2}[tB,[tB,A]] + \cdots\right)e^{tB} = (A + [tB,A])e^{tB} \]
\[ \therefore\ e^{tB}[A,B] = ([A,B] + [[tB,A],B])e^{tB} = [A,B]e^{tB} \]

Similarly

\[ e^{tA}[A,B] = [A,B]e^{tA} \]
\[ \therefore\ h'(t) = \left[(A - t[A,B])e^{tA} + e^{tA}B\right]e^{tB}e^{-\frac{t^2}{2}[A,B]} \]

From Eq. (9.16b), $e^{tA}B = (B + t[A,B])e^{tA}$

\[ \therefore\ h'(t) = (A+B)e^{tA}e^{tB}e^{-\frac{t^2}{2}[A,B]} = (A+B)h(t) \]

Since g(t) and h(t) satisfy the same differential equations and initial conditions, 
both are the same function. The second equation can be proved by noting 
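The identity $e^{A+B} = e^{A}e^{B}e^{-[A,B]/2}$, valid when $[A,B]$ commutes with both $A$ and $B$, can be checked exactly on $3\times3$ strictly upper-triangular matrices, for which the exponential series terminates. This is a sketch with matrices chosen here for illustration; `exp_nilpotent` is a hypothetical helper that is only correct for such nilpotent matrices.

```python
from fractions import Fraction


def identity():
    return [[1 if i == j else 0 for j in range(3)] for i in range(3)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(3)] for i in range(3)]


def scale(c, a):
    return [[c * x for x in row] for row in a]


def commutator(a, b):
    return add(matmul(a, b), scale(-1, matmul(b, a)))


def exp_nilpotent(m):
    """exp(m) = 1 + m + m^2/2, exact only for 3x3 strictly upper-triangular m (m^3 = 0)."""
    return add(add(identity(), m), scale(Fraction(1, 2), matmul(m, m)))


a, b = Fraction(2), Fraction(3)
A = [[0, a, 0], [0, 0, 0], [0, 0, 0]]
B = [[0, 0, 0], [0, 0, b], [0, 0, 0]]
C = commutator(A, B)   # central: commutes with both A and B

lhs = exp_nilpotent(add(A, B))
rhs = matmul(matmul(exp_nilpotent(A), exp_nilpotent(B)),
             exp_nilpotent(scale(Fraction(-1, 2), C)))
print(lhs == rhs)
```

Exact rational arithmetic is used so that the comparison is an equality rather than a tolerance check.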

Problem 9.3: 

Putting l/ = (A 3 A 4 |r^(£)|AiA2) and using the partial wave expansion 



aror = JdQj2\fhUMX 2 ( 



AsA 4 [ P J J' 

■ E< 2 -/ + 1 )i T /i 2 
We have used $\int_{-1}^{1} d\cos\theta\, d^{J}_{MM'}(\theta)\,d^{J'}_{MM'}(\theta) = 2\delta_{JJ'}/(2J+1)$ to derive the last
equality. Then

= ^Epj + 1)1^-^ 

F }MM 

In going to the third equality we have used $\mathrm{Im}\,T_J = \mathrm{Im}[(e^{2i\delta_J} - 1)/i] = 1 - \cos 2\delta_J$,
$|T_J|^2 = 2(1 - \cos 2\delta_J)$.

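Problem 9.4: The Maxwell equation says $\partial_\mu\partial^\mu A^\nu = j^\nu$, so the transformation properties of $A^\nu$ and
$j^\nu$ are equal. Since $j^\nu = (\rho, \rho\mathbf v)$, $A'^0(-t) = A^0(t)$, $\mathbf A'(-t) = -\mathbf A(t)$. For the
transformation properties of the quantized fields, refer to Eq. (9.133) and Eq. (9.134). The
properties of $\mathbf E$ and $\mathbf B$ can be obtained by considering $\mathbf E = -\nabla A^0 - \partial_t\mathbf A$, $\mathbf B = \nabla\times\mathbf A$.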
Problem 9.6: Use Table 4.2 in Chap. 4. Or use 

2 Y ° = ~4 €p " y ' 

\-€%V a = F f,v , F>'(£, B) = F f,v (-B,-E) 

-y 5 o f "'F flv = --a' lv F flv = -ia ■ B-a- E 

Problem 9.7* : Define operators 

5>J - «1)(*H - «t) = £(N* + T k ) = N + T , 

fc it 

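\[ N_k = a_k^\dagger a_k + b_k^\dagger b_k, \qquad T_k = -(a_k^\dagger b_k + b_k^\dagger a_k) \]

Then $[N_k, T_q] = [N, T] = 0$ and

\[ [N_k, a_q] = -\delta_{qk}a_q, \quad [N_k, a_q^\dagger] = \delta_{qk}a_q^\dagger, \quad [N_k, b_q] = -\delta_{qk}b_q, \quad [N_k, b_q^\dagger] = \delta_{qk}b_q^\dagger \]

\[ [T_k, a_q] = \delta_{qk}b_q, \quad [T_k, a_q^\dagger] = -\delta_{qk}b_q^\dagger, \quad [T_k, b_q] = \delta_{qk}a_q, \quad [T_k, b_q^\dagger] = -\delta_{qk}a_q^\dagger \]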

Applying the above commutation relations to the formulas 

\[ \text{If } [A,[A,B]] = [B,[B,A]] = 0 \ \to\ e^{A+B} = e^{-\frac12[A,B]}e^{A}e^{B} = e^{A}e^{B}e^{-\frac12[A,B]} \]
\[ e^{A}Be^{-A} = B + [A,B] + \frac{1}{2!}[A,[A,B]] + \cdots \]



we obtain 

ia(N+T) _ iaT iaN _ 



--(t- ia a t , e ~ ia b q ), 

= (a q cosa + ib q sma, b q cos a + ia q sin a) 



*"">,. b q )i 
e iaN (bl at), 
e' aT (a q , b„)i 
e iaT K b\Y 
Choosing a = a/2, C = e ; f < N + T ) = e ; f N e'f r = e'f Vi N 



Ca,C =exp^-rJexp^-Njo,exp^-i-Njexp^-i-Tj 

= exp[i|r](-ifl,)exp[-i|r] = fe, 
CiJC- 1 = «p [i| T] op [i^N] ijexp [_i*N] op [-i^T] 

= exp[i|T](ifet)exp[-i|r] = at 

C^C - x = <p' ] , Similarly Cq> ] C~ 1 = <p 

Problem 9.8: As the two $\pi^0$'s are identical particles, their wave function must be
totally symmetric, which means $L$ = even. The angular momentum conservation
requires $J = L = 1$, and the two conditions are incompatible.
Problem 9.9: The positronium is a bound state of $e^-e^+$ and its $C = (-1)^{L+S}$. For
$L = 0$, $C = +, -$ depending on $S = 0, 1$, which allows $2\gamma$, $3\gamma$, respectively but not



Chapter 10: Path Integral I 
Problem 10.2: 

J- = lim i J dt L(q(t) + ed(t- t'), q(t) + e — 6(t- t')) - L(q(t), q(t)) 



6q(t') 

TdL , dL d „ . 1 31 I d 311 



"/' 



Problem 10.4: Writing down the solution as $x = A\cos\omega t + B\sin\omega t$ with the
conditions $x(t_i = 0) = q_i$, $x(t_f = T) = q_f$ gives

\[ x = q_i\cos\omega t + \frac{q_f - q_i\cos\omega T}{\sin\omega T}\sin\omega t \]

Then calculate 



■M^-^] 
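The classical path of the harmonic oscillator quoted in Problem 10.4 can be checked against its boundary conditions and its equation of motion $\ddot x = -\omega^2 x$. The following is a sketch; the function name and the sample values of $q_i$, $q_f$, $\omega$ and $T$ are assumptions, and $\sin\omega T \neq 0$ is required.

```python
import math


def classical_path(t, q_i, q_f, omega, total_time):
    """x(t) = q_i cos(wt) + (q_f - q_i cos(wT)) / sin(wT) * sin(wt)."""
    coeff = (q_f - q_i * math.cos(omega * total_time)) / math.sin(omega * total_time)
    return q_i * math.cos(omega * t) + coeff * math.sin(omega * t)


def equation_of_motion_residual(t, q_i, q_f, omega, total_time, h=1e-4):
    """Finite-difference estimate of x'' + omega^2 x, which should be close to zero."""
    x_plus = classical_path(t + h, q_i, q_f, omega, total_time)
    x_mid = classical_path(t, q_i, q_f, omega, total_time)
    x_minus = classical_path(t - h, q_i, q_f, omega, total_time)
    return (x_plus - 2.0 * x_mid + x_minus) / h ** 2 + omega ** 2 * x_mid


q_i, q_f, omega, total_time = 0.3, -1.1, 2.0, 1.0
print(classical_path(0.0, q_i, q_f, omega, total_time),
      classical_path(total_time, q_i, q_f, omega, total_time),
      equation_of_motion_residual(0.4, q_i, q_f, omega, total_time))
```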
Problem 10.7: (4) The average energy of a photon at temperature $T$ is

\[ \langle E\rangle = \frac{\hbar\omega}{2} + \frac{\hbar\omega}{e^{\hbar\omega/kT} - 1} \]

The number of photon states in the phase space element is $2V\,d^3p/(2\pi\hbar)^3 = (8\pi/c^3)V\nu^2\,d\nu$,
where the factor 2 represents two polarization states. Expressing $\hbar\omega = h\nu$ and
omitting the zero point energy (the first term), the average photon energy per unit
frequency per unit volume is given by $(8\pi h/c^3)\nu^3\,d\nu/(e^{h\nu/kT} - 1)$, which is
Planck's formula.



Chapter 11: Path Integral II 

Problem 11.1: The function $f$ is expressed as a polynomial of $\eta_i$'s, but from the
condition $\int d\eta = 0$ and anticommutativity of all the G-variables, the only surviving
term in the integrand is one which includes all the $\eta_1\cdots\eta_N$. As



7i---? N =n(x>~Wj*) 



The right hand side is a polynomial of §'s, but as any overlap of the same variable 
(|) 2 gives zero, only a term which includes all the fi • ■ ■ £ N survives. Therefore 

= (detL/- 1 )| 1 | 2 -..| N 



Writing the coefficient of the surviving polynomial as A, 

=j G yn^Adet(u- i )i 1 ,---,i N , 

On the other hand, because of Eq. (11.85b) 

j dr ll ---d VNVl ... VN = j d^...d^ N ^...^ N 

Comparison of the two equations gives J G = detU. 
Problem 11.2: Consider an orthogonal transformation 

tj = O t £ , ij T Atj = i T OAO T § = | T A'£ 
As the n-dimensional integration survives only when the integrand includes all the
$\eta_i$'s, only the n/2-th power of the expansion si






(-i)" /2 



The equation follows because 

1. Since $O$ is an orthogonal transformation, the Jacobian is $J = \det O = 1$.

2. In the expansion only terms that have exactly n factors can contribute to the 
integral. 

Now any antisymmetric (n x n)-dimensional matrix A with even n can be brought 
with a unitary transformation to the form 








for odd n the matrix has one more row and one more column with all zeros. Then 
the determinant is easily obtained from the above form (See proof of Eq. (1.3) be- 

) 

detA=detA' = i (fll)2 """ K/2)2 

(0 n = odd 

Assuming A' has this form, 






o" 










n/2 






'.-(^/«.-«.(-b T «) 

There can be no two equal |'s in the integral. Since § appear always pairwise, 
they can be commuted to normal ordering £„ • • • fi with a sign change (— l)"' 2 . 



The product a^ ■ • ■ a n / 2 gives just VdetA'. Then the role of si 
(n/2)! with the same result. All things put together we obtain 

I„ (n even) = / dt]i---dt] n e~'' 1 Al > 

= / d£i ■ ■ ■ |„ | n ■ ■ • |i vdet A' = VdetA 

The result is to be contrasted with the normal (or bosonic) Gaussian ii 
just to multiply



! dxi- 



Vdet A 



(Proof of Eq. (I.3)>: First consider the simple case of n = 2. 



For the general case, we use the fact that i A is a hermitian matrix. It can be diago- 
nalized by a unitary transformation V 

ViAV 1 = D 

where D is real and diagonal whose elements are solutions to the secular equation 

det|iA-2l| = 

Since A T = -A, we have det \(iA- 2l) T | = det | -iA- Al| = 0. Therefore, if I is 
a solution, so is (—X) and D is of the form 



From this 
Eq.(I.3),v 



sees that det A = 
e the 2x2 matrix 



" det D = (a a ■ ■ ■ a„) 2 . To put D in the form 



which has the property 
Thus S(-iD)S'< = A' for



Then the matrix A' defined by 

A' = S(-JD)S'~ = S[-iV(iA)V<]S"< = {SV)A{SVy = UAlP 

reproduces Eq. (I.3). Since both $A$ and $A'$ are real, so is $U$; namely, $U$ is an orthogonal
matrix and Eq. (I.3) is proved.

Problem 11.4: We start from the second order interaction Z 2 given in Eq. (11.168) 



-S{/< 



,-)G*jk)}" 



>))\isj»(y 

I - / d 4 xd 4 z 1 d 4 z 2 d 4 z i Tj(z 2 )S F {z 2 -x)(-iqy^) 
x S F (x - z 3 )?7(z3)D F/ „,(x - Zl );"( Zl )Z [?7, r,, J]\ 



= T!/ d4y (-T^)) H ^G^(y) 



(1.4) 



To make an internal photon line, d/dj fl (x) has to act on ] v (z\), producing 
Dpfipix — y). There are two ways to make the electron self energy contributions: 
(a) one pulls down another electron line and connects it to the outgoing electron. 
Namely, d/dt](y) acts on 77(23), and d/drj(y) acts on (dZ /dJf). (b) the other con- 
necting the new line (<3Z /<5?/) to the incoming electron of the first vertex. The 
process (a) is expressed as: 

N f A — 

Z 2ba = -— / d 4 xd 4 yd 4 z 2 d 4 w{ri{z 2 )S F (z 2 -x){-iqy>') 

x S F (x - Y){ -iqyP)S F (y - w) V (w)D Fup (x - y)}Z 

The process (b) is given 

N f 4 — 

Z 26 b = -y J d 4 xd 4 yd 4 z 1 d 4 w{rj{w)S F (w-y)(-iqy p ) 

x S F (y - x)(-iqyf)S F (x - z 1 )tj(z 1 )D ] , ftp (x - y)}Z 

By changing the integration variables we see that $Z_{2bb}$ is identical to $Z_{2ba}$ and that
it serves merely to cancel the factor 1/2. This is a correction to the electron propagator.
Since the electron propagator is given by

<fl|T0(,,fty,]| fl , = (I)' ^Sw) mtlJ] L^o 



iS F (x -Y) + J d 4 wd 4 z S F (x - W )(-iqyf)S F (w - z)[-iqy v ) 



Using the momentum representation of the propagators Eq. (11.65) and 
Eq. (11.119) and integrating over w and z, the second term becomes: 



f d 4 P - 
J (In) 4 

"[/ 



i- 



1 ■ v~\ ! 

(-iqy v )\ 



Then the momentum representation of the electron propagator including the second
order correction is expressed as

Problem 11.5: The reduction formula for the Compton scattering 

y(kX) + e(pr)^y(k'X') + e (p's) 

where X, X' are the photon polarizations and r, s are electron spin orientations is 
given by 

ont (k'X'p's\kA,pr) m 

= J d 4 x 1 d 4 x 3 d 4 x 2 d 4 x, £ *(k , ,X , )f k ix 3 )(d p dP) x ,U p ,(x 4 )(if- m) X4 

x d 4 mv,j] I 

d V (x 2 )dJK( Xl )dri(x 4 )dr(x 3 )\- =ll=J=0 

x{ -if- m)x2 U p (x 2 )(ty } ) xl£fl (k,A)f k (x l ) (1.5) 

The generating functional that contributes to the Compton scattering was given by 
Eq. (11.171): 

Zcompton = i / d 4 xd 4 yd 4 z 1 d 4 z 2 d 4 w l d 4 w i lj"(w l )D Fop (w 1 - y) 

x D Fa/3 (x - z 1 )jl i (z 1 )lf(w i )S F (w i - y)(-iqy») 

x S F (y - x)(-iqy")S F (x - Zi ) V (z 3 )}z ft, rj,J] 
As the photon source $J^\mu$ has no directionality, namely it could serve either as a
source or a sink, there are two ways to make the photon derivatives, producing two
distinct scattering processes as depicted in Figure I.1.
Application of the four-fold derivative in the reduction formula to $Z_{\rm Compton}$ yields:



k)L;.o = i / d 



dt } (x 2 )dj!'(x 1 )dr}(x 4 )dr( 
x [{D fvp (% 3 - y)D Fa/ ,(x - *i)Sf(*4 - y)(-iqv") 
x S F (y - x)(-iqy")S F (x - x 2 )} + [D Fvu ( Xi - x)D Fpfl (y - Xl ) 
x S F (x A - y)(-iqy»)S F (y - x)(-iqy")S F (x - * 2 )}] 

Inserting the above equation in Eq. (1.5), we obtain the scattering amplitude for the 
Compton scattering 

oui (k'l',p's\kl,pr) in = i f d 4 xd 4 y 

xU p '(y)[f£[y)Mx)e* v (k'r)(-iqy v )S F (y - x)(-iqy") £fl (k,l) 
+ fk*(x)fk(Y)z fl (m-iqYnSF(Y - x)(-iqy")eUk',r)]u p (x) 
Insertion of 

f k (x) = *-*«, U p (x) = u r (p)e-^ , 
S F (x 



Y > J (2xf pi-mi + ie 



and the integration of x and y yields two d functions and after integrating over p 
gives the final form: 

out (k'r, p's\ kX, p r) m = -(27t)Hd\k + p-k'- p')e 2 u s (p') 

[ I {p + k) 2- m 2 +iE 1+ l {p _ kf - m 2 +iE I J W 

where e/= e fl (k,A)y f ' , p' — e v (k',X')y v . The Compton scattering amplitude 
Eq. (7.51) has thus been reproduced. 



Figure 1.1 Compton Scattering. 
Chapter 12: Accelerators and Detectors

Problem 12.3: 

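As the mean $n$ is large and the standard deviation $\sigma/n = 1/\sqrt n$ is small, $r$ is large
and distributes around $n$. Let

\[ F(r) = \ln\frac{n^r e^{-n}}{r!} \simeq r\ln n - n - \left[\ln\sqrt{2\pi} + \left(r + \frac12\right)\ln r - r\right] \]

Expand $F(r)$ in a Taylor series around $r = n$ and keep terms up to quadratic order $(r-n)^2$:

\[ F(n) = -\ln\sqrt{2\pi n}, \qquad F'(n) = -\frac{1}{2n}, \qquad F''(n) = -\left(\frac1n - \frac{1}{2n^2}\right) \]

\[ F(r) \simeq -\ln\sqrt{2\pi n} - \frac{1}{2n}(r-n) - \frac12\left(\frac1n - \frac{1}{2n^2}\right)(r-n)^2 \]
\[ = -\ln\sqrt{2\pi n} - \frac{1}{2n}(r-n)(r-n+1) + \frac{1}{4n^2}(r-n)^2 \]

Neglecting the third term and approximating $n + 1 \simeq n$, we obtain

\[ P(r) \simeq \frac{1}{\sqrt{2\pi n}}\exp\left[-\frac{(r-n)^2}{2n}\right] \]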
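The Gaussian limit of the Poisson distribution derived in Problem 12.3 can be examined numerically by comparing logarithms of the two distributions near the mean. This is a rough sketch; the function names and the choice $n = 1000$ are assumptions for illustration.

```python
import math


def log_poisson(r, n):
    """ln of the Poisson probability n^r e^{-n} / r!."""
    return r * math.log(n) - n - math.lgamma(r + 1)


def log_gauss(r, n):
    """ln of the Gaussian with mean n and variance n."""
    return -0.5 * math.log(2.0 * math.pi * n) - (r - n) ** 2 / (2.0 * n)


n = 1000
for r in (968, 1000, 1032):   # roughly within one standard deviation of the mean
    print(r, log_poisson(r, n), log_gauss(r, n))
```

The remaining difference within one standard deviation is of order $1/\sqrt n$, consistent with the terms dropped in the expansion.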



Chapter 13: Hadron Spectroscopy 
Problem 13.1: 

1. The total angular momentum in the final state is $J = L$ and must be zero. The
parity of the final state is $P_1P_2(-1)^L = P_1P_2 = P_1^2 = +1$. The last equality
holds because the two particles are identical.

2. Let the orbital angular momentum of the two particles be $l$ and that of the relative
motion between the CM of the two-particle system and the third particle be $L$. From the
requirement $J = l + L = 0$, $l = L$. Then the parity becomes $(-1)^3(-1)^{l+L} = -$.
The problem 2) and 3) were used to prove the parity violation in the weak decay
of $K \to 2\pi/3\pi$.

3. We set the z-axis along the photon direction in the rest frame of the initial
particle. Then $J_z = L_z + S_z = S_z \neq 0$.

4. With the same setting as the problem 2), the angular momentum conservation
requires $l + L = 1$. Then $l = L, L \pm 1$, therefore both parities are allowed.

5. $1 + L = 1 \to L = 0, 1, 2$; $P = (-1)^2(-1)^L$. $\therefore$ $L = 1$ is allowed.

6. The orbital angular momentum $L_i$ in the initial state can be considered as zero,
which means $P$ is $(-1)$. As the final state consists of two identical bosons with
$S = 0$, $L$ must be even, which means $P = +$ and the parity is violated.
Problem 13.2: The decay to $2\gamma$ is obviously of electromagnetic origin. With the
same logic as used to determine the spin and parity of $\pi^0$ (see arguments in
Sect. 13.2), the decay mode $\eta \to 2\gamma$ tells that the spin of $\eta$ cannot be 1, and that the
C-parity is $(+)$. Since $I = 0$ the G-parity is $G = +1$. However, the $3\pi$ final state has
$G = -1$, which means $I = 1$. Therefore, the decay violates isospin and G-parity
and cannot be strong. In fact, the other decay modes have branching ratios comparable
to that of the $2\gamma$ mode, which is consistent with all of them being induced by the
electromagnetic interaction. As the electromagnetic interaction conserves parity, the spin parity
of $\eta$ is fixed to be $J^P = 0^-$.
Problem 13.3: From Eq. (13.34) Eq. (13.35) 



T(pd^ i Hjt+) = j2jlT V2 
T(pd^Hejt°)~ JT/3T y2 



11.0) 
= -V2 



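\[ pp = |1/2, 1/2\rangle|1/2, 1/2\rangle = |1, 1\rangle \]
\[ np = |1/2, -1/2\rangle|1/2, 1/2\rangle = (|1, 0\rangle - |0, 0\rangle)/\sqrt2 \]
\[ d\pi^+ = |1, 1\rangle, \qquad d\pi^0 = |1, 0\rangle \]

\[ \frac{T(pp \to d\pi^+)}{T(np \to d\pi^0)} = \frac{T_1}{T_1/\sqrt2} = \sqrt2 \ \to\ \frac{d\sigma(pp \to d\pi^+)}{d\sigma(np \to d\pi^0)} = 2 \]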



K~n = |l/2,-l/2)|l/2,-l/2) =|1,-1) 

K~p = |l/2,-l/2)|l/2,l/2) = (|1,0) + |0,0»/V2 

K°S~= |l/2,-l/2)|l/2,-l/2) = |1,-1) 

K+S~ = |l/2,l/2)|l/2, -1/2) = (|1,0>-|0,0»/V2 

K°3° = |l/2,-l/2)|l/2,l/2) = (|1,0) + |0,0))/V2 

.-. f(K-n^K°E-) = T 1 

f(K~P -> K°S-) + f(K~p -+ K°S°) = (Ti - T )/2 + (T, - 
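\[ K^+\Lambda = |1/2, 1/2\rangle, \qquad K^0\Lambda = |1/2, -1/2\rangle \]

\[ \pi^+n = |1, 1\rangle|1/2, -1/2\rangle = \sqrt{1/3}\,|3/2, 1/2\rangle + \sqrt{2/3}\,|1/2, 1/2\rangle \]

\[ \pi^-p = |1, -1\rangle|1/2, +1/2\rangle = \sqrt{1/3}\,|3/2, -1/2\rangle - \sqrt{2/3}\,|1/2, -1/2\rangle \]

\[ \therefore\ \frac{T(\pi^+n \to K^+\Lambda)}{T(\pi^-p \to K^0\Lambda)} = \frac{\sqrt{2/3}}{-\sqrt{2/3}} = -1 \ \to\ d\sigma(\pi^+n \to K^+\Lambda) = d\sigma(\pi^-p \to K^0\Lambda) \]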
Problem 13.8: 

(a) The real part and imaginary part of the scattering from Eq. (13.91) give 



r] = 1 for elastic scattering and r] < 1 at higher energy where inelas 
occurs and is a function of energy. 
When d = f , f , ^, 



(ReT= -,ImT = 0.5) , 
(ReT = --,ImT = 0.5) 



and this means each trajectory circles around the center of the Argand diagram
counterclockwise (clockwise) when the force is attractive (repulsive). This is the case for
a single amplitude.
(b) When there is a non-resonant repulsive force which does
not change appreciably as a function of energy, the total amplitude is the sum of the
two (with proper normalization to lie within the circle). We consider, for simplicity,
the case when the total amplitude is the average of the two [see Fig. I.2]. Then
examples in Fig. 13.15 show various cases satisfying conditions (b-d).
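Problem 13.9: Denote the 3-momenta and kinetic energies of the three pions as $\mathbf k_i$ and $T_i$,
which satisfy

\[ \mathbf k_1 + \mathbf k_2 + \mathbf k_3 = 0, \qquad T_1 + T_2 + T_3 = m_K - 3m_\pi = Q \]

\[ T_i = k_i \ \text{(ultra-relativistic)}, \qquad T_i = \frac{k_i^2}{2m} \ \text{(nonrelativistic)} \]

We show that the boundary in the $(x, y) = \{(T_1 - T_3)/\sqrt3,\ T_2\}$ plane becomes a circle for
the nonrelativistic case.

\[ \mathbf k_2 = -(\mathbf k_1 + \mathbf k_3) \ \to\ k_2^2 = k_1^2 + k_3^2 + 2k_1k_3\cos\theta \]

Therefore

\[ T_2 = T_1 + T_3 + 2\sqrt{T_1T_3}\cos\theta, \qquad \cos\theta = \frac{T_2 - T_1 - T_3}{2\sqrt{T_1T_3}} \]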
Figure I.2 (a) Single amplitude. (b) Repulsive force + resonance. (c) Attractive force + resonance.
(d) Repulsive force + resonance. (b-d) Points 1, 2, ... denote two amplitudes at the same
energy. Points 1', 2', ... on dashed lines are midpoints connecting 1-1, 2-2, etc. (d) is
similar to (b), but here the resonant amplitude progresses slowly while the non-resonant
amplitude progresses swiftly, and the relative velocity is reversed at high energy.



The boundary is obtained by putting $\cos\theta = \pm1$:

\[ T_1^2 + T_2^2 + T_3^2 - 2T_1T_2 - 2T_2T_3 - 2T_3T_1 = 0 \]



l+ { T 2 (r 2 -Q)-T 1 r 3 } 



- (T 2 + T 2 2 + Tj - 2TjT 2 - 2T 2 T 3 - 2T 3 Tj -i 
Namely, in the $(x, y)$ plane, it is a circle with center at $(x, y) = (0, Q/3)$ and radius
$Q/3$.
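The circular Dalitz boundary for the nonrelativistic case can be checked directly from the kinematics: collinear configurations ($\cos\theta = \pm1$) should lie on the circle, and all others inside it. The sketch below assumes equal masses and uses hypothetical function names and sample momenta.

```python
import math


def dalitz_residual(k1, k3, cos_theta, mass=1.0):
    """x^2 + (y - Q/3)^2 - (Q/3)^2 for nonrelativistic three-body kinematics.

    k1, k3 are momentum magnitudes, cos_theta the cosine of the angle between
    them, and k2 = -(k1 + k3) as vectors.
    """
    k2_sq = k1 ** 2 + k3 ** 2 + 2.0 * k1 * k3 * cos_theta
    t1 = k1 ** 2 / (2.0 * mass)
    t2 = k2_sq / (2.0 * mass)
    t3 = k3 ** 2 / (2.0 * mass)
    q = t1 + t2 + t3
    x = (t1 - t3) / math.sqrt(3.0)
    y = t2
    return x ** 2 + (y - q / 3.0) ** 2 - (q / 3.0) ** 2


print(dalitz_residual(0.7, 1.3, 1.0),    # collinear: on the circle
      dalitz_residual(0.7, 1.3, -1.0),   # anti-collinear: on the circle
      dalitz_residual(0.7, 1.3, 0.3))    # generic: negative, inside the circle
```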

Ultra-relativistic: From 
f-fc 

2fe 1 fe 2 
r i = fe i ^y = T 3 >|T 1 -T 2 | = V3|x| 
Therefore, y is above lines AC and BC. Next, 

T 3 = Q- (Tj + T 2 ) = Q- (fe x + fe 2 ) < Q- fc 3 = Q- T 3 ,^ y = T 3 < -| 

Namely, y is below the line AB. 

Problem 13.12: Setting $\alpha = 2mV(r)$, the Schrödinger equation for the radial wave
function is given by

$$-u_k'' + \alpha u_k = k^2u_k , \quad \alpha = 2mV(r) \qquad (I.6)$$

$$-v_k'' = k^2v_k \qquad (I.7)$$

From Eq. (I.7), we have

$$v_k = \frac{\sin(kr + \delta)}{\sin\delta} = \cos kr + \sin kr\cot\delta \simeq 1 + (k\cot\delta)\,r \qquad (I.8a)$$

$$\therefore\ \lim_{k\to0}v_k = v_0 = 1 - r/a , \quad \lim_{k\to0}k\cot\delta = -1/a \qquad (I.8b)$$

where we have used $\lim_{k\to0}\delta \propto k$. In Eq. (I.6) a solution $u_0$ for k = 0 satisfies

$$-u_0'' + \alpha u_0 = 0 , \quad \therefore\ \alpha = u_0''/u_0 \qquad (I.9)$$

Multiplying (I.6) by $u_0$ and inserting Eq. (I.9)

$$-u_0u_k'' + u_0\alpha u_k = k^2u_0u_k \;\to\; [u_0'u_k - u_0u_k']' = k^2u_0u_k \qquad (I.10)$$

Similarly

$$[v_0'v_k - v_0v_k']' = k^2v_0v_k \qquad (I.11)$$

Taking (I.11) − (I.10)

$$[v_0'v_k - v_0v_k']' - [u_0'u_k - u_0u_k']' = k^2(v_0v_k - u_0u_k)$$

Integrating both sides

$$\left[v_0'v_k - v_0v_k' - u_0'u_k + u_0u_k'\right]_0^R = k^2\int_0^R dr\,(v_0v_k - u_0u_k)$$

Calculating the l.h.s. explicitly using Eq. (I.8a) and Eq. (I.8b), we have

$$\frac{1}{a} + k\cot\delta = k^2\int_0^R dr\,(v_0v_k - u_0u_k) = k^2\int_0^\infty dr\,(v_0v_k - u_0u_k)$$

where we used $u_0 = u_k = 0$ at r = 0 and $u_k = v_k$ at r = ∞. In the limit k → 0,

Chapter 14: Quark Model

Problem 14.1:

We show this for the case of an antisymmetric tensor, $A^{ij} \to A'^{ij}$:

$$A'^{ij} = U^i_aU^j_bA^{ab} = -U^i_aU^j_bA^{ba} = -U^i_bU^j_aA^{ab} = -A'^{ji}$$

In going to the second to last equality, we used the fact that the summation suffixes
can be freely renamed. For a symmetric tensor $S^{ij}$, the proof is similar.

Problem 14.2: $(I_3, Y)$ of $A^{12} = \frac{1}{\sqrt2}(ud - du)$, $A^{23} = \frac{1}{\sqrt2}(ds - sd)$, $A^{31} = \frac{1}{\sqrt2}(su - us)$
are (0, 2/3), (−1/2, −1/3), (1/2, −1/3), which are the same quantum numbers as
those of $\bar s, \bar u, \bar d$. Therefore, these three members belong to 3*.

Problem 14.3:

$$\epsilon_{ijk}\,\xi'^i\eta'^j\zeta'^k = \epsilon_{ijk}U^i_aU^j_bU^k_c\,\xi^a\eta^b\zeta^c = \det[U]\,\epsilon_{abc}\,\xi^a\eta^b\zeta^c = \epsilon_{abc}\,\xi^a\eta^b\zeta^c$$

Problem 14.4: Inserting the quantum numbers (Y, I) = (1, 1/2), (0, 1), (0, 0), (−1, 1/2)
of (N, Σ, Λ, Ξ) into Eq. (14.31), we obtain

$$m_N = a + b + c/2 , \quad m_\Sigma = a + 2c , \quad m_\Lambda = a , \quad m_\Xi = a - b + c/2 .$$

Eliminating a, b, c leads to Eq. (14.30a). Equality (14.30b) for mesons can be obtained
by changing $m_N \to m_K^2$, $m_\Sigma \to m_\pi^2$, $m_\Lambda \to m_\eta^2$, $m_\Xi \to m_K^2$.

Problem 14.5*: This takes an elaborate algebra.

Tensor Method: Baryons are composed of $q^iq^jq^k$, but as far as SU(3) properties
are concerned, it is convenient to use the equality Eq. (14.12) with meson octets
substituted with baryon octets.



= (*/)* 



-2- 4 + 4 



-m 



-r& 



As the mass term is expressed as a tensor $\bar B^i_jB^k_l$, the solution is obtained if we can
make a combination that transforms as a singlet and the 8-th member of an octet.
As stated in Sect. 14.1.3, meson singlet and octet can be expressed as

$$(M_8)^i_j = q^i\bar q_j - \frac13\delta^i_j\,q^k\bar q_k = \left(\delta^i_a\delta^b_j - \frac13\delta^i_j\delta^b_a\right)q^a\bar q_b \equiv (O^i_j)^b_a\,q^a\bar q_b \qquad (I.13)$$

Therefore, to reduce an arbitrary tensor to one that transforms like the mass term,
we make contractions of any (i, j) [or any pair of (i, b), (a, j), (a, b)] with $\delta^i_j$ and
$(O^3_3)^b_a$ and remove the residual suffix (a···, b···) dependence by using $\delta^i_j$, $\epsilon_{abc}$, $\epsilon^{abc}$.
Consequently, the mass term takes a form

A'iB'jBJ} + B{(OlYj~B J a B^} + C{(Of)}B" B ] a } 

= A{B) Bj} + (Ol))[F'{B J tt Bf - ~BlB{} + D'{B J a B? + 1" B{}} 

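As the second term in $(O^3_3)^b_a$ [$-\delta^3_3\delta^b_a/3$] has the same transformation property as a
singlet, we move it to the singlet term and consider only the first term as the octet
term, namely we are allowed to change $(O^3_3)^b_a \to \delta^3_a\delta^b_3$. In summary, the Hamiltonian
mass term becomes

$$M = A\{\bar B^i_jB^j_i\} + F\{\bar B^3_aB^a_3 - \bar B^a_3B^3_a\} + D\{\bar B^3_aB^a_3 + \bar B^a_3B^3_a\}$$

$$= A\{\bar pp + \bar nn + \bar\Sigma^+\Sigma^+ + \bar\Sigma^0\Sigma^0 + \bar\Sigma^-\Sigma^- + \bar\Xi^0\Xi^0 + \bar\Xi^-\Xi^- + \bar\Lambda^0\Lambda^0\}$$

$$+ F\{\bar pp + \bar nn - \bar\Xi^0\Xi^0 - \bar\Xi^-\Xi^-\} + D\{\bar pp + \bar nn + \bar\Xi^0\Xi^0 + \bar\Xi^-\Xi^- + \tfrac43\bar\Lambda^0\Lambda^0\}$$

$$\therefore\ M_N = A + F + D , \quad M_\Xi = A - F + D , \quad M_\Sigma = A , \quad M_\Lambda = A + \frac43D$$

Eliminating A, F, D, we finally obtain

$$M_N + M_\Xi = \frac12(M_\Sigma + 3M_\Lambda)$$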

Method Using U Spin: We consider expressing the mass in terms of quantum numbers
of SU(3). The conventionally used $(I_3, Y)$ are based on the subgroup $SU(2)_I \times U(1)_Y$.
As $\bar\psi\lambda_8\psi$ has the quantum number Y and does not differentiate Λ from Σ⁰, we
consider a different subgroup $SU(2)_U \times U(1)_Q$. $SU(2)_U$ is the symmetry of d and s and
is differentiated by $U_3$. $U_3$ of (d, s) is (+1/2, −1/2). (d, s) has U-spin 1/2 and u is a
U-spin singlet. The electric charge Q differentiates (d, s) and u. In terms of $U_3$ and
Q, $\lambda_8$ is reexpressed as $\lambda_8 = \sqrt3\,Y = (2/\sqrt3)(U_3 + V_3) = (4/\sqrt3)(U_3 + \tfrac12I_3)$ (see
Eq. (G.17c) (G.17d) in Appendix G). Using (Q, U), the octet members are classified as

$$(Q, U) = (1, 1/2) \to (p, \Sigma^+) , \quad (-1, 1/2) \to (\Sigma^-, \Xi^-) , \quad (0, 1) \to (n, \Sigma^0_U, \Xi^0) , \quad (0, 0) \to \Lambda^0_U$$

$\Sigma^0$, $\Lambda^0$ are eigenstates of I, but not of U. To obtain the U = 1, $U_3 = 0$ eigenstate, we apply
$U_- = (\lambda_6 - i\lambda_7)/2$ to the $U = U_3 = +1$ eigenstate, i.e. $|n\rangle$. The operation is analogous
to obtaining the $I_3 = 0$ state from the $I = I_3 = +1$ state by applying $I_- = (\lambda_1 - i\lambda_2)/2$.
The $I_-$, $U_-$ operations are to exchange $(u \to d,\ \bar d \to -\bar u)$, $(d \to s,\ \bar s \to -\bar d)$. Similar to
the meson octet for which

$$|\eta^0\rangle = |I: 0, 0\rangle = (u\bar u + d\bar d - 2s\bar s)/\sqrt6$$
$$I_-|\pi^+\rangle = I_-(u\bar d) = -(u\bar u - d\bar d) = \sqrt2\,[(-u\bar u + d\bar d)/\sqrt2] = \sqrt2\,|I: 1, 0\rangle$$
$$|U: 0, 0\rangle = (-2u\bar u + d\bar d + s\bar s)/\sqrt6$$
$$U_-|U: 1, 1\rangle = U_-|K^0\rangle = U_-(d\bar s) = \sqrt2\,[(-d\bar d + s\bar s)/\sqrt2] = \sqrt2\,|U: 1, 0\rangle$$

From the above equation, we can obtain

$$|U: 1, 0\rangle = -\frac12|\pi^0\rangle - \frac{\sqrt3}{2}|\eta^0\rangle$$

Substituting corresponding baryon octets in the above equation, we have

$$|\Sigma^0_U\rangle = |U: 1, 0\rangle = -\frac12|\Sigma^0\rangle - \frac{\sqrt3}{2}|\Lambda^0\rangle$$

As the mass of particles within a multiplet can be expressed as $a + bU_3 + cI_3$,
neglecting $I_3$

$$M_n = \langle U: 1, 1|M|U: 1, 1\rangle = a + b$$

$$\langle U: 1, 0|M|U: 1, 0\rangle = \left\langle -\tfrac12\Sigma^0 - \tfrac{\sqrt3}{2}\Lambda^0\right|M\left|-\tfrac12\Sigma^0 - \tfrac{\sqrt3}{2}\Lambda^0\right\rangle = \frac14M_{\Sigma^0} + \frac34M_{\Lambda^0} = a$$

$$M_{\Xi^0} = \langle U: 1, -1|M|U: 1, -1\rangle = a - b$$

Eliminating a, b, we finally obtain

$$M_N + M_\Xi = \frac12(M_\Sigma + 3M_\Lambda)$$
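
As a rough numerical illustration, the mass relation can be compared with the octet masses listed in Table J.1. This is only a sketch: averaging over each isospin multiplet is an assumption made here, and the dictionary keys are hypothetical labels.

```python
# Octet baryon masses in MeV/c^2, as listed in Table J.1.
masses = {
    "p": 938.272, "n": 939.565, "Lambda": 1115.68,
    "Sigma+": 1189.37, "Sigma0": 1192.64, "Sigma-": 1197.45,
    "Xi0": 1314.9, "Xi-": 1321.7,
}

def mean_mass(names):
    """Isospin-averaged mass of the listed states (an assumed averaging)."""
    return sum(masses[name] for name in names) / len(names)

m_nucleon = mean_mass(["p", "n"])
m_sigma = mean_mass(["Sigma+", "Sigma0", "Sigma-"])
m_xi = mean_mass(["Xi0", "Xi-"])
m_lambda = masses["Lambda"]

lhs = m_nucleon + m_xi                  # M_N + M_Xi
rhs = 0.5 * (m_sigma + 3.0 * m_lambda)  # (M_Sigma + 3 M_Lambda) / 2
rel_diff = abs(lhs - rhs) / rhs

print(f"M_N + M_Xi = {lhs:.1f} MeV, (M_Sigma + 3 M_Lambda)/2 = {rhs:.1f} MeV")
print(f"relative difference = {rel_diff:.4f}")
```

With these inputs the two sides agree to better than one percent.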

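Problem 14.6: Substituting Eq. (14.33) in Eq. (14.34a),

$$\begin{bmatrix} M_8^2 & M_{81}^2 \\ M_{18}^2 & M_1^2 \end{bmatrix} = \begin{bmatrix} c & s \\ -s & c \end{bmatrix}\begin{bmatrix} M_\phi^2 & 0 \\ 0 & M_\omega^2 \end{bmatrix}\begin{bmatrix} c & -s \\ s & c \end{bmatrix} = \begin{bmatrix} c^2M_\phi^2 + s^2M_\omega^2 & cs(-M_\phi^2 + M_\omega^2) \\ cs(-M_\phi^2 + M_\omega^2) & s^2M_\phi^2 + c^2M_\omega^2 \end{bmatrix}$$

where c = cos θ, s = sin θ. On the other hand, from Eq. (14.35)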

$$M_\omega^2 - M_1^2 = M_8^2 - M_\phi^2 = s^2M_\omega^2 + c^2M_\phi^2 - M_\phi^2 = s^2(M_\omega^2 - M_\phi^2)$$

Figure I.3 Left: $K^-p \to \phi\Lambda$. Right: $\pi^-p \to \phi n$.

Problem 14.7: Using the quark model, $K^- = s\bar u$, $p = uud$, $\phi = s\bar s$, $\Lambda = uds$,
$\pi^- = d\bar u$, $n = udd$, reactions can be described as in Fig. I.3. Namely, in the
process $K^-p \to \phi\Lambda$ quark lines are connected, but in $\pi^-p \to \phi n$ they are not.
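Problem 14.8: The decay process of a vector particle into a lepton pair is one where
a $q\bar q$ pair annihilates into a photon, which subsequently decays to a lepton pair. This
means the decay amplitude is proportional to the quark charge. Since the pair
annihilations $u\bar u$ and $d\bar d$ cannot be distinguished, the two amplitudes have to be added in
the decay process of $\rho^0 = (u\bar u - d\bar d)/\sqrt2$, $\omega = (u\bar u + d\bar d)/\sqrt2$ and $\phi = s\bar s$. The decay
rates are proportional to

$$\left|\frac{1}{\sqrt2}\left(\frac23 - \left(-\frac13\right)\right)\right|^2 : \left|\frac{1}{\sqrt2}\left(\frac23 + \left(-\frac13\right)\right)\right|^2 : \left|-\frac13\right|^2 = 9 : 1 : 2$$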
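
The ratio can be reproduced with exact fractions. The sketch below assumes the quark contents $\rho^0 = (u\bar u - d\bar d)/\sqrt2$, $\omega = (u\bar u + d\bar d)/\sqrt2$, $\phi = s\bar s$, and factors out the $(1/\sqrt2)^2 = 1/2$ analytically. Variable names are illustrative only.

```python
from fractions import Fraction

# Quark electric charges in units of e.
charge = {"u": Fraction(2, 3), "d": Fraction(-1, 3), "s": Fraction(-1, 3)}

# Squared charge-weighted amplitudes; the factor 1/2 is (1/sqrt(2))^2.
rho = Fraction(1, 2) * (charge["u"] - charge["d"]) ** 2
omega = Fraction(1, 2) * (charge["u"] + charge["d"]) ** 2
phi = charge["s"] ** 2

# Normalise to the omega width.
ratio = tuple(width / omega for width in (rho, omega, phi))
print("rho : omega : phi =", " : ".join(str(r) for r in ratio))
```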

Problem 14.9: The decay rate Eq. (14.40) can be derived as follows. We consider a
vector V ($J^P = 1^-$) or a scalar S ($0^-$) meson which is a composite of quarks (where
$m_q \gg m_l$ is assumed) in the relative orbital angular momentum S-state. The cross
section for $q\bar q \to l\bar l$ is given by Eq. (C.17),

$$\sigma_{tot} = \frac{4\pi\alpha^2}{3s}\,Q^2\,\frac{\beta_l}{\beta_q}\left(1 + \frac{2m_q^2}{s}\right)\left(1 + \frac{2m_l^2}{s}\right)$$

where $\beta_q$, $\beta_l$ are the velocities of the quark and the lepton in the center of mass frame
and Q is the electric charge of the quark q. The cross section is a sum of the spin triplet
cross section $\sigma_t$ and the spin singlet $\sigma_s$, i.e. $\sigma = \frac34\sigma_t + \frac14\sigma_s$. For the vector meson
decay $V \to l\bar l$, $\sigma_s$ does not contribute. Hence the decay rate Γ in the limit $\beta_l \to 1$
($s = 4m_q^2 \gg m_l^2$) is given by

$$\Gamma = (\text{flux}) \times \sigma_t = 2\beta_q|\psi(0)|^2\cdot\frac43\sigma_{tot} = \frac{16\pi\alpha^2Q^2}{3m_V^2}|\psi(0)|^2$$

where we have put $s = m_V^2$, $\beta_q \to 0$ (nonrelativistic limit). $\psi(0)$ is the S-state wave
function at the origin whose shape depends on the potential acting on the $q\bar q$ pair.
This rate is smaller than Eq. (14.40) by a factor of three. This is because the quark has three
colors, which shall be explained soon. Three identical diagrams contribute to the
total decay amplitude, therefore the decay rate has to be multiplied by 9 and has to
be divided by 3 because the flux of each color is 1/3 on the average.

Problem 14.10: Performing cyclic replacement starting from

(ud − du)u(↑↓ − ↓↑)↑ = u↑d↓u↑ + d↓u↑u↑ − u↓d↑u↑ − d↑u↓u↑

we obtain

  u↑d↓u↑ + d↓u↑u↑ − u↓d↑u↑ − d↑u↓u↑
+ u↑u↑d↓ + u↑d↓u↑ − u↑u↓d↑ − u↑d↑u↓
+ d↓u↑u↑ + u↑u↑d↓ − d↑u↑u↓ − u↓u↑d↑

= 2u↑u↑d↓ − u↑u↓d↑ − u↓u↑d↑
+ 2u↑d↓u↑ − u↓d↑u↑ − u↑d↑u↓
+ 2d↓u↑u↑ − d↑u↓u↑ − d↑u↑u↓ = Eq. (14.52)
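
The bookkeeping of the cyclic replacement is easy to get wrong by hand, so a small script can serve as a cross-check. This is a sketch only: the labels such as `"u+"` (spin-up u quark) and the helper `cyclic_sum` are hypothetical notation introduced here.

```python
from collections import Counter

def cyclic_sum(terms):
    """Add each term together with its two cyclic rearrangements."""
    total = Counter()
    for coeff, quarks in terms:
        q = tuple(quarks)
        for _ in range(3):
            total[q] += coeff
            q = (q[2], q[0], q[1])  # move the last quark to the front
    return {state: c for state, c in total.items() if c != 0}

# (ud - du) u (up-down - down-up) up, written term by term.
seed = [
    (+1, ("u+", "d-", "u+")),
    (+1, ("d-", "u+", "u+")),
    (-1, ("u-", "d+", "u+")),
    (-1, ("d+", "u-", "u+")),
]

proton = cyclic_sum(seed)
norm_sq = sum(c * c for c in proton.values())

for state, coeff in sorted(proton.items()):
    print(f"{coeff:+d}", " ".join(state))
print("sum of squared coefficients:", norm_sq)
```

The nine surviving terms and their coefficients can be compared line by line with the result above.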

Problem 14.12: If one makes the wave function of the neutron, one sees that it is related
to that of the proton by the permutation u ↔ d. Therefore, the magnetic moment of the
neutron $\mu_n$ can be obtained from $\mu_p$ by the exchange u ↔ d. Magnetic moments of other
baryons having two identical quarks out of three can be obtained in a similar manner.

$$\mu_n = \mu_p(u \leftrightarrow d) , \quad \mu_{\Sigma^+} = \mu_p(d \to s) , \quad \mu_{\Sigma^-} = \mu_n(u \to s)$$
$$\mu_{\Xi^0} = \mu_n(d \to s) , \quad \mu_{\Xi^-} = \mu_p(u \to s)$$

As $\Omega^-$ is made of sss and has spin s = 3/2, its wave function is given by s↑s↑s↑.
Therefore the magnetic moment is given by $\mu_{\Omega^-} = 3\mu_s$.

The quark components of $\Sigma^0$, Λ are uds and the same for both, but they have
isospin I = 1, 0, hence the isospin wave functions are symmetric (antisymmetric). Then,
by symmetry, the spin wave functions of $\Sigma^0$ ($\Lambda^0$) are also symmetric (antisymmetric).
The structure of the spin wave function of $\Sigma^0$ then is the same as that of the
proton in Eq. (14.52) and that of Λ is antisymmetric and has the same structure
as Problem 14.10. With the spins listed in the quark order (u, d, s),

|Σ⁰⟩ = (1/√6){2↑↑↓ − (↑↓ + ↓↑)↑} ,  |Λ⁰⟩ = (1/√2)(↑↓ − ↓↑)↑

As a preparation for calculating $\mu_{\Sigma\Lambda}$, let us use a method different from that given in the
text. The Hamiltonian for the magnetic moment in the quark model is given by

$$H = \mu_u\sigma_z(u) + \mu_d\sigma_z(d) + \mu_s\sigma_z(s)$$

$\sigma_z(u)$ means that it acts only on the u quark. Since $\sigma_z$↓ = −↓, we have

H|Σ⁰⟩ = (1/√6)[μ_u{2↑↑↓ − (↑↓ − ↓↑)↑}
             + μ_d{2↑↑↓ − (−↑↓ + ↓↑)↑}
             + μ_s{−2↑↑↓ − (↑↓ + ↓↑)↑}]    (I.14)

By multiplying ⟨Σ⁰| from the left and calculating the expectation value,

$$\mu_{\Sigma^0} = \frac46\mu_u + \frac46\mu_d + \left(-\frac46 + \frac16 + \frac16\right)\mu_s = \frac23(\mu_u + \mu_d) - \frac13\mu_s$$

Similarly, multiplying ⟨Λ| by

H|Λ⟩ = (1/√2)[μ_u(↑↓ + ↓↑)↑ − μ_d(↑↓ + ↓↑)↑ + μ_s(↑↓ − ↓↑)↑]

we obtain $\mu_\Lambda = \mu_s$.

Multiplying Eq. (I.14) by ⟨Λ⁰|, we have

$$\mu_{\Sigma\Lambda} = -\frac{1}{\sqrt3}\mu_u + \frac{1}{\sqrt3}\mu_d = \frac{-\mu_u + \mu_d}{\sqrt3}$$
Problem 14.13: The gluon has colors (8 species), hence the color-singlet $c\bar c$ cannot be
converted to a single gluon. As the gluon has the same C-parity (−1) as the photon, $J/\psi$,
having the same quantum numbers as the photon, cannot be converted to 2 gluons. Hence,
three gluons have to be exchanged.
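Problem 14.14:

$$\langle\boldsymbol{s}_i\cdot\boldsymbol{s}_j\rangle = \frac12\left\langle(\boldsymbol{s}_i + \boldsymbol{s}_j)^2 - \boldsymbol{s}_i^2 - \boldsymbol{s}_j^2\right\rangle = \frac12\{S(S+1) - s_i(s_i+1) - s_j(s_j+1)\}$$

$$= \begin{cases} \frac12\{1(1+1) - 2\times\frac12(\frac12+1)\} = \frac14 & S = 1 \\ \frac12\{0(0+1) - 2\times\frac12(\frac12+1)\} = -\frac34 & S = 0 \end{cases}$$

$$\sum_{i<j}\langle\boldsymbol{s}_i\cdot\boldsymbol{s}_j\rangle = \frac12\left\langle(\boldsymbol{s}_1 + \boldsymbol{s}_2 + \boldsymbol{s}_3)^2 - \boldsymbol{s}_1^2 - \boldsymbol{s}_2^2 - \boldsymbol{s}_3^2\right\rangle$$

$$= \begin{cases} \frac12\{\frac32(\frac32+1) - 3\times\frac12(\frac12+1)\} = \frac34 & S = \frac32 \\ \frac12\{\frac12(\frac12+1) - 3\times\frac12(\frac12+1)\} = -\frac34 & S = \frac12 \end{cases}$$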
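
The four values follow from the same one-line formula, which can be sketched with exact arithmetic. The function names below are illustrative only.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

def casimir(spin):
    """Eigenvalue s(s+1) of the squared spin operator."""
    return spin * (spin + 1)

def pair_product(total_spin):
    """<s_i . s_j> for two spin-1/2 quarks coupled to the given total spin."""
    return HALF * (casimir(total_spin) - 2 * casimir(HALF))

def triple_sum(total_spin):
    """Sum over pairs of <s_i . s_j> for three spin-1/2 quarks."""
    return HALF * (casimir(total_spin) - 3 * casimir(HALF))

print("pair,   S = 1   :", pair_product(Fraction(1)))
print("pair,   S = 0   :", pair_product(Fraction(0)))
print("triple, S = 3/2 :", triple_sum(Fraction(3, 2)))
print("triple, S = 1/2 :", triple_sum(Fraction(1, 2)))
```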



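Problem 14.15:

$$m = \sum_i m_i + B\sum_{i<j}\frac{\boldsymbol{s}_i\cdot\boldsymbol{s}_j}{m_im_j}$$

As Δ consists of u, d and has spin s = 3/2, the first term gives $3m_u$. Using
Eq. (14.98b) the second term becomes $\frac34\frac{1}{m_u^2}$. Therefore,

$$m(\Delta) = 3m_u + B\left(\frac{3}{4m_u^2}\right)$$

The equation for N can be obtained from the above formula by changing
$\sum(\boldsymbol{s}_i\cdot\boldsymbol{s}_j) \to -3/4$ (Eq. (14.98b)), $m(N) = 3m_u + B\left(-\frac{3}{4m_u^2}\right)$.
As $\Xi^*$ consists of u or d + 2s and has spin s = 3/2,

$$M = m_u + 2m_s + B\left\{\frac{\boldsymbol{s}_s\cdot\boldsymbol{s}_s}{m_s^2} + \frac{2\,\boldsymbol{s}_u\cdot\boldsymbol{s}_s}{m_um_s}\right\}$$

Any quark pair makes spin s = 1, $(\boldsymbol{s}_s\cdot\boldsymbol{s}_s) = (\boldsymbol{s}_u\cdot\boldsymbol{s}_s) = 1/4$. Therefore,

$$m(\Xi^*) = m_u + 2m_s + \frac{B}{4}\left(\frac{1}{m_s^2} + \frac{2}{m_um_s}\right)$$

Ξ has spin s = 1/2, but as the pair ss is symmetric, so is the spin wave function,
hence the spin of the pair has to be s = 1.

$$2\,\boldsymbol{s}_u\cdot\boldsymbol{s}_s = \frac12\left\{(\boldsymbol{s}_u + \boldsymbol{s}_s + \boldsymbol{s}_s)^2 - \boldsymbol{s}_u^2 - (\boldsymbol{s}_s + \boldsymbol{s}_s)^2\right\} = -1$$

$$\therefore\ m(\Xi) = m_u + 2m_s + B\left(\frac{1}{4m_s^2} - \frac{1}{m_um_s}\right)$$

As Ω is made of sss, the mass can be obtained from that of uuu = $\Delta^{++}$ by
replacing u → s.

$$\therefore\ m(\Omega) = 3m_s + B\left(\frac{3}{4m_s^2}\right)$$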

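Problem 14.16:

$$\phi = s\bar s ,\ S = 1 \;\to\; m_\phi = 2m_s + \frac{C}{4m_s^2}$$
$$K^* = s\bar u ,\ S = 1 \;\to\; m_{K^*} = m_u + m_s + \frac{C}{4m_um_s}$$
$$\omega = (u\bar u + d\bar d)/\sqrt2 ,\ S = 1 \;\to\; m_\omega = 2m_u + \frac{C}{4m_u^2}$$
$$\rho ,\ S = 1 \;\to\; m_\rho = m_\omega$$

As $\eta = (u\bar u + d\bar d - 2s\bar s)/\sqrt6$, it has mass $m_\pi$ with probability 1/3 and has the mass of
a particle with $s\bar s$, S = 0 with probability 2/3. Since $\boldsymbol{s}\cdot\boldsymbol{s} = -3/4$ for S = 0,

$$m_\eta = \frac13m_\pi + \frac23\left(2m_s - \frac{3C}{4m_s^2}\right) = \frac23(m_u + 2m_s) - \frac{C}{4}\left(\frac{1}{m_u^2} + \frac{2}{m_s^2}\right)$$

$$\pi ,\ S = 0 \;\to\; m_\pi = 2m_u - \frac{3C}{4m_u^2}$$

Chapter 15: Weak Interaction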

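Problem 15.1: From Eq. (15.23), the most general Lagrangian for the beta decay can
be expressed as

$$\mathcal{L} = G_\beta\sum_i(\bar p\Gamma_in)\left[C_i(\bar e\Gamma^i\nu) + C_i'(\bar e\Gamma^i\gamma^5\nu)\right] + \text{h.c.}$$
$$\Gamma_i = 1\,(S),\ \gamma^\mu\,(V),\ \sigma^{\mu\nu}\,(T),\ \gamma^\mu\gamma^5\,(A),\ i\gamma^5\,(P)$$
$$\Gamma^i = 1\,(S),\ \gamma_\mu\,(V),\ \sigma_{\mu\nu}\,(T),\ \gamma_\mu\gamma^5\,(A),\ i\gamma^5\,(P)$$
$$\text{h.c.} = G_\beta\sum_i(n^\dagger\Gamma_i^\dagger\gamma^0p)\left[C_i^*(\nu^\dagger\Gamma^{i\dagger}\gamma^0e) + (C_i')^*(\nu^\dagger\gamma^{5\dagger}\Gamma^{i\dagger}\gamma^0e)\right]$$

Using the relations $\gamma^0\Gamma_i^\dagger\gamma^0 = \Gamma_i$, $\gamma^0\gamma^5\gamma^0 = -\gamma^5$,
the hermitian conjugate becomes

$$\text{h.c.} = G_\beta\sum_i(\bar n\Gamma_ip)\left[C_i^*(\bar\nu\Gamma^ie) - (C_i')^*(\bar\nu\gamma^5\Gamma^ie)\right] \qquad (\text{h.c. of the first term})$$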

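P: P transformation changes (see Appendix F)

$$\bar\psi_1\Gamma_i\psi_2 \to \bar\psi_1\gamma^0\Gamma_i\gamma^0\psi_2 = \epsilon_{Pi}\,\bar\psi_1\Gamma^i\psi_2$$

$$\bar\psi_1\Gamma_i\gamma^5\psi_2 \to \bar\psi_1\gamma^0\Gamma_i\gamma^0\gamma^0\gamma^5\gamma^0\psi_2 = -\epsilon_{Pi}\,\bar\psi_1\Gamma^i\gamma^5\psi_2 ,$$

$\epsilon_{Pi}$ denotes the sign change depending on the transformation property of $\Gamma_i$. For instance,

$$\epsilon_{Pi} = +1 \ \text{for}\ i = S, V, T , \quad -1 \ \text{for}\ A, P .$$

Since $\epsilon_{Pi}(\bar p\Gamma^in)\,\epsilon_{Pi}(\bar e\Gamma_i\nu) = (\bar p\Gamma^in)(\bar e\Gamma_i\nu)$, we have

$$(\bar p\Gamma_in)\left[C_i(\bar e\Gamma^i\nu) + C_i'(\bar e\Gamma^i\gamma^5\nu)\right] \to (\epsilon_{Pi}\,\bar p\Gamma^in)\left[C_i(\epsilon_{Pi}\,\bar e\Gamma_i\nu) - C_i'(\epsilon_{Pi}\,\bar e\Gamma_i\gamma^5\nu)\right]$$
$$= (\bar p\Gamma^in)\left[C_i(\bar e\Gamma_i\nu) - C_i'(\bar e\Gamma_i\gamma^5\nu)\right]$$

Namely, the sign change $\epsilon_{Pi}$ due to different $\Gamma_i$ and the exchange operation $\Gamma_i \leftrightarrow \Gamma^i$
do not make any difference in the final Lagrangian. Only the sign change by $\gamma^5 \to
\gamma^0\gamma^5\gamma^0 = -\gamma^5$ in the second term in [···] transforms $C_i' \to -C_i'$. Therefore, we
have shown that P invariance means $C_i' = 0$.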
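T: T transformation changes

$$C_i\bar\psi_1\Gamma_i\psi_2 \to C_i^*\bar\psi_1B^{-1}\Gamma_i^*B\psi_2 = C_i^*\epsilon_{Ti}\,\bar\psi_1\Gamma^i\psi_2$$

$$\therefore\ (\bar p\Gamma_in)\left[C_i(\bar e\Gamma^i\nu) + C_i'(\bar e\Gamma^i\gamma^5\nu)\right] \to (\bar pB^{-1}(\Gamma_i)^*Bn)$$
$$\times\left[C_i^*(\bar eB^{-1}\Gamma^{i*}B\nu) + (C_i')^*(\bar eB^{-1}\Gamma^{i*}BB^{-1}(\gamma^5)^*B\nu)\right]$$

Inserting $B^{-1}\gamma^{\mu*}B = \gamma_\mu$, $B^{-1}\gamma^{5*}B = \gamma^5$, the above equation becomes

$$= (\epsilon_{Ti}\,\bar p\Gamma^in)\left[C_i^*(\epsilon_{Ti}\,\bar e\Gamma_i\nu) + (C_i')^*(\epsilon_{Ti}\,\bar e\Gamma_i\gamma^5\nu)\right]$$
$$= (\bar p\Gamma^in)\left[C_i^*(\bar e\Gamma_i\nu) + (C_i')^*(\bar e\Gamma_i\gamma^5\nu)\right]$$

This shows that T invariance means $C_i = C_i^*$, $C_i' = (C_i')^*$.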
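C: C transformation changes

$$C_i\bar\psi_1\Gamma_i\psi_2 \to -C_i\psi_1^TC^{-1}\Gamma_iC\bar\psi_2^T = -C_i\epsilon_{Ci}\,\psi_1^T\Gamma_i^T\bar\psi_2^T$$

$$\therefore\ (\bar p\Gamma_in)\left[C_i(\bar e\Gamma^i\nu) + C_i'(\bar e\Gamma^i\gamma^5\nu)\right] \to (-p^TC^{-1}\Gamma_iC\bar n^T)$$
$$\times\left[-C_i(e^TC^{-1}\Gamma^iC\bar\nu^T) - C_i'(e^TC^{-1}\Gamma^iCC^{-1}\gamma^5C\bar\nu^T)\right]$$

Considering $C = i\gamma^2\gamma^0$, $C^{-1}\Gamma_iCC^{-1}\gamma^5C = \epsilon_{Ci}\Gamma_i^T(\gamma^5)^T$,
the above equation becomes

$$= (\epsilon_{Ci}\,\bar n\Gamma_ip)\left[C_i(\epsilon_{Ci}\,\bar\nu\Gamma^ie) + C_i'(\epsilon_{Ci}\,\bar\nu\gamma^5\Gamma^ie)\right]$$
$$= (\bar n\Gamma_ip)\left[C_i(\bar\nu\Gamma^ie) + C_i'(\bar\nu\gamma^5\Gamma^ie)\right]$$

Comparing with Eq. (h.c. of the first term), it means $C_i = C_i^*$, $C_i' = -(C_i')^*$.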
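Problem 15.2: Inserting Eq. (15.25c) into Eq. (15.18) in the text

$$d\Gamma = G_\beta^2\left(C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2\right)(1 + v\cos\theta)\,\frac{1}{(2\pi)^5}\,pE(\Delta - E)^2\,dE\,d\Omega_e\,d\Omega_\nu$$

Using (with the electron mass neglected, p = E)

$$\int(1 + v\cos\theta)\,d\Omega_e\,d\Omega_\nu = 16\pi^2 , \quad \int_0^\Delta E^2(\Delta - E)^2\,dE = \frac{\Delta^5}{30}$$

we obtain

$$\Gamma = \frac{G_\beta^2\Delta^5}{60\pi^3}\left(C_F^2|\langle1\rangle|^2 + C_{GT}^2|\langle\sigma\rangle|^2\right)$$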



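Problem 15.3: The only variables that are not integrated in the integrand are $\Delta =
p_1 - p_4 = p_2 + p_3$. Lorentz invariance requires that the integral is a tensor of rank 2
and can be put in the form

$$S^{\mu\nu} = Ag^{\mu\nu} + B\Delta^\mu\Delta^\nu$$

In order to obtain the coefficients A, B, we calculate $C = S^{\mu\nu}g_{\mu\nu}$, $D = S^{\mu\nu}\Delta_\mu\Delta_\nu$.

$$p_2^\mu p_3^\nu g_{\mu\nu} = p_2\cdot p_3 = \frac12(\Delta^2 - p_2^2 - p_3^2) = \frac{\Delta^2}{2}$$

$$p_2^\mu p_3^\nu\Delta_\mu\Delta_\nu = (p_2\cdot\Delta)(p_3\cdot\Delta) = (p_2\cdot p_3)^2 = \frac{\Delta^4}{4}$$

In the CM of the particles 2 and 3, $(E_2 + E_3) = 2p$. Then
$\int\frac{d^3p_2}{2E_2}\frac{d^3p_3}{2E_3}\,\delta^4(\Delta - p_2 - p_3) = \frac{\pi}{2}$, so that

$$(Ag^{\mu\nu} + B\Delta^\mu\Delta^\nu)g_{\mu\nu} = 4A + B\Delta^2 = \frac{\pi\Delta^2}{4}$$

$$(Ag^{\mu\nu} + B\Delta^\mu\Delta^\nu)\Delta_\mu\Delta_\nu = (A + B\Delta^2)\Delta^2 = \frac{\pi\Delta^4}{8}$$

$$\therefore\ A = \frac{\pi\Delta^2}{24} , \quad B = \frac{\pi}{12}$$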

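Problem 15.4: Denoting the energy momentum of the muon in its rest frame as $(m_\mu, \boldsymbol{0})$
and that of the electron as $(E, \boldsymbol{p})$, Eq. (15.82) is written as

$$d\Gamma = \frac{G_\mu^2}{2m_\mu\pi^5}\left(p_{1\mu}S^{\mu\nu}p_{4\nu}\right)\frac{d^3p_4}{E_4} \qquad (I.15)$$

$$p_{1\mu}S^{\mu\nu}p_{4\nu} = \frac{\pi}{24}\left\{(p_1\cdot p_4)\Delta^2 + 2(p_1\cdot\Delta)(p_4\cdot\Delta)\right\}$$

In the rest frame of the muon,

$$p_1\cdot p_4 = m_\mu E , \quad \Delta^2 = (m_\mu - E)^2 - p^2 = m_\mu^2 - 2m_\mu E + m_e^2$$
$$p_1\cdot\Delta = p_1^2 - p_1\cdot p_4 = m_\mu^2 - m_\mu E ,$$
$$p_4\cdot\Delta = p_1\cdot p_4 - p_4^2 = m_\mu E - m_e^2$$

Substituting the above relations in Eq. (I.15)

$$d\Gamma = \frac{G_\mu^2}{48\pi^4}\left\{3m_\mu^2E - 4m_\mu E^2 + 3m_e^2E - 2m_\mu m_e^2\right\}\frac{d^3p_4}{E_4} \qquad (I.16)$$

Putting $m_e^2 = 0$, $p = E = \frac{m_\mu}{2}x$, $\frac{d^3p_4}{E_4} = 4\pi p\,dp$, we finally obtain

$$d\Gamma = \frac{G_\mu^2m_\mu^5}{96\pi^3}\,x^2(3 - 2x)\,dx$$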
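
As a numerical illustration of the spectrum $x^2(3 - 2x)$, the sketch below integrates it over $0 \le x \le 1$ and converts the resulting width into a lifetime. The Fermi constant and the reduced Planck constant are the values of Table K.1; the muon mass (105.658 MeV) is an assumed input that is not listed in this appendix, and radiative corrections are ignored.

```python
import math

G_F = 1.16637e-5     # GeV^-2, Fermi coupling constant (Table K.1)
HBAR = 6.582119e-25  # GeV s, reduced Planck constant (Table K.1, converted from MeV s)
M_MU = 0.105658      # GeV, muon mass: assumed value, not listed in this appendix

def spectrum(x):
    """Electron energy spectrum x^2 (3 - 2x) for a massless electron."""
    return x * x * (3.0 - 2.0 * x)

def simpson(f, a, b, n=1000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

integral = simpson(spectrum, 0.0, 1.0)
width = G_F ** 2 * M_MU ** 5 / (96.0 * math.pi ** 3) * integral  # GeV
lifetime = HBAR / width                                          # seconds

print("integral of x^2 (3 - 2x):", integral)
print("lowest-order muon lifetime in seconds:", lifetime)
```

The integral equals 1/2, so the total width reduces to $G_\mu^2m_\mu^5/(192\pi^3)$, and the lifetime comes out at about two microseconds.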
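Problem 15.5: We insert the following relations in Eq. (I.16).

$$x = \frac{2p}{m_\mu} , \quad E = \frac{m_\mu}{2}\sqrt{x^2 + 4\omega} , \quad \omega = \left(\frac{m_e}{m_\mu}\right)^2$$

$$d\Gamma = \frac{G_\mu^2m_\mu^5}{96\pi^3}\left\{3(1 + \omega)\sqrt{x^2 + 4\omega} - 2(x^2 + 4\omega) - 4\omega\right\}\frac{x^2\,dx}{\sqrt{x^2 + 4\omega}}$$

By making use of the equalities

$$\int\frac{dx}{\sqrt{x^2 + c}} = \log\left|x + \sqrt{x^2 + c}\right|$$

$$\int\sqrt{x^2 + c}\,dx = \frac12\left\{x\sqrt{x^2 + c} + c\log\left|x + \sqrt{x^2 + c}\right|\right\}$$

$$\int x^2\sqrt{x^2 + c}\,dx = \frac18\left\{2x(x^2 + c)^{3/2} - cx\sqrt{x^2 + c} - c^2\log\left|x + \sqrt{x^2 + c}\right|\right\}$$

we obtain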



Problem 15.7: We start from Eq. (15.114a) 

M(jT + ^ Jl \+ V) = ^|V2(fe'" + V)u(p c )y fl (l - y')v(p v ) 



' V2 



Inserting k' fl + k fl — (m 



n x°)(> f i0 



|(fc'" + fc'")u(p e )y„(l-y 5 )v(p„)| 2 

= (m,+ + m,o) 2 <5„ d FO Tr[(y e + m e )y"(l - y 5 ) /„y"(l - y 5 ) 
= 2(m M+ + m^) 2 d fl0 d r() rr[^y" / v y*] 
= 8(m n+ + m^) 2 [2E e E v -(p e -p v )] 
= S(m„+ + m„«) 2 E e E v (l + v e cos 6) 

r = (2jz)*d\A-p e -p v )^— 

d i p„ d 3 p e d 3 p v 



<£m<i- 



(Ijiflco (2jt) i 2E e (2uf2E v 

— / <5(zl - £ e - £ F )£ e £„(l + v 



^ 8(m jr+ + 
" (2jt) 5 16 

pjdpcdQ,, p\dp v dQ v 



= j ' 4 »^ jp c E e( A-E e fdE 
_G / ^ 5 (l-Zl/(2m jr+ )) 2 

$$R(x) = 30\int_x^1 dy\,y(1 - y)^2\sqrt{y^2 - x^2} = \sqrt{1 - x^2}\left\{1 - \frac92x^2 - 4x^4\right\} + \frac{15}{2}x^4\ln\frac{1 + \sqrt{1 - x^2}}{x}$$
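
The closed form of R(x) can be compared with a direct numerical integration. The sketch below is an illustration only: it substitutes $y = \sqrt{x^2 + t^2}$ so that the integrand becomes the smooth function $(1 - y)^2t^2$, and the function names are hypothetical.

```python
import math

def r_closed(x):
    """Closed form of R(x) for 0 < x < 1."""
    root = math.sqrt(1.0 - x * x)
    return (root * (1.0 - 4.5 * x ** 2 - 4.0 * x ** 4)
            + 7.5 * x ** 4 * math.log((1.0 + root) / x))

def r_numeric(x, n=2000):
    """30 * integral of y (1-y)^2 sqrt(y^2 - x^2) dy from x to 1, by Simpson's rule.

    With y = sqrt(x^2 + t^2) the integrand becomes (1 - y)^2 t^2 dt,
    where t runs from 0 to sqrt(1 - x^2).
    """
    t_max = math.sqrt(1.0 - x * x)

    def integrand(t):
        y = math.sqrt(x * x + t * t)
        return (1.0 - y) ** 2 * t * t

    h = t_max / n
    total = integrand(0.0) + integrand(t_max)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return 30.0 * total * h / 3.0

for x in (0.1, 0.5, 0.9):
    print(x, r_closed(x), r_numeric(x))
```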



The cos θ term vanishes when integrated over the solid angle. To calculate R(x), use

$$\int dy\,y\sqrt{y^2 - 1} = \frac13(y^2 - 1)^{3/2}$$

$$\int dy\,y^2\sqrt{y^2 - 1} = \frac18\left[2y(y^2 - 1)^{3/2} + y(y^2 - 1)^{1/2} - \ln\left|y + \sqrt{y^2 - 1}\right|\right]$$

$$\int dy\,y^3\sqrt{y^2 - 1} = \frac15(y^2 - 1)^{5/2} + \frac13(y^2 - 1)^{3/2}$$

Problem 15.8: As π is a boson having $J^P = 0^-$, its total wave function is symmetric.
In order to have J = 0 with two pions, the orbital angular momentum has to
be zero, making the spatial part of the wave function symmetric. This means the isospin
part of the wave function has to be symmetric, too. As the I = 1 wave function is
antisymmetric, $\pi^+\pi^0$ can be only in the I = 2 state. Therefore, the ΔI = 1/2 rule forbids
the transition $K^+ \to \pi^+\pi^0$.
Problem 15.9:

1. From the previous problem, $K_S$, $K_L$ can decay only to I = 0. The I = 0 wave
function can be constructed using the Clebsch-Gordan coefficients as

$$|I: 0, 0\rangle = \frac{1}{\sqrt3}\left(\pi^+\pi^- - \pi^0\pi^0 + \pi^-\pi^+\right)$$

and the decay rates to $\pi^+\pi^-$, $\pi^0\pi^0$, $\pi^-\pi^+$ are equal = 1/3. But as observationally
$\pi^+\pi^-$ and $\pi^-\pi^+$ cannot be distinguished, the decay rate to $\pi^+\pi^-$ is twice
that to $\pi^0\pi^0$. Note that the decay $K_L \to 2\pi$ does not occur if CP is invariant.

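2. We assume that the spurion s has I = 1/2, $I_3 = -1/2$. Combinations with the
spurion are

$$|s\Xi^-\rangle = |1, -1\rangle , \quad |s\Xi^0\rangle = \frac{1}{\sqrt2}\left(|1, 0\rangle + |0, 0\rangle\right)$$

$$\langle s\Xi^-|T|\Lambda\pi^-\rangle = \langle1, -1|T|1, -1\rangle = T_1$$
$$\langle s\Xi^0|T|\Lambda\pi^0\rangle = \frac{1}{\sqrt2}\left(\langle1, 0| + \langle0, 0|\right)T|1, 0\rangle = \frac{1}{\sqrt2}T_1$$

$$\therefore\ \frac{\Gamma(\Xi^- \to \Lambda\pi^-)}{\Gamma(\Xi^0 \to \Lambda\pi^0)} = \left|\frac{T(\Xi^- \to \Lambda\pi^-)}{T(\Xi^0 \to \Lambda\pi^0)}\right|^2 = 2$$

For the Σ decays, with $T_3$ and $T_1$ denoting the I = 3/2 and I = 1/2 amplitudes,

$$|s\Sigma^+\rangle = \sqrt{\tfrac13}\,\left|\tfrac32, \tfrac12\right\rangle + \sqrt{\tfrac23}\,\left|\tfrac12, \tfrac12\right\rangle , \quad |n\pi^+\rangle = \sqrt{\tfrac13}\,\left|\tfrac32, \tfrac12\right\rangle + \sqrt{\tfrac23}\,\left|\tfrac12, \tfrac12\right\rangle$$

$$|s\Sigma^-\rangle = |n\pi^-\rangle = \left|\tfrac32, -\tfrac32\right\rangle , \quad |p\pi^0\rangle = \sqrt{\tfrac23}\,\left|\tfrac32, \tfrac12\right\rangle - \sqrt{\tfrac13}\,\left|\tfrac12, \tfrac12\right\rangle$$

From the above relations

$$a_+ = \langle\Sigma^+|T|n\pi^+\rangle = \frac13T_3 + \frac23T_1$$
$$a_- = \langle\Sigma^-|T|n\pi^-\rangle = T_3$$
$$a_0 = \langle\Sigma^+|T|p\pi^0\rangle = \frac{\sqrt2}{3}(T_3 - T_1)$$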

Problem 15.10: From Eqs. (15.132), we can calculate the transition amplitude and 
decay rate as 

m = ^if+( k K, + M + /-(**. - k»/.)}M(P»)y"(i - y s )v(p.) 

dr = ^- hM|_ dLIps = (2^) 4 a 4 (kjc - k„ - j» e - j»v) 

spin K 

Y^ \M\ 2 d i k yl d i p e d i p v 
^ 2m x (2n:) s 2« T 2£' e 2£' F 

spin > ' 

$$k_K^\mu + k_\pi^\mu = 2k_K^\mu - (k_K^\mu - k_\pi^\mu) = 2k_K^\mu - (p_e^\mu + p_\nu^\mu)$$

The second term becomes proportional to the lepton mass $m_e$ by operating on the
leptonic current. We set $m_e = 0$ from now on.

£ |^w| 2 = 2G 2 |/+| 2 fc K/ ,fc K „ £ |u(p»)y"(i + y V(p e )l 2 
=0= i6G 2 |/ + | 2 fe K/ ,fe KF { P ^;: + P >f - g ""( Pe ■ Pv ) 

+ i£ f,Vp °p cp p va } 

= 16G S 2 | f+\ 2 {2(k K ■ p e )(k K ■ p v ) - m\(p e ■ p v )} (1.17) 

.-. dr = (2jr) 4 <5 4 (fc K - fe., - p e - p„) Gsl -^ +l 

, d^krrd 1 p r d 3 p v 

x 8{2(fe K • p e )(fe K • p v ) - mltp,, ■ p v )\ 1 — — F (1.18) 

1 l F n F ' KKF F " (2jzf2oj2E e 2E v { ' 

To calculate energy spectrum of the electron, one decomposes dF as follows. 
f a.. a. , „ d i k 7l d' i p v 

V= (k K -"p e ) 2 = m K (mx-2£ e ) ' £max = 2m, * 
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Substituting 

{2(k K ■ p e )k Ku - m 2 K p c/l }(k K ~ p t y = k 2 K (k K ■ p.) - 2(k K ■ p t ) 
= m\(m K -2E e ) 
inEq. (1.16), we have 

dr _ Gf\f+\ 2 m £2 (£max-£ e ) 2 
dE c 2jt 3 K c m K -2E e 

To obtain n spectrum, we rewrite Eq. (1.15) as follows. 

= 4m 2 K (oj 2 - ml) = 4m 2 K kl 
where q = fe^ — fe a - 



In order to calculate the total decay rate, we integrate the above formula over energy. 

Noting 

m k m„ 

(o mm = m„ = — 2y , y = — , 

^(i + y 2 ) 



2m K 

, G S 2 U- 



+ l 2 5 f 1+y ( / 2 Try , G 2 |/+| 2 /m 2 \ 

$$f(y) = 1 - 8y + 8y^3 - y^4 - 12y^2\log y$$
In deriving the result, we have used

$$\int(x^2 + C)^{3/2}\,dx = \frac18\left[2x(x^2 + C)^{3/2} + 3Cx(x^2 + C)^{1/2} + 3C^2\log\left|x + \sqrt{x^2 + C}\right|\right]$$

Chapter 16: Neutral Kaons 

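Problem 16.1: Substituting $p|K^0\rangle + q|\bar K^0\rangle = \psi$ in Eq. (16.10),
solutions can be obtained by solving the secular equations.

$$(\Lambda_{11} - \lambda)p + \Lambda_{12}q = 0 , \qquad \Lambda_{21}p + (\Lambda_{22} - \lambda)q = 0$$

From the above equation, we get

$$\frac{q}{p} = \frac{\lambda - \Lambda_{11}}{\Lambda_{12}} = \frac{\Lambda_{21}}{\lambda - \Lambda_{22}}$$

$$\lambda^2 - (\Lambda_{11} + \Lambda_{22})\lambda + \Lambda_{11}\Lambda_{22} - \Lambda_{12}\Lambda_{21} = 0$$

$$\lambda_S + \lambda_L = \Lambda_{11} + \Lambda_{22} , \qquad \lambda_S\lambda_L = \Lambda_{11}\Lambda_{22} - \Lambda_{12}\Lambda_{21}$$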
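
The secular equation is an ordinary quadratic, so its roots and the ratio q/p can be evaluated directly. The following is a minimal sketch with hypothetical complex matrix elements chosen only for illustration (they are not kaon data); which root is called $\lambda_S$ or $\lambda_L$ is left open.

```python
import cmath

def eigenvalues(l11, l12, l21, l22):
    """Roots of lambda^2 - (l11 + l22) lambda + l11 l22 - l12 l21 = 0."""
    mean = (l11 + l22) / 2
    root = cmath.sqrt(((l11 - l22) / 2) ** 2 + l12 * l21)
    return mean + root, mean - root

def mixing_ratio(lam, l11, l12):
    """q/p from the first secular equation."""
    return (lam - l11) / l12

# Hypothetical matrix elements in arbitrary units.
L11, L12, L21, L22 = 1.0 - 0.5j, 0.3 + 0.1j, 0.3 - 0.2j, 1.0 - 0.5j

lam_plus, lam_minus = eigenvalues(L11, L12, L21, L22)
trace_error = abs((lam_plus + lam_minus) - (L11 + L22))
det_error = abs(lam_plus * lam_minus - (L11 * L22 - L12 * L21))
ratio_error = abs(mixing_ratio(lam_plus, L11, L12) - L21 / (lam_plus - L22))

print("eigenvalues:", lam_plus, lam_minus)
print("q/p for the first root:", mixing_ratio(lam_plus, L11, L12))
```

The three error variables check the sum rule, the product rule and the equality of the two expressions for q/p.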



Problem 16.2: Solutions to x \ A 2 are give by 



m- 



M\ 

\K 2 )-- 



V2L-1J 

Then to first order in d, the following equality is also true. 
[M + 8A 



j„m + -iH»^m + -i] 



is a solution in matter with eigenvalue M + A/2. Similarly 

\K 2M ) = -^ [liJ d ] = l K 2 > -<*|Ki) 

is also a solution with eigenvalue M — A/2. Note, as is show: 
represents (A s + A L )/2 - *£-{/ + J). 
Problem 16.3: Use Eq. (16.13) in the text.
U can be obtained as follows. Using Eqs. (16.65), $\psi = \alpha|K^0\rangle + \beta|\bar K^0\rangle$ can be
reexpressed in the $|K_S\rangle$, $|K_L\rangle$ base as

$$\psi = \alpha'|K_S\rangle + \beta'|K_L\rangle$$

= a'(pVT^\K°) + qVm|"K°» 

+ p'(pVT+^\K°) - qVT^\K°)) 
= (a'pVT^ + l3' P VTT^)\K ) + (ct'qVTT^-qVT^)\K } 
= a|K°>+/5|K°> 



U = 



rwi- 



U~ 






The rest of the calculations is straightforward. 

Problem 16.5: From Eq. (16.52c), CP violation means there is a small imaginary 
part in M* 2 /\ 2 . .'. M* 2 r 12 = M 12 r 12 e lA $ A § <£ 1 where M 12 , -Ti 2 are signed 
real numbers. Writing Mi 2 = M u e'^ M , 7\ 2 = r 12 e^ M e 1,d ^ 

^12^21 = [(M 12 - ir 12 /2) (M* - ir*/2)] 1/2 
= M 12 -iT u /2 + 0(A% 2 ) 
.'. ^A = -2v%2^2i - -2 (m 12 - iA 2 /2) -> (16.81d) 

Problem 16.6: Use Eq. (16.79d) and Eq. (16.80d). 
Problem 16.7: Using 



We calculate the decay asymmetry d j. Denoting the final state phase space as p, 

6 _ r<K° -> f) - r(K° -> f) _ r(f -> /) - r(K° -> /) 
7 r("K° ->-/) + r(K° ->. /) r ( K s -+ /) 

' 1 -((/|r|;«c s ) + (/|T|K L »| P 
(/|t|k l )M 2 



2p 



1- 



arnxs); 



li-'?/! 2 



4|q| 2 4|j 

~ 2(Ree - Rer] f ) 



v/r \pr-\g\ 
Mp\ 2 \q\ 2 



(f\T\K L ) \ 

(f\T\K s )J 

' 2Re ?// 

4\p\ 2 \q\ 2 
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Now consider | /) = jr+jr - ) or yr°7r°). Referring to Eq. (16.120), we have 

7+- = £o + e' , e - 2c' 
.-. 8^a„o = 2Re(e - ?7oo) = 4Ree' , <3 + _ = 2Re(e - 7 + _) = -2e' 



Problem 16.8 



I (A S / - A Lf )/2q I = |j» 1-7/ 1 
| (A S / + A L/ )/2ff | | q 1 + v/l 



= 2Re(7 + _ - 700) = 6Ree' 



Chapter 17: Hadron Structure 
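Problem 17.1:

(17.8): $F(Q^2) = \int d^3r\,\rho(r)\,e^{i\boldsymbol{Q}\cdot\boldsymbol{r}} \Big/ \int d^3r\,\rho(r)$

$$\rho(r) = \rho_0 \ \ (r < R) , \quad \rho = 0 \ \ (r > R)$$

$$\int d^3r\,\rho(r) = 4\pi\rho_0\int_0^R r^2dr = \frac{4\pi\rho_0}{3}R^3$$

$$\int d^3r\,\rho\,e^{i\boldsymbol{Q}\cdot\boldsymbol{r}} = 2\pi\rho_0\int_0^R r^2dr\int_{-1}^{+1}e^{iQrz}dz = \frac{4\pi\rho_0}{Q}\int_0^R r\sin Qr\,dr = \frac{4\pi\rho_0}{Q^3}\left\{\sin QR - QR\cos QR\right\}$$

(17.9): $\rho(r) = \rho_0e^{-r/a}$

$$\int d^3r\,\rho(r) = 4\pi\rho_0\int_0^\infty e^{-r/a}r^2dr = 8\pi\rho_0a^3$$

$$\int d^3r\,\rho_0e^{-r/a}e^{i\boldsymbol{Q}\cdot\boldsymbol{r}} = 2\pi\rho_0\int_0^\infty e^{-r/a}r^2dr\int_{-1}^{+1}e^{iQrz}dz$$

$$= \frac{2\pi\rho_0}{iQ}\left\{\frac{1}{(1/a - iQ)^2} - \frac{1}{(1/a + iQ)^2}\right\} = \frac{8\pi\rho_0a^3}{(1 + a^2Q^2)^2}$$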
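
Both form factors can be checked by integrating the radial formula numerically. This is a sketch under stated assumptions: the radius, range and momentum transfer below are arbitrary test values, the exponential integral is cut off at 60 ranges, and the helper names are hypothetical.

```python
import math

def simpson(f, a, b, n=20000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def form_factor(rho, q, r_max):
    """F(Q) of a spherical density after the angular integration is done."""
    numerator = simpson(lambda r: rho(r) * r * math.sin(q * r) / q, 0.0, r_max)
    norm = simpson(lambda r: rho(r) * r * r, 0.0, r_max)
    return numerator / norm

R, A, Q = 1.2, 0.8, 1.5  # hypothetical radius, range and momentum transfer

uniform_numeric = form_factor(lambda r: 1.0, Q, R)
uniform_exact = 3.0 * (math.sin(Q * R) - Q * R * math.cos(Q * R)) / (Q * R) ** 3

expo_numeric = form_factor(lambda r: math.exp(-r / A), Q, 60.0 * A)
expo_exact = 1.0 / (1.0 + (A * Q) ** 2) ** 2

print("uniform sphere:", uniform_numeric, uniform_exact)
print("exponential   :", expo_numeric, expo_exact)
```

Dividing the two integrals of each case gives $3(\sin QR - QR\cos QR)/(QR)^3$ for the uniform sphere and the dipole form $1/(1 + a^2Q^2)^2$ for the exponential density, which is what the script compares against.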

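Problem 17.2:

$$eJ^\mu(x) = e\,\bar u(p')\left[F_1(q^2)\gamma^\mu + \frac{\kappa}{2M}F_2(q^2)\,i\sigma^{\mu\nu}q_\nu\right]u(p)\,e^{iq\cdot x}$$

We first prove $\partial_\mu J^\mu = 0$. As $\partial_\mu \to iq_\mu$ and $q_\mu\gamma^\mu = \not{p}' - \not{p}$, inserting the Dirac equation
$\bar u(p')\not{p}' = M\bar u(p')$, $\not{p}u(p) = Mu(p)$, the first term vanishes. The second term
$q_\mu\sigma^{\mu\nu}q_\nu$ vanishes also from the different symmetry of $q_\mu q_\nu$ and $\sigma^{\mu\nu}$. If there exists
a term proportional to $q^\mu$, it enters in $J^\mu$ as $F_3q^\mu$. Since $q_\mu q^\mu = q^2$ does not vanish
generally, the current conservation requires $F_3 = 0$. Next, we show that a term like
$(p' + p)^\mu$ can be expressed in terms of $\gamma^\mu$ and $i\sigma^{\mu\nu}q_\nu$. Between $\bar u(p')$ and $u(p)$,

$$i\sigma^{\mu\nu}q_\nu = \frac12\left[-\gamma^\mu(\not{p}' - \not{p}) + (\not{p}' - \not{p})\gamma^\mu\right]$$

$$= \frac12\left[-2(p' + p)^\mu + 2\not{p}'\gamma^\mu + 2\gamma^\mu\not{p}\right]$$
$$= -(p' + p)^\mu + 2M\gamma^\mu$$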
Next, using Eq. (17.15), we calculate /'' A /( . In the nonrelativistic limit, 

tp' + pY ip' + pY'A,, 

2M 2M 



= — A q ■ a + a- (iqx A) = a- B 

where we have used the fact that a goes to zero in the nonrelativistic 1 
no diagonal elements and that iqx A= VxA= B. 
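Problem 17.3: Energy-momentum conservation gives

$$M_X^2 = P_F^2 = (k + p - k')^2 = (k - k')^2 + 2p\cdot(k - k') + p^2 = q^2 + 2M(\omega - \omega') + M^2$$

Using $Q^2 = -q^2$, $\omega - \omega' = \frac{Q^2}{2M} + \frac{M_X^2 - M^2}{2M}$. For elastic scattering,

$$M_X = M , \quad \omega - \omega' = \frac{Q^2}{2M} , \quad Q^2 = -(k - k')^2 = 4\omega\omega'\sin^2\frac{\theta}{2}$$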



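Problem 17.4: Since $q^\mu L_{\mu\nu} = q^\nu L_{\mu\nu} = 0$, only the terms proportional to $g^{\mu\nu}$, $p^\mu p^\nu$ in
$W^{\mu\nu}$ do not vanish.

$$g^{\mu\nu}L_{\mu\nu} = 2\left[2k'\cdot k + 2q^2\right] = 2q^2 = -8\omega\omega'\sin^2\frac{\theta}{2}$$

$$p^\mu p^\nu L_{\mu\nu} = 2\left[2(p\cdot k)(p\cdot k') + \frac{q^2}{2}M^2\right] = 2\left[2M^2\omega\omega' - 2M^2\omega\omega'\sin^2\frac{\theta}{2}\right] = 4M^2\omega\omega'\cos^2\frac{\theta}{2}$$

$$\therefore\ L_{\mu\nu}W^{\mu\nu} = 4\omega\omega'\left[W_2\cos^2\frac{\theta}{2} + 2W_1\sin^2\frac{\theta}{2}\right]$$

Substituting $L_{\mu\nu}W^{\mu\nu}$ in Eq. (17.34a), and using $k\cdot p = M\omega$, $d^3k' = \omega'^2d\omega'd\Omega$

$$d\sigma = \frac{(4\pi\alpha)^2}{Q^4}\,\frac{4\pi M\,L_{\mu\nu}W^{\mu\nu}}{4(k\cdot p)}\,\frac{d^3k'}{2\omega'(2\pi)^3} = \frac{4\alpha^2}{Q^4}\,\omega'^2\left[W_2\cos^2\frac{\theta}{2} + 2W_1\sin^2\frac{\theta}{2}\right]d\omega'd\Omega$$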

Problem 17.5: Comparing Eq. (17.12) with Eq. (17.41) 

Then using Eq. (17.43), we can obtain Eq. (17.42). 

Alternative Method: Insert J>*(p) = u(p')y>'u(p) in Eq. (17.34c) for W v . 

= E »(pV»(P)"(P)y F tV) = Tr[(/- M)y"(y- M)y"] 
(c.io) T „ ,,, „ ,„ a 2 „..! 

Although 

j, = (M;0), p' = p + q= P F = (E F ; P F ) = (M + 0) - «/; fe - fc') 

q term in ]j' does not contribute because of Eq. (17.39). Considering final state 
density £ F = ^^ , etc. 

w „» = i 2 ^ y d 4, + p 1 y « . v* 



4M 2 



= — a(M + v-B F )|-i 
Considering 

£ F = ^/(k-fe') 2 + M 2 = -/c^Tv 2 " 
we have 



- gg) = M 

v £ F 






Q 2 



= 5 ^ 5 4M 2 



Problem 17.6: Using Eq. (17.30) in the text, $x = \frac{Q^2}{2M\nu} \le 1$. For y it is obvious from
Eqs. (17.61b).
Problem 17.7: Using the equality

3cos0 ~ ~M y ' da/~~w 
that can be derived from Eqs. (17.61), we have 



Considering 

co' 2 = co 2 (l-y) 2 
s = 2wM, v 



(i-y) 



M 2 xy 

: s(i-y) ' 



d 2 a 2jt d 2 cr (17.41) y 4a 2 / s 2 \ , 

L s a-y) J 



Q 4 L V 2 y 

Problem 17.8: £^(0) is a Lorentz vector, and satisfies 



,„ £ '<(0) = a( ? .p-<^ 2 )=0 



Namely, it satisfies the condition to be a photon polarization vector. Furthermore, it 
is normalized as e /( (0) e '' (0) = 1 and is time-like. In fact, using variables evaluated 
in the target protons rest frame, 

Q 

" /( = (M; 0,0,0) , (p-q) = Mv, 



We can show it agrees with Eq. (17.77b) in the text. 

Problem 17.9: Using e(0) in the Problem 17.8, W v can be expressed as 

Using £„(±)£"(0) = 0, e fl (±)g" v e v (±)* = -1 

14ji 2 cl -c-^ , , „. „„ An 2 a 
°T = i - r J2 e fl (X)E v (X)*W v = ~ ir W 1 

x=± 
Air 2 a An 2 a ( 1 \ 

K ?y > »y > k v « 2 M 2 J 

Ajz 2 ol \( v 2 \ 1 
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In deriving the last two equalities, we have used 

£«(0)g" r e,(0)* = 1 , a" 2 = p 2 - *^> = M 2 (l + ^J 

Problem 17.10: Denoting the parton mass as /a, and the energy and n 

of the proton in the Breit frame as (E; 0, 0, P), the parton's momentum transfer q 2 

can be calculated as 

P = (p°;p x ,p Y ,xP) , p'= (p°;p x ,p y ,-xP) ; 



P = jpl+P 2 y + X 2 P 2 +V 2 

q 2 = (j/-p) 2 = -4% 2 P 2 , 6^(0) = (1,0,0,0) 

As the electron scatters with the parton elastically, the structure function W 1 ' 
the parton target is proportional to 



I" = 2U" + f 



(See Eq. (17.34b)). Therefore, the photon-partor 
given by 



a L a £„(0)e„(0)* W" v a I 00 = 2 (ip °p'° + i-J 

= 4(p 2 + /* 2 + x 2 P 2 - x 2 P 2 ) = 4(p 2 + fi 2 ) 
°t«\J2 M^M*)* W» v a Ul xx + I") = 2p xP ; + 2p yP ; - q 2 



= 2 2 + V, e = Q 2 , ••• g = 


A(p 2 T +/i 2 ) 

Q 2 


Problem 17.11: 




d 2 a 4a 2 ,, [ -. B 
— = — -a/ 2 W 2 cos 2 -+2W 1 si 


-I] 


We re-write the contents in [• • • ] as follows 





W 2 + tan 2 -Wi = 2tan 2 - 

(l + g) W 2 - W t + {l + 2 (l + g) tan 2 f j \ 
2 (l + |)tan 2 § 
( Wi + £ W L ) 

Here we used W L = (1 + v 2 /Q 2 ) W 2 - Wi. Inserting 
o/ 2 cos 2 (0/2) 1 to' 1 K 
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in Eq. (1.19) 

d 2 a aK en' 1 

__ = />r + a L ), /- = ___ 

Problem 17.12: First we derive the parton approximation for e. According to 
Eqs. (17.61), v ~ 0{s/M), Q 2 ~ O(s), v 2 » £) 2 , 



r v 2 + o 2 


>ir-[ 


+ 2 (M 


2 M 2 xy ' 


sxy 


's(i-y). 


2(1 -y) 








l + (l-y) 2 








1 1 + (1- 


y) 2 






•• l-e y 2 








Next considering ^ = — i , g 


^ = -20)10 


', we have 




^y= a 9(y ' Q2) 

^ r 3(«w',cos0) 


= 2a)'dco'dc 


OS0 , 




d 2 CT jt d 2 
d£) 2 dy ~ ~aP dca'dQ 


n a 2 K 
~ w 7 2jt 2 Q 2 


m'l + (l- 

w y 2 


y) 2 CTr 


Inserting K/<w ~ v/cu = y 








d 2 CT _ « i + (i-y) 2 


dQ 2 , 
or^dy 






Problem 17.13: Use (C.8c). 









Chapter 18: Gauge Theories 

Problem 18.1: Let the field strength inside the solenoid with radius a be $B_0$. Since
the magnetic field is along the z direction, we expect the vector potential in the $\phi =
\tan^{-1}(y/x)$ direction. The magnetic field inside the solenoid can be reproduced by
the potential $\boldsymbol{A}_{in} = \frac{B_0}{2}(-y, x) = \frac12\boldsymbol{B}\times\boldsymbol{r} = \frac{B_0\rho}{2}\boldsymbol{e}_\phi$ where $\rho = \sqrt{x^2 + y^2}$.

The vector potential outside the solenoid should take a form $\nabla f(\boldsymbol{r})$, and we note that
$\nabla\phi$ takes the form

$$\nabla\phi = \left(\frac{-y}{x^2 + y^2}, \frac{x}{x^2 + y^2}, 0\right) = \frac{1}{\rho}\boldsymbol{e}_\phi$$

The constant k can be determined by the condition

$$\oint\boldsymbol{A}_{out}\cdot d\boldsymbol{r} = \int B_n\,dS = B_0\pi a^2$$

As $\boldsymbol{A}_{out} = (k/\rho)\boldsymbol{e}_\phi$, $\oint\boldsymbol{A}\cdot d\boldsymbol{r} = \int_0^{2\pi}\frac{k}{\rho}\rho\,d\phi = 2\pi k = \pi a^2B_0$, $\therefore\ k = \frac12B_0a^2$.
k can also be determined by the condition

$$A_{in}(\rho = a) = A_{out}(\rho = a)$$

Let us consider the function $\phi = \tan^{-1}(y/x)$, which has a cut at y = 0, x < 0 because
of the periodicity of the angle. φ differs by 2π on both sides of the cut.

$$\nabla\phi = \nabla\tan^{-1}\left(\frac{y}{x}\right) = \left(\frac{-y}{x^2 + y^2}, \frac{x}{x^2 + y^2}, 0\right)$$

(x < 0, y > 0), tan φ < 0 ⟹ φ = π at y → +0
(x < 0, y < 0), tan φ > 0 ⟹ φ = −π at y → −0

$$\phi_{x<0,\,y\to+0} - \phi_{x<0,\,y\to-0} = 2\pi$$

By a gauge transformation

$$\boldsymbol{A} \to \boldsymbol{A}' = \boldsymbol{A} + \nabla\chi , \quad \chi = -\frac12B_0a^2\phi(x, y) , \quad \phi(x, y) = \tan^{-1}(y/x)$$

A' can be made zero everywhere outside the solenoid except at the cut, which is called a
Dirac string.
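
The flux condition and the matching at ρ = a can be verified numerically. The sketch below assumes arbitrary test values for $B_0$ and a, and evaluates the line integral around a circle by a midpoint rule; all names are illustrative.

```python
import math

B0, A = 2.0, 0.5      # hypothetical field strength and solenoid radius
K = 0.5 * B0 * A * A  # k = B0 a^2 / 2 from the flux condition

def a_in(x, y):
    """Vector potential inside the solenoid, (B0/2)(-y, x)."""
    return (-0.5 * B0 * y, 0.5 * B0 * x)

def a_out(x, y):
    """Vector potential outside the solenoid, (k/rho) e_phi."""
    rho2 = x * x + y * y
    return (-K * y / rho2, K * x / rho2)

def circulation(field, radius, n=3600):
    """Line integral of the field around a circle centred on the axis."""
    total = 0.0
    dphi = 2.0 * math.pi / n
    for i in range(n):
        phi = (i + 0.5) * dphi
        x, y = radius * math.cos(phi), radius * math.sin(phi)
        ax, ay = field(x, y)
        total += (-ax * math.sin(phi) + ay * math.cos(phi)) * radius * dphi
    return total

flux = B0 * math.pi * A * A
outside = circulation(a_out, 3.0 * A)
inside = circulation(a_in, 0.5 * A)

print("total flux:", flux)
print("circulation outside the solenoid:", outside)
print("circulation at half the radius  :", inside)
```

Outside the solenoid the circulation equals the full flux for any radius, while inside it scales with the enclosed area.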
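Problem 18.7:

$$(D^\mu\phi)^\dagger(D_\mu\phi) \sim \left|\left(-ig_W\boldsymbol{W}_\mu\cdot\boldsymbol{t} - i\frac{g_B}{2}B_\mu Y\right)\phi\right|^2$$

Use $\boldsymbol{t} = \boldsymbol{\tau}/2$, Y = 1 and replace φ with $(0, v/\sqrt2)$:

$$(D^\mu\phi)^\dagger(D_\mu\phi) \sim \left|\frac12\begin{bmatrix} g_WW^0 + g_BB & \sqrt2g_WW^+ \\ \sqrt2g_WW^- & -g_WW^0 + g_BB \end{bmatrix}\begin{bmatrix} 0 \\ v/\sqrt2 \end{bmatrix}\right|^2$$

$$= \frac{v^2}{8}\left[2g_W^2W^-W^+ + (-g_WW^0 + g_BB)^2\right]$$

$$= \frac{g_W^2v^2}{4}W^-W^+ + \frac{(g_W^2 + g_B^2)v^2}{8}Z^0Z^0$$

Considering that there is a factor 2 difference in the mass term between charged
and neutral bosons, namely (mass term in the Lagrangian $= \frac{m^2}{2}\sum_{a=1}^{3}W_a^2 =
m^2(W^-W^+ + W^{0\,2}/2)$), we have $m_W = \frac{g_Wv}{2}$, $m_Z = \frac{\sqrt{g_W^2 + g_B^2}\,v}{2}$.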
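
The two mass formulas imply $m_W/m_Z = g_W/\sqrt{g_W^2 + g_B^2}$, which can be compared with the boson masses and the mixing angle quoted in Table K.1. This is only a sketch: identifying the ratio with $\cos\theta_W$ follows footnote d of that table, and the variable names are hypothetical.

```python
import math

M_W = 80.398       # GeV/c^2, W boson mass (Table K.1)
M_Z = 91.1876      # GeV/c^2, Z boson mass (Table K.1)
SIN2_TABLE = 0.2231  # sin^2(theta_W) as quoted in Table K.1

# m_W / m_Z = g_W / sqrt(g_W^2 + g_B^2), identified with cos(theta_W).
cos_theta = M_W / M_Z
sin2_from_masses = 1.0 - cos_theta ** 2

# Ratio of the two gauge couplings implied by the masses.
g_ratio = math.sqrt(sin2_from_masses / (1.0 - sin2_from_masses))  # g_B / g_W

print("sin^2(theta_W) from the masses:", sin2_from_masses)
print("difference from the table value:", sin2_from_masses - SIN2_TABLE)
print("g_B / g_W:", g_ratio)
```

The value obtained from the masses agrees with the tabulated one at the level of the last quoted digits.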



Appendix J 
Particle Data 



Table J.1 Baryons

Name | Spin | Mass (MeV/c^2) | Lifetime (s) / width (MeV)
p | 1/2 | 938.272 |
n | 1/2 | 939.565 |
Λ | 1/2 | 1115.68 | 2.63 x 10^-10 s
Σ+ | 1/2 | 1189.37 | 8.02 x 10^-11 s
Σ0 | 1/2 | 1192.64 | 7.4 x 10^-20 s
Σ- | 1/2 | 1197.45 | 1.48 x 10^-10 s
Ξ0 | 1/2 | 1314.9 | 2.90 x 10^-10 s
Ξ- | 1/2 | 1321.7 | 1.64 x 10^-10 s
Λc+ | 1/2 | 2286.5 | 2.00 x 10^-13 s
Δ | 3/2 | 1232 | 118 MeV
Σ* | 3/2 | 1385 | 37 MeV
Ξ* | 3/2 | 1533 | 9 MeV
Ω- | 3/2 | 1672 | 8.2 x 10^-11 s



Elementary Particle Physic: 
Copyright © 2010 WILEY-Vi 
ISBN: 978-3-5;- '" 






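Name | Quark contents | Spin | Mass (MeV/c^2) | Lifetime (s) / width | Principal decays
$\pi^\pm$ | $u\bar d$, $d\bar u$ | 0 | 139.570 | 2.60 x 10^-8 s | $\mu\nu_\mu$
$\pi^0$ | $u\bar u$, $d\bar d$ | 0 | 134.977 | 8.4 x 10^-17 s | $\gamma\gamma$
$K^\pm$ | $u\bar s$, $s\bar u$ | 0 | 493.68 | 1.24 x 10^-8 s | $\mu\nu_\mu$, $\pi\pi$, $\pi\pi\pi$
$K^0$, $\bar K^0$ | $d\bar s$, $s\bar d$ | 0 | 497.61 | $K_S$: 8.95 x 10^-11 s; $K_L$: 5.12 x 10^-8 s | $\pi\mu\nu_\mu$, $\pi e\nu_e$, $\pi\pi\pi$
$\eta$ | $u\bar u$, $d\bar d$, $s\bar s$ | 0 | 547.85 | 1.30 keV | $\gamma\gamma$, $\pi\pi\pi$
$\eta'$ | $u\bar u$, $d\bar d$, $s\bar s$ | 0 | 957.66 | 0.20 MeV | $\eta\pi\pi$, $\rho\gamma$
$D^\pm$ | $c\bar d$, $d\bar c$ | 0 | 1869.6 | 1.04 x 10^-12 s | $K\pi\pi$, $K\mu\nu_\mu$, $Ke\nu_e$
$D^0$, $\bar D^0$ | $c\bar u$, $u\bar c$ | 0 | 1864.8 | 4.1 x 10^-13 s | $K\pi\pi$, $K\mu\nu_\mu$, $Ke\nu_e$
$D_s^\pm$ | $c\bar s$, $s\bar c$ | 0 | 1968.5 | 5.0 x 10^-13 s | $\eta\rho$, $\eta'\rho$, $\phi\rho$
$\rho^{\pm,0}$ | $u\bar d$, $d\bar u$, $u\bar u$, $d\bar d$ | 1 | 775.5 | 149 MeV | $\pi\pi$
$K^{*(\pm,0)}$ | $u\bar s$, $d\bar s$, $s\bar d$, $s\bar u$ | 1 | 894 | 50 MeV | $K\pi$
$\omega$ | $u\bar u$, $d\bar d$ | 1 | 782.6 | 8.5 MeV | $\pi\pi\pi$, $\pi\gamma$
$\phi$ | $s\bar s$ | 1 | 1019.5 | 4.3 MeV | $K\bar K$, $\pi\pi\pi$
$J/\psi$ | $c\bar c$ | 1 | 3097 | 93 keV | $e^-e^+$, $\mu^-\mu^+$, $\rho\pi$, $\omega\pi$
$\Upsilon$ | $b\bar b$ | 1 | 9460 | 54 keV | $e^-e^+$, $\mu^-\mu^+$, $\tau^-\tau^+$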



Appendix K 
Constants 



Table K.1 Constants

Quantity                     Symbol                  Value, unit
Speed of light in vacuum     c                       2.997925 × 10^8 m s^-1
Planck constant              h                       6.626069 × 10^-34 J s
Planck constant, reduced     ħ                       1.054572 × 10^-34 J s
                                                     = 6.582119 × 10^-22 MeV s
Conversion constant          ħc                      197.326963 MeV fm (a)
Electron charge magnitude    e                       1.602176 × 10^-19 C
Electron mass                m_e                     0.510999 MeV/c²
Proton mass                  m_p                     938.272013 MeV/c²
                                                     = 1.672622 × 10^-27 kg
Neutron mass                 m_n                     939.56536 MeV/c²
Avogadro constant            N_A                     6.022142 × 10^23 mol^-1
Boltzmann constant           k                       1.380650 × 10^-23 J K^-1
                                                     = 8.617343 × 10^-5 eV K^-1
Fine structure constant      α = e²/4πε₀ħc           7.297353 × 10^-3
                                                     = 1/137.036000 (b)
Classical electron radius    r_e = αħ/m_e c          2.817940 × 10^-15 m
Bohr radius                  a_∞ = (1/α)ħ/m_e c      0.529177 × 10^-10 m
Thomson cross section        σ_T = 8πr_e²/3          0.665245 barn (c)
Gravitational constant       G_N                     6.67428 × 10^-11 m³ kg^-1 s^-2
                                                     = 6.70881 × 10^-39 ħc (GeV/c²)^-2
Planck mass                  √(ħc/G_N)               1.22089 × 10^19 GeV/c²
Planck length                √(ħG_N/c³)              1.61624 × 10^-35 m
Planck time                  √(ħG_N/c⁵)              5.39124 × 10^-44 s
Fermi coupling constant      G_F/(ħc)³               1.16637 × 10^-5 GeV^-2
W boson mass                 m_W                     80.398 GeV/c²
Z⁰ boson mass                m_Z                     91.1876 GeV/c²
Weak mixing angle            sin²θ_W (d)             0.2231
Strong coupling constant     α_s(m_Z)                0.1176

(a) fm = 10^-15 m.
(b) At Q² = 0. ~ 1/128 at Q² = m_W².
(c) barn = 10^-28 m².
(d) Defined by 1 − m_W²/m_Z².
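
Several entries of Table K.1 are not independent: the classical electron radius, the Bohr radius, the Thomson cross section and the Planck mass follow from ħc, m_e, α and G_N through the formulas given in the Symbol column. The short Python sketch below recomputes them as a consistency check. It is only an illustration: the function and variable names are hypothetical, the inputs are the rounded table values, and the last digit may differ from the table because of that rounding.

```python
import math

# Inputs copied from Table K.1 (rounded values, so results agree only to ~6 digits).
HBAR_C_MEV_FM = 197.326963   # conversion constant hbar*c in MeV fm
M_E_MEV = 0.510999           # electron mass in MeV/c^2
ALPHA = 7.297353e-3          # fine structure constant at Q^2 = 0
G_N_NATURAL = 6.70881e-39    # G_N in units of hbar*c (GeV/c^2)^-2

FM2_PER_BARN = 100.0         # 1 barn = 1e-28 m^2 = 100 fm^2


def classical_electron_radius_fm():
    """r_e = alpha * hbar / (m_e c), evaluated as alpha * (hbar c) / (m_e c^2)."""
    return ALPHA * HBAR_C_MEV_FM / M_E_MEV


def bohr_radius_fm():
    """a_inf = (1/alpha) * hbar / (m_e c)."""
    return HBAR_C_MEV_FM / (ALPHA * M_E_MEV)


def thomson_cross_section_barn():
    """sigma_T = 8 pi r_e^2 / 3, converted from fm^2 to barn."""
    r_e = classical_electron_radius_fm()
    return 8.0 * math.pi * r_e ** 2 / 3.0 / FM2_PER_BARN


def planck_mass_gev():
    """M_P = sqrt(hbar c / G_N); with G_N in units of hbar*c (GeV/c^2)^-2 this is G_N^(-1/2)."""
    return 1.0 / math.sqrt(G_N_NATURAL)


if __name__ == "__main__":
    print(f"r_e     = {classical_electron_radius_fm():.6f} fm")
    print(f"a_inf   = {bohr_radius_fm():.1f} fm")
    print(f"sigma_T = {thomson_cross_section_barn():.6f} barn")
    print(f"M_P     = {planck_mass_gev():.5e} GeV/c^2")
```

Working in MeV fm avoids SI conversions entirely: every length comes out in fm (10^-15 m) and only the cross section needs the fm² to barn factor.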
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- factor 545 

- force 4, 6, 540 

- independence 542 
- singlet 533, 544
- SU(3) 541

Compton scattering 176 

- path integral method 348, 353 
confinement 6, 18, 24, 521, 529, 533, 761 
conjugate 268 









connected diagrams 353 
connection 734-736, 738, 743-744 
conserved charge 96 
conserved isospin current 263 

continuity equation 95,99 

cooling 708 

Cooper pair 5, 19, 750, 767, 781 

correlation 429 

- function 300, 304-305 

-n-point 315 

-two-point 300,318-319 
cosmic microwave background radiation 799 
cosmological constant 799 
Cosmotron 443, 504 
Coulomb gauge 115,357,361,813 
Coulomb potential 



- mixed interference 645 

- mixing parameter ε, ε_S, ε_L 635, 644

- evaluation of 649 

- phase convention 645 

- models of 673 

- milli-strong 674 

- milliweak 675 

- superweak 675 
CPLEAR 622, 664 



- formalism 632 
632 






- of Dirac equation 71 



- formalism 47 

-gauge 115,119 

-vector 42 
covariant derivative
131, 147, 253, 732, 741-742, 744, 756
covariant differential operator, see covariant derivative 732
CP 

621 



- eigenstate 618 
632 



- operator 618 
- of 2π, 3π 619

- transformation 618 

- transformation property 632 
CP violation 617 

-eT 655 

- asymmetry Ai 640, 643 

- asymmetry As 644 

- decay parameter eo (denoted as e), e 
635-636, 640, 644-650, 659 

- decay parameter 62 . e f 646 

- decay parameter η₊₋, η₀₀ 640

- value of 642 
-direct 645,651 
-discovery of 630-631 

- experiments on 664 

- formalism 632 

- in the SM model 675 
-indirect 645,651-652 



- foundation of 662 
-theorem 253,258 

- transformation 258, 840 
CPT violation 656 

- decay parameter Bo, B2 661 

- decay parameter y 655,657 

- mixing parameter d 
635-639, 644, 656-662 

critical density 800 
critical energy 395, 397 
cross section 379 
crossing 140, 174-175

-symmetry 139,494 
Curie temperature 774-775 
current-current coupling 556 



741,751,757-758 

- average 740 

- local 740 

-of space-time 739,741,799 
-of the universe 799 

- total 740 

curvilinear coordinate 734, 742-743 
CVC hypothesis 567,584,592,615 

- experimental test of 587 
cyclotron 371 

- frequency 217, 371, 574 

- isochronous 372 









72 



D meson 535-536 
Dalitz decay 451 
Dalitz plot 478 

- for the decay 4f 

- of K meson 482 
- of ω 486



DAPHNE 622 

dark energy 19,122,798 

dark matter 19, 797 

de Broglie wave 25 

de Broglie wavelength 11 

decay amplitude 635 

decomposition 

- of product representations 
decuplet 510 

Dee 371 

deep inelastic scattering 679 

- by electron 687 



- by 



712 
10 data 717 

- QCD correction 725-726 
defect 777 

deficit angle 740, 750 
ΔI = 1/2 rule 591, 647

- violating parameter m 646
Δm 623, 626

- sign of 623, 629
ΔS = -ΔQ components 659
ΔS = ΔQ rule 589, 625
Δ function 120, 154
delta functional 357
Δ(1232) 457, 469
Δ_i function 120
Δ_F, see Feynman propagator
density effect 386, 400 
depletion layer 414 
derivative coupling 487 
DESY 539 
determinant 

- of the hermitian operator 291, 293 

- of the Klein-Gordon operator 315 
DGLAP evolution formula 728 
diamagnetism 

- perfect 767, 769 
dichromatic beam 708 
differential geometry 735-736, 742 
diffraction pattern 493, 682
diffraction scattering 627 
dimension 

- fifth 747 

- of irreducible representation 851, 856 
dimensional regularization 195, 825 
Dirac 

- condition for γ matrices 69

- equation 62

- equation, another derivation 811

- equation, Lorentz-covariant 69

- equation, plane wave solution 71, 817

- γ matrix 69, 817



direct product 47 

DIS, see deep inelastic scattering 

disconnected diagram 347 

domain 775 

double beta decay 87 

Drell-Yan process 520,721 

- scaling 723 
dressed 212 

drift chamber 412,415,417 
dual tensor 49 
duality 500 



e⁻e⁺ → μ⁻μ⁺ reaction 174
e-μ scattering 167

- path integral method 351

- polarized- 171

e-μ-τ puzzle, see generation puzzle
effective action 305 
effective range 490 

- approximation 490 
eigen mass in matter 628 
eightfold way 508 
Einstein equation 741 
Einstein-Podolsky-Rosen, see EPR 

paradox 663 



elastic scattering 

- cross section 491 
electric dipole moment 88 

-of the neutron 249 
electron 

- magnetic moment 75 

- self-energy 349-350, 830 
electron holography 751 
electron volt 364 
electrostatic spectrometer 574 
electroweak theory 770 
emittance 368 

emulsion 415,448 
energy-momentum tensor 96, 225 
-complex field 99 

- Dirac field 100 

- electromagnetic field 99 
energy vector 426 

entropy 307, 309 
ep elastic scattering 683 
EPR paradox 663 
equivalent photon 699 

- cross section of 701 







- energy of 701
- polarization of 699
error matrix 435, 438
η meson 454
Euclidean 736
- action 291
- space 291, 733
- time 303
Euler constant 196
Euler-Lagrange equation 90, 92, 223
exchange force
- isospin- 455
- spin 545-546
extra dimension 793, 796

f

Faddeev and Popov's gauge fixing method 355
Faddeev-Popov ghost 354, 359
family problem 606
Fermi
- coupling constant 240, 556
- coupling constant, value of 582
- golden rule 127
- interaction 556-557, 559
- theory 553, 555
- limitation of 610
- transition 559-561
Fermi-Dirac statistics 105, 113
Fermi-Yang model 462
fermion 104
- loop 205, 349
- mass 781
ferromagnetism 774
Feynman
- gauge 116
- parameter 194
- path integral 274, 277
- propagator 147, 151, 318
- propagator in the Coulomb gauge 815
- rules 186, 299
- x 722-723, 725
Feynman diagram
- connected- 353
- disconnected 347, 354
- external line 187
- internal line 187-188
- irreducible 212
- loop diagram 188-189
- reducible 212
- vertex 187
final state interaction 478
fine structure 65, 546
fine structure constant 23, 35, 215
Fitch-Cronin 630
flavor 18
- conservation 554, 579, 598
- eigenstate 789
- quark 764
- SU(3) 221, 509
Fock space 112
forbidden transition 558
force
- carrier 18-20
- color exchange 457, 489
- fundamental 20, 733
- isospin exchange 457, 487, 489
- long range 24
- range 23
- spin exchange 487, 489
- tensor 489
form factor 681
- electromagnetic 683
- examples of 681
- of K_l3 595
- of the nucleon 683, 686
formation zone 391, 393, 399
four-fermion interaction 555, 557, 559, 611
FP ghost 354
Fresnel integral 277
Friedman equation 799
Froissart bound 493
ft value 559
functional 279
- calculus 279
- derivative 280, 286, 304
- integral 279
- Taylor expansion 281
- vacuum 305
fundamental representation 755

g

g-2 193, 197, 214
g-factor 50, 65, 77
G operation 453
G parity 453
γ matrix
- trace 173
Gamow-Teller transition 240, 559, 561
gas electron multiplier, see GEM 413
gauge
- condition 93
- field 732
- fixing 116, 321, 356
- invariance 205, 321, 613, 750, 825
- chiral 587

-principle 731 
-sector 783 



- symmetry 114 

- theory 729 

- non-Abelian 754 

-global 731 

-local 252,732 

-2nd kind 732 

-1st kind 731 
gauging 732, 745, 747, 755 
Gaussian distribution 430 
Gaussian integral 277, 284, 314 
Gauss's law 796 
Geiger-Marsden 446 
Geiger mode 411

Gell-Mann matrix 506,542,761,844 
Gell-Mann-Okubo mass formula 

513-514,551 
GEM 413 

general relativity 733, 741, 744-745 
generating functional 300, 305, 315, 318-319 
- for QED 342, 345

- of interacting fields 320 

- of the connected Green's function 306 

- of the Dirac field 332 

- of the electromagnetic field 323 

- of the scalar field 312 

- Taylor expansion 319 
generation 18 

- how many 606 
-puzzle 605,786-788 

generator 754 
- of O(3) 40
- of SU(N) 262

- of the gauge transformation 252 

- of the group 805, 841-842 

- of the Lorentz group 53-55, 70 

- of the Poincare group 57 

- of the rotation 54, 264 

- of the transformation 224 

- of the translation 227 
geodesic line 733-734,736-737,743 
geometrical interpretation of gravity 733 
GIM mechanism 536, 598, 601 
Ginzburg-Landau parameter 770 
Ginzburg-Landau theory 767, 769, 781, 789 
Glashow-Weinberg-Salam theory 782 
global symmetry 252 

gluon 20,24,536,541 
:eof 720 
720 
Goldstone boson 777-778, 780 



Goldstone theorem 778 

goodness of fit 433 

Gordon equality 193-194, 684, 817 

grand unified theory 7, 221, 788-789, 791 

Grassmann number 324 

- complex conjugate 324 

- δ function 328

- derivative 325 

- Gaussian integral 328, 330 

- integral 326 
gravitino 795 
graviton 20, 747, 795 

Green's function 132, 151-152, 272, 297, 303 
-advanced 132,153 

- Feynman's 312 
-retarded 132,153 

group 803 

-Abelian 803 

- adjoint representation 842, 846 

- conjugate representation 843 

- connectedness 805, 842 

- direct product 843,847 

- fundamental representation 842, 846 

- generators 805 
-irreducible 512 

- irreducible representation 846 

- Lie group 804 

- nonabelian 803 

- O(3) 805

- O(N) 804 
-rank of the 843 
-reducible 512,843 

- representation 804 

- SL(2,C) 809
- structure constant 805
- structure constants of SU(3) 844

- subgroup 804 

- U(N) 804, 841 

GUT, see also grand unified theory 792 

- scale 792 

GWS theory, see Glashow-Weinberg-Salam 
theory 782 



hadron 4, 18, 167 
hadronic shower 401 
hadronization 532 
Hamiltonian formalism 90, 100 
heavy electron 579 
Heisenberg 

- equation of motion 100-101, 127 
-picture 101,127-128 

- uncertainty principle 69 



helicity 58,228 

-conservation 161,174-175 

- eigenstate 61, 172 
-of electron 569-570 
-of neutrino 576 

- operator 61 

- suppression 566 
Helmholtz free energy 306, 309 

- free particle 308 

- harmonic oscillator 308 
hierarchy problem 790, 792, 796 
Higgs 

-field 779 

- mechanism 6, 778-782, 788 

- particle 6, 19, 782, 788 
-sector 783 

Hilbert space 267 

hologram 752 

horn 706 

Hubble 801 

Huygens' principle 273 

hypercharge 509, 771 

- of ν_L, e_L, e_R 772

-of Higgs 782 

- of leptons 772 

- of quarks 772 

- operator 772 
hyperfine structure 64, 546
hypernucleus 483 
hyperspace 734, 738-739, 751 



I spin see Isospin 
ideal mixing 517 
ILC 377 

imaginary time 302 
impact parameter 445 
impulse approximation 693 

- conditions for 696 
in state 132, 334 

index of refraction 628 

inertial frame 743 

infinite momentum frame 696 

inflation 801 

inflation model 792 

infrared 

- compensation 199 

- divergence 184, 192, 194, 197, 200 
instanton 762 

interacting fields 131,147 
interaction picture 127 
interferogram 752 



internal degrees of freedom 17 
468 

ionization chamber 403, 411

ionization loss 381

ionization mode 410
IOO symmetry 505 
isoscalar target 714 
isospin 18 

- conservation 454 

- current 264, 584-585, 755 

- degree of freedom 457 

- multiplets 260 

- of antiparticles 265 

- operator 754 

- symmetry 260 



J/ψ 526, 535

- isospin and G-parity 527 

- mass and quantum number 527 
-width 533 

Jacobi identity 759 
jet 533 



K factor 724 






i 



Kabir asymmetry 655 
Kaluza-Klein theory 747, 795 
kaon decay 589 

- time development of 638 



- 



; 



K_l3 decay 592

- form factor 595 
Klein-Gordon equation 25, 32 
Klein-Nishina formula 181 
KLOE 664 
klystron 371 

KM matrix 597, 600-601, 617, 676, 791 
Kobayashi-Maskawa matrix, see KM matrix 

597,675 
KTeV 666 
Kurie plot 573 



LAB frame, see laboratory frame 
laboratory frame 136 
Lagrangian 

- complex field 93 

- density 92 
-Dirac field 93 

- electromagnetic field 93 

- formalism 89, 223 



-real field 93 

- SU(2) 760 

- SU(2) × U(1) 111
Lamb shift 213 









Landau distribution 385 

Lande g-factor 217 

Larmor frequency 217 

lattice QCD 761 

law of inertia 733,744 

law of large numbers 428 

leading log approximation 762-763 

least squares method 438 

LEP 369, 374 

lepton 18 

-flavor 579,605 

- number 

- number a 
Levi-Civita tensor 47, 819 
LHC 364, 368, 374 

Lie algebra 804, 841 

- O(3) 40

- Poincare group 57 
Lie group 804 

- compact 805 

- noncompact 805 
light guide 405 

limited streamer mode 411 

linac 370 

linear collider 377 

linear operator 290 

Livingston chart 11 

LLA, see leading log approximation 762 

long-range force 5 

longitudinal gluon 764 

Lorentz 

-boost 42,53,70 

- condition 322 

- contraction 694 
-gauge 115,357,359 

- generators for spin 1/2 particle 70 
-invariance 37,193,221,227 
-operator 53,70 

- operator for spin 1/2 particle 809 

- scalars 45 
-tensor 47 
-vector 41,44 



Lorentz transformation 41, 

- homogeneous 41 

- inhomogeneous 56 

- orthochronous 46, 12( 
-proper 46,119-120 

LPM effect 399-400 
LS force 546-547 
luminosity 367 

427 



MAC-E 574-575 
magnetic flux 750, 767 
-line 753 

- quantization 749-750, 753 

- tube 767 
magnetic moment 

- of the baryon 525 
-of the proton 524 

magnetic spectrometer 574 

magnon 778 

Majorana 

-phases 791 
Majorana particle 84 
Maki-Nakagawa-Sakata matrix 791 
Mandelstam variable 138, 140, 171, 366 

-formula 547,549,551 

-of the gauge boson 779 

-ofW, Z 783 
mass absorption coefficient 396 
mass hierarchy 786-788 

619-620, 625-626, 628, 633, 636, 859 
-eigenvalue 621 

- symmetry constraint 634 
mass operator 619 

matter field 27 

matter particle 18 

maximum likelihood method 433 

Meissner effect 5, 750, 753, 767-770, 781 

meson 18 

-mixing 516 

- octet 846 
meson exchange 

- scalar 164 
metric 734, 736 

metric tensor 41-42, 734, 736, 747 
Michel parameter 583, 604 
732 
loss 385 



- matrix element 134, 169 

- phase space 145 



-space 41,291,303,734,744 

- time 303 
mirror operation 236 
missing energy 427 
Moliere unit 399 
momentum transfer 680 
monopole 792 

Mott scattering 157, 184, 192, 213 

- path integral method 345 
multiple scattering 389 

multiwire proportional chamber 411, 415 
muon 448 

- decay 579, 584 

MWPC, see multiwire proportional chamber 



Noether's theorem 95,222,225,731 
non-Abelian 754 

- field strength 758 

-gauge theory 100,195,753-760 
nonet 507, 510, 522 
normal distribution 430 
normal ordered 149 
November Revolution 525 
ν-q scattering 715
nucleon 



n photon emission 204 

n-point function 318 

N product, see normal ordered 

NA48 666 

Nambu 775 

Nambu-Goldstone boson 778 

narrow band neutrino beam 707 

natural unit 33 

naturalness 792-793 

NBB, see narrow band neutrino beam 

negative energy 26, 64 

negative pressure 126, 801 

negative probability 26 

neutral current 598, 773 

- flavor-changing 601 
neutralino 795, 798 
neutrino 444, 448, 571 

-atmospheric 790 

- beams 705 

- detector 709 

- Dirac neutrino 87 

- experiments how to 705 

- factory 708 

- helicity 566 

- helicity measurement 576 

- Majorana neutrino 86-87, 791 

- mass measurement 572,576 

- oscillation 18, 35, 62, 554, 606, 789 

- reactor neutrino 571 

- right-handed 86, 789 

- second neutrino 578 

- solar 790 

- Weyl neutrino 87 
neutron 447 

- bottle 250 
Nishijima-Gell-Mann's law 

502, 504, 509, 590, 771, 782 



number operator 






O(3) 37, 40
octet 507, 509-510 
- baryon 509 



on shell 150, 207 

order parameter 769, 775 

out state 132, 334 



P, see also parity 559 
- violation 559, 562 

560,564,567 



parallel transport 

734-736, 738, 740, 742, 756-757 
parity 77 

- intrinsic 233 
-ofjr + 449 
-ofjr 450 

- tensor method 451 

- of the photon 234 

- operation 77 
-transformation 42,46,62,233,8 

- two-body system 235 
parity violation 239,559-560,562 

- in the weak interaction 238 
partial wave 

- analysis 462, 472 

- cross section 466 

- expansion 229 

- parity 464, 470 

- S-matrix unitarity 464 

- scattering matrix 468 
particle democracy 679 



particle identification 419 
partition function 306 
parton 680 

- distribution function 697, 725 

- kinematics 694 

- luminosity 723 

- model 693 

- in hadron collisions 721 
-variable 695-696 

passive transformation 38 
path integral 267,311 

- determinant 315 

- expression 276 

- harmonic oscillator 285 

- matrix method 287 
-normalization 315 

- operator method 290 

- scattering matrix 294 

- calculation of 297 
Pauli 444 

- matrix 61 
Pauli-Dirac 

- representation 62, 818 
Pauli-Lubanski operator 57 
Pauli-Villars regularization 195 
penetration length 770 
penguin diagram 677 

PEP 529 

phase convention 622, 640 

φ factory 622

photomultiplier 405 

photon 

-longitudinal 117,699 

-scalar 117,700 

- self-energy 349, 825 

- soft 199 









phase shift 464

phase stability 376

phase transformation 252, 738, 749, 754



π meson 449
π⁰ → 2γ 520
pion cloud 584 
pion decay 564 

- constant 520, 565 
Planck 

- energy 23 

- scale 4, 795 
plasma frequency 123 



plastic scintillator 405 
Poincare transformation 746 
Poincare group 56 
Poisson distribution 204, 431
polarization 

- in scattering 465 
-ofnucleon 465 

-sum 117-119,178-179,822 
Pomeranchuk 

- theorem 492, 498 

- trajectory 498 
Pomeron 498 
positron 448 
positronium 527, 529 
principle of detailed balance 247 
principle of equivalence 743 
Proca equation 93 

propagator, see also Feynman propagator 
127,272 

- free particle 282 
proper time 43 
proportional mode 411 
proton decay 792 

proton synchrotron, see PS 370 
PS 443 



Q- value 526 

QCD 7, 20, 760-764 

- background 368-369 

- coupling constant 529, 763-764 

- Lagrangian 761 

- lattice 769 

- potential 528 
QED 4,167 

- path integral method 340 
-test 213 

quantization 

- of fermion 104
- of fields 89, 105-106, 111
- of harmonic oscillator 102
- of the Dirac field 112

- of the electromagnetic field 114 

- in the Coulomb gauge 814 
quantum anomaly 791, 795 
quantum chromodynamics, see QCD 
quantum electrodynamics, see QED 

- counting 605 

- electric charge 18 

- electric charge by DIS data 718
-mass 19,550 



-model 501,509,787,841 
- potentials of 24, 527 



quintessence 



R-parity 795 
R-value 520-521 
radiation length 390, 395, 401 
radiative correction 191,527 



-pseudo- 53 
reduction formula 
-Dirac fields 337 

- electromagnetic fields 337 
-LSZ 320,333,340 
-scalar fields 333 

regeneration 626 

- pole 495 

- trajectory 494, 527, 529 
Reggeon 499 
relativistic 

- cross section 141 
-kinematics 136 

- normalization 142 

- wave equation 25 
renormalizability 784 
renormalizable theory 611 
renormalization 191,208 

-constant 133 

- group equation 765 

- mass- 208 

- scattering amplitude 211 
renormalized coupling constant 2 



ρ meson 475

Ricci's curvature tensor 99 

RICH 407 

Riemann's tensor 741 

Rosenbluth formula 685



- 2-body- 227 

- πN scattering 465
matrix 174,229,833,835 



- orthogonal condition 229 

- operator 229 
rotational symmetry 737 
running coupling constant 

204, 208, 762-765, 792 
Rutherford 446 

- formula 446 

- model 3 

- scattering 135, 389, 680 

- scattering formula 136 



S-matrix, see also scattering matrix
- in the path integral 297



scalar product 44 
scaling 690, 723 

- Bjorken 691 
-limit 691,698,713 

- variables 696 
scattering length 490 
scattering matrix 127 

- path integral method 351 

- Iif 231 
Schrodinger 

- equation 271, 278 
-picture 101,128 

scintillation 
screening 764 









sea of negative-energy 66 
sea quark 716 
second quantization 112 
see-saw mechanism 791 
short-range force 5 
singlet 507 
SLC 369,371,377 
solid state detector 413 
space-time translation 223 
spectator 368 
spectroscopy 443 
- hadron 457 
spherical harmonic function 235, 244, 833 
spin 



-ofjr 

-of.T 1 






- tensor method 451 
- operator 63 
spin-orbit coupling 551 
spin-statistics 119 









te of 807 



- representation 803 
spontaneous symmetry breaking 

5,750,774-778,792 
SPS 374 
spurion 592 
SSD 413,415 
standard deviation 429 
standard model 7, 731 

- beyond the 789-799 
static approximation 488 
statistical mechanics 306 
strange particle 501 

- decay of 589 
strangeness 501-502 

- eigenstate 618 

- oscillation 622 
string model 500 

- of hadron 532-533 

strong CP problem 762, 798 
754, 842 
structure function 40, 688, 699-700

- definition of 689 

- isoscalar target 714 
-v 712 

-v 712 

-of neutrino 712 

- of point particle 689 

- physical interpretation of 699 

- scaling limit 698 
SU(2) 505, 615, 805

- symmetry 501
- tensor 510

SU(2) × U(1) 783

SU(2) × U(1) gauge theory 770, 774

SU(3) 501, 505, 841, 844, 857

- color 510
- flavor 510

- rank of 506
- singlet 511, 513

506



- symmetry 516
- tensor 510

SU(3)_color × SU(2)_L × U(1)_Y 731
SU(4) 537, 857
SU(5) 792, 857
SU(6) 522, 857

- wave function 523
SU(6) × O(3) 525
SU(N) 841



Sudakov 

- double logarithm 198 

- form factor 204 
sum over history 274 
sum rule 719 
superbeam 708 
superconductivity 781 
superconductor 

- type I 767, 770 
-type II 767,770 

supercurrent 750, 769 
supergravity 747, 793 
superstring 747 
superstring model 795 
supersymmetry 747, 792 
superweak model 675 
superweak phase 637, 647, 650 

- parameter e 649 
SUSY, see supersymmetry 792 
SusyGUT 793 

symmetry 

- continuous 222 

- discrete 233 

- family 606 

- horizontal 606, 791 
-internal 221,251-252 
-list of 222 

- spacetime symmetry 221 
synchrotron 373-374 

- oscillation 376 

- radiation 369, 376 



T, see also time reversal 839 
621 
560,595,632 

- transformation 9, 839 

- violation 

- asymmetry At 655 
-evidence of 653 

- parameter €j 655 
T product 134,151 

τ-θ puzzle 483, 562
τ lepton

- coupling type of 604 

- decay 601 

- mass and spin of 602 

- production of 601 
technicolor 793 
temporal gauge 357, 361 
tensor analysis 851 
tensor method 485 

test of T and CPT in 



Tevatron 366 

Thomas precession 217-218 
Thomson model 445 
Thomson scattering 381 
Thomson, J.J. 444-445 
three-jet 721 

time evolution operator 130, 133, 272 
time projection chamber 420 
time reversal 46, 240 
- of S-matrix 242

- of scalar and vector fields 244 

- of the Dirac field 246 

- operation 241 

- quantum mechanics 240 

- test 247 



- Wigner's 241
TOF 420
Tomonaga-Schwinger-Feynman 193, 213
top condensate 19
total reflection 46
TPC, see time projection chamber
traces of the γ matrices 818
transition operator 134
transition radiation 387, 390, 392
- counter 408
translational operator 268
translational symmetry 737
tree approximation 191
triangular diagram 520
Troizk-Mainz experiment 573
two point function 152



240, 564, 566-567, 579, 583, 595, 608

V particles 502
vacuum
- energy 121-122, 348, 776, 801
- fluctuation 69, 121, 123
- has negative pressure 126
- polarization 213, 762, 764
- properties of 126
vacuum expectation value 777, 781, 784
vacuum to vacuum transition 349
valence quark 716
variance 429
vector
- axial 233
- polar 233
Veneziano model 500, 795
vertex 148, 151, 187
- correction 191, 213
- diagram 198
- function 127, 147
virtual particle 150



w

W
- longitudinal polarization 613
Ward-Takahashi identity 210
Watson-Sommerfeld transformation 494
wavelength shifter 404
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